TY - GEN
T1 - A Hybrid Thresholding Strategy combining RCut and PCut for Multi-label Classification
AU - Ghawi, Raji
AU - Pfeffer, Juergen
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/11/29
Y1 - 2021/11/29
N2 - Multi-label classification is a variant of the classification problem where multiple labels may be assigned to each instance. Usually multi-label classification algorithms output a numerical score for each label, indicative of their relevance to a query instance. However, in many applications the desired output is a bipartition of the labels into relevant and irrelevant w.r.t the query instance. Bipartitions can be obtained from scores using various thresholding strategies, such as PCut strategy which selects relevant instances per label, and RCut strategy which selects relevant labels per instance. However, we suggest that a combination of both strategies would provide better classification performance. In this paper, we propose a fuzzy-based approach to combine PCut and RCut strategies, by converting the crisp relevance into fuzzy one, merging them linearly, and defuzzifying again. Our experiments shows that our hybrid approach indeed outperforms both strategies.
AB - Multi-label classification is a variant of the classification problem where multiple labels may be assigned to each instance. Usually multi-label classification algorithms output a numerical score for each label, indicative of their relevance to a query instance. However, in many applications the desired output is a bipartition of the labels into relevant and irrelevant w.r.t the query instance. Bipartitions can be obtained from scores using various thresholding strategies, such as PCut strategy which selects relevant instances per label, and RCut strategy which selects relevant labels per instance. However, we suggest that a combination of both strategies would provide better classification performance. In this paper, we propose a fuzzy-based approach to combine PCut and RCut strategies, by converting the crisp relevance into fuzzy one, merging them linearly, and defuzzifying again. Our experiments shows that our hybrid approach indeed outperforms both strategies.
KW - fuzzy-logic
KW - multilabel classification
KW - thresholding strategy
UR - http://www.scopus.com/inward/record.url?scp=85122622959&partnerID=8YFLogxK
U2 - 10.1145/3487664.3487702
DO - 10.1145/3487664.3487702
M3 - Conference contribution
AN - SCOPUS:85122622959
T3 - ACM International Conference Proceeding Series
SP - 278
EP - 287
BT - 23rd International Conference on Information Integration and Web Intelligence, iiWAS 2021 - Proceedings
A2 - Pardede, Eric
A2 - Santiago, Maria-Indrawan
A2 - Haghighi, Pari Delir
A2 - Steinbauer, Matthias
A2 - Khalil, Ismail
A2 - Kotsis, Gabriele
PB - Association for Computing Machinery
T2 - 23rd International Conference on Information Integration and Web Intelligence, iiWAS 2021
Y2 - 29 November 2021 through 1 December 2021
ER -