Title
MCut: a thresholding strategy for multi-label classification
Abstract
The multi-label classification is a frequent task in machine learning notably in text categorization. When binary classifiers are not suited, an alternative consists in using a multiclass classifier that provides for each document a score per category and then in applying a thresholding strategy in order to select the set of categories which must be assigned to the document. The common thresholding strategies, such as RCut, PCut and SCut methods, need a training step to determine the value of the threshold. To overcome this limit, we propose a new strategy, called MCut which automatically estimates a value for the threshold. This method does not have to be trained and does not need any parametrization. Experiments performed on two textual corpora, XML Mining 2009 and RCV1 collections, show that the MCut strategy results are on par with the state of the art but MCut is easy to implement and parameter free.
Year
DOI
Venue
2012
10.1007/978-3-642-34156-4_17
IDA
Keywords
Field
DocType
binary classifier,xml mining,rcv1 collection,thresholding strategy,new strategy,common thresholding strategy,mcut strategy result,multi-label classification,scut method,frequent task
Pattern recognition,XML,Computer science,Support vector machine,Multi-label classification,Artificial intelligence,Thresholding,Text categorization,Classifier (linguistics),Machine learning,Binary number
Conference
Citations 
PageRank 
References 
3
0.40
22
Authors
3
Name
Order
Citations
PageRank
Christine Largeron114830.40
Christophe Moulin2505.01
Mathias Géry313737.23