Title
Centroid prior topic model for multi-label classification
Abstract
Supervised topic models such as labeled latent Dirichlet allocation (L-LDA) have attracted increased attention for multi-label classification. However, they lack considerations of the label frequency of the word (i.e., the number of labels containing the word), which is crucial for classification. To address this problem, we investigate the L-LDA model and then propose an extension, namely centroid prior topic model (CTPM). Class-feature-centroid (CFC) suggests a discriminative label-word vector that takes the label frequency of the word into account. CPTM uses this CFC vector as prior for label-word distributions. Extensive experiments on the Yahoo! dataset have been conducted to evaluate our algorithm. The experimental results demonstrate that CPTM outperforms the existing multi-label classification algorithms on AUC, Macro-F1 and Micro-F1.
Year
DOI
Venue
2015
10.1016/j.patrec.2015.04.012
Pattern Recognition Letters
Keywords
Field
DocType
Multi-label classification,Topic model,Labeled latent Dirichlet allocation,Class-feature-centroid
Latent Dirichlet allocation,Pattern recognition,Multi-label classification,Artificial intelligence,Topic model,Statistical classification,Discriminative model,Centroid,Machine learning,Mathematics
Journal
Volume
Issue
ISSN
62
C
0167-8655
Citations 
PageRank 
References 
1
0.35
26
Authors
3
Name
Order
Citations
PageRank
Ximing Li14413.97
Jihong OuYang29415.66
Xiaotang Zhou3194.08