Rare category exploration - Citegraph

Paper Info

Title
Rare category exploration

Abstract
Rare category discovery aims at identifying unlabeled data examples of rare categories in a given data set. The existing approaches to rare category discovery often need a certain number of labeled data examples as the training set, which are usually difficult and expensive to acquire in practice. To save the cost however, if these methods only use a small training set, their accuracy may not be satisfactory for real applications. In this paper, for the first time, we propose the concept of rare category exploration, aiming to discover all data examples of a rare category from a seed (which is a labeled data example of this rare category) instead of from a training set. To this end, we present an approach known as the FRANK algorithm which transforms rare category exploration to local community detection from a seed in a kNN (k-nearest neighbors) graph with an automatically selected k value. Extensive experimental results on real data sets verify the effectiveness and efficiency of our FRANK algorithm.

Year	DOI	Venue
2014	10.1016/j.eswa.2013.12.039	Expert Syst. Appl.
Keywords	Field	DocType
real application,rare category exploration,training set,rare category discovery,rare category,frank algorithm,data example,unlabeled data example,small training set	Training set,Graph,Data mining,Data set,Computer science,Artificial intelligence,Labeled data,Machine learning	Journal
Volume	Issue	ISSN
41	9	0957-4174
Citations	PageRank	References
8	0.54	25
Authors
5

Authors (5 rows)

Cited by (8 rows)

References (25 rows)

Name	Order	Citations	PageRank
Hao Huang	1	89	7.77
Kevin Chiew	2	116	11.06
Yunjun Gao	3	862	89.71
Qinming He	4	371	41.53
Qing Li	5	3222	433.87

1