Iterative Refining of Category Profiles for Nearest Centroid Cross-Domain Text Classification. - Citegraph

Paper Info

Title
Iterative Refining of Category Profiles for Nearest Centroid Cross-Domain Text Classification.

Abstract
In cross-domain text classification, topic labels for documents of a target domain are predicted by leveraging knowledge of labeled documents of a source domain, having equal or similar topics with possibly different words. Existing methods either adapt documents of the source domain to the target or represent both domains in a common space. These methods are mostly based on advanced statistical techniques and often require tuning of parameters in order to obtain optimal performances. We propose a more straightforward approach based on nearest centroid classification: profiles of topic categories are extracted from the source domain and are then adapted by iterative refining steps using most similar documents in the target domain. Experiments on common benchmark datasets show that this approach, despite its simplicity, obtains accuracy measures better or comparable to other methods, obtained with fixed empirical values for its few parameters.

Year	DOI	Venue
2014	10.1007/978-3-319-25840-9_4	Communications in Computer and Information Science
Field	DocType	Volume
Data mining,Pattern recognition,Computer science,Artificial intelligence,Centroid,Refining (metallurgy)	Conference	553
ISSN	Citations	PageRank
1865-0929	4	0.38
References	Authors
20	4

Authors (4 rows)

Cited by (4 rows)

References (20 rows)

Name	Order	Citations	PageRank
Giacomo Domeniconi	1	6	1.07
G. Moro	2	192	16.25
Roberto Pasolini	3	8	1.77
Claudio Sartori	4	23	3.94

1