Title
Cross-Lingual Text Categorization.
Abstract
This article deals with the problem of Cross-Lingual Text Categorization (CLTC), which arises when documents in different languages must be classified according to the same classification tree. We describe practical and cost-effective solutions for automatic Cross-Lingual Text Categorization, both in case a sufficient number of training examples is available for each new language and in the case that for some language no training examples are available. Experimental results of the bi-lingual classification of the ILO corpus (with documents in English and Spanish) are obtained using bi-lingual training, terminology translation and profile-based translation.
Year
DOI
Venue
2003
10.1007/b11967
Lecture Notes in Computer Science
Field
DocType
Volume
CLTC,Categorization,Cross lingual,Query expansion,Computer science,Natural language processing,Language identification,Artificial intelligence,Constructed language,Text categorization,Decision tree learning
Conference
2769
ISSN
Citations 
PageRank 
0302-9743
41
2.67
References 
Authors
14
3
Name
Order
Citations
PageRank
Núria Bel1544.57
Cornelis H. A. Koster228157.53
Marta Villegas37213.44