Title
Entity set expansion using topic information
Abstract
This paper proposes three modules based on latent topics of documents for alleviating "semantic drift" in bootstrapping entity set expansion. These new modules are added to a discriminative bootstrapping algorithm to realize topic feature generation, negative example selection and entity candidate pruning. In this study, we model latent topics with LDA (Latent Dirichlet Allocation) in an unsupervised way. Experiments show that the accuracy of the extracted entities is improved by 6.7 to 28.2% depending on the domain.
Year
Venue
Keywords
2011
ACL (Short Papers)
bootstrapping entity set expansion,topic information,topic feature generation,model latent topic,latent topic,latent dirichlet allocation,new module,semantic drift,entity candidate pruning,discriminative bootstrapping algorithm,negative example selection
Field
DocType
Volume
Latent Dirichlet allocation,Computer science,Bootstrapping,Artificial intelligence,Set expansion,Natural language processing,Probabilistic latent semantic analysis,Feature generation,Discriminative model,Machine learning,Semantic change
Conference
P11-2
Citations 
PageRank 
References 
11
0.58
15
Authors
4
Name
Order
Citations
PageRank
Kugatsu Sadamitsu1427.40
Kuniko Saito2757.12
Kenji Imamura3425.60
Gen-ichiro Kikui430533.43