Abstract | ||
---|---|---|
Topic models employ the Bag-of-Words (BOW) representation, which break terms into constituent words and treat words as surface strings without assuming predefined knowledge about word meaning. In this paper, we propose the Semantic Concept Latent Dirichlet Allocation (SCLDA) and Semantic Concept Hierarchical Dirichlet Process (SCHDP) based approaches by representing text as meaningful concepts rather than words, using a new model known as Bag-of-Concepts (BOC). We propose new algorithms of applying SCLDA and SCHDP into the Concept Chain Queries (CCQ) problem. The algorithms are focused on discovering new semantic relationships between two concepts across documents where relationships found reveal semantic paths linking two concepts across multiple text units. The experiments demonstrate the search quality has been greatly improved, compared with using other LDA or HDP based approaches. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/ICMLA.2016.0026 | 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) |
Keywords | Field | DocType |
Knowledge Discovery,Topic model,Latent Dirichlet Allocation,Hierarchical Dirichlet Process,Bag-of-Concepts | Semantic similarity,Hierarchical Dirichlet process,Latent Dirichlet allocation,Information retrieval,Semantic search,Computer science,Knowledge extraction,Artificial intelligence,Natural language processing,Topic model,Semantic computing,Machine learning | Conference |
ISBN | Citations | PageRank |
978-1-5090-6168-6 | 0 | 0.34 |
References | Authors | |
13 | 2 |