Title
Topic Discovery for Biomedical Corpus Using MeSH Embeddings
Abstract
Discovering latent topics from biomedical documents has become a pivotal task in many biomedical text mining applications. Medical Subject Headings (MeSH) terms, which are curated by human experts, provide highly precise keyword representations for biomedical documents. However, the performance of conventional topic models on MeSH documents is usually unsatisfying due to the limited length of individual MeSH documents. In this paper, we propose a novel topic model for MeSH documents using MeSH embeddings. The proposed topic model is able to overcome the lack of context information problem in MeSH documents by 1) exploiting the rich term-level co-occurrence patterns instead of the sparse document-level co-occurrence patterns, and 2) incorporating additional MeSH semantics in MeSH embeddings learned from a large external biomedical knowledge base. Experimental result on a real-world biomedical dataset shows the efficacy of the proposed model in discovering coherent topics from MeSH documents.
Year
DOI
Venue
2019
10.1109/BHI.2019.8834559
2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)
Keywords
Field
DocType
Biomedical topic discovery,MeSH embeddings
Information retrieval,Computer science,Biomedical text mining,Knowledge base,Topic model,Semantics
Conference
ISSN
ISBN
Citations 
2641-3590
978-1-7281-0849-0
0
PageRank 
References 
Authors
0.34
7
4
Name
Order
Citations
PageRank
Guangxu Xun110911.89
Kishlay Jha2497.83
Ye Yuan392.31
Aidong Zhang42970405.63