Title
Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking.
Abstract
With the rapid development of clinical and laboratory medicine, the field of bioinformatics boasts of extensive clinical records and research literature. Retrieving effective information from this huge data has become a challenging task. Hence, Intelligent text summarization, which enables users to find and understand relevant source texts more quickly and effortlessly, becomes a very significant and valuable field of research. In this study, we propose an improved TextRank algorithm with weight calculation based on sentence graph to solve this problem. For the experimental dataset obtained from Pubmed, we represent terms as vectors by using Skip-gram model. We design three methods which utilize word embeddings to calculate weights between sentences. Then we build an undirected graph with sentences as nodes. At last, we use the improved TextRank algorithm to calculate the importance of sentences and further generated summarizations base on its ranking. The experimental results and analysis on the datasets demonstrate the effectiveness of the proposed model.
Year
DOI
Venue
2019
10.3233/JIFS-179315
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS
Keywords
Field
DocType
Intelligent,text summarization,graph-based ranking,similarity calculation
Multi-document summarization,Graph,Ranking,Artificial intelligence,Natural language processing,Machine learning,Mathematics
Journal
Volume
Issue
ISSN
37
4.0
1064-1246
Citations 
PageRank 
References 
0
0.34
0
Authors
6
Name
Order
Citations
PageRank
Chen Shen100.68
Hongfei Lin2768122.52
Huihui Hao300.68
Zhihao Yang427036.04
Jian Wang511218.98
Shao-Wu Zhang618934.00