Abstract | ||
---|---|---|
Relieving users of the overwhelming volume of news and letting them quickly and comprehensively grasp what is happening in the world each day, and how, is a necessary but challenging task. In this article, we develop a novel approach to multimedia news summarization for Internet search results, which uncovers the underlying topics among query-related news and threads the news events within each topic to generate a brief query-related overview. First, the hierarchical latent Dirichlet allocation (hLDA) model is introduced to discover the hierarchical topic structure of the query-related news documents, and a new approach based on weighted aggregation and max pooling is proposed to identify one representative news article for each topic. One representative image is also selected to visualize each topic as a complement to the text. Given the representative documents selected for each topic, a time-bias maximum spanning tree (MST) algorithm is proposed to thread them into a coherent and compact summary of their parent topic. Finally, we design a user-friendly interface that presents users with the hierarchical summary of the news they requested. Extensive experiments on a large-scale news dataset collected from multiple news Web sites demonstrate the encouraging performance of the proposed solution for news summarization in news retrieval. |
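
The abstract names a time-bias maximum spanning tree for threading representative documents but does not give its formulation. The following is only a minimal sketch of the general idea, not the authors' method. It assumes that each pairwise similarity is discounted by an exponential decay in the time gap between two documents, and that Prim's algorithm then keeps the heaviest edges. The function names, the `decay` parameter, and the exponential weighting are all hypothetical choices made for illustration.

```python
import math


def time_biased_weight(sim, t_a, t_b, decay=0.1):
    """Hypothetical edge weight: similarity discounted by the time gap.

    The exponential decay is an assumption for illustration; the abstract
    does not specify how time enters the weighting.
    """
    return sim * math.exp(-decay * abs(t_a - t_b))


def maximum_spanning_tree(weights):
    """Prim's algorithm on a dense symmetric matrix, keeping heaviest edges.

    Returns a list of (parent, child, weight) tuples rooted at node 0.
    """
    n = len(weights)
    if n == 0:
        return []
    in_tree = [False] * n
    in_tree[0] = True
    # best[j] holds the heaviest known edge linking node j to the tree.
    best = [(weights[0][j], 0) for j in range(n)]
    edges = []
    for _ in range(n - 1):
        # Pick the node outside the tree with the heaviest connecting edge.
        j = max((k for k in range(n) if not in_tree[k]),
                key=lambda k: best[k][0])
        w, parent = best[j]
        edges.append((parent, j, w))
        in_tree[j] = True
        # The newly added node may offer heavier edges to remaining nodes.
        for k in range(n):
            if not in_tree[k] and weights[j][k] > best[k][0]:
                best[k] = (weights[j][k], j)
    return edges


def thread_documents(similarity, times, decay=0.1):
    """Build time-discounted weights, then extract the spanning tree."""
    n = len(times)
    weights = [
        [0.0 if i == j
         else time_biased_weight(similarity[i][j], times[i], times[j], decay)
         for j in range(n)]
        for i in range(n)
    ]
    return maximum_spanning_tree(weights)


# Toy data (made up): three documents with publication days 0, 10 and 11.
similarity = [
    [1.0, 0.6, 0.9],
    [0.6, 1.0, 0.5],
    [0.9, 0.5, 1.0],
]
times = [0, 10, 11]
thread = thread_documents(similarity, times, decay=0.1)
```

On this toy input, setting `decay=0` attaches documents 1 and 2 directly to document 0, whereas `decay=0.1` links document 1 to document 2 instead, because the two are published a day apart. This is the kind of effect a time bias is meant to have on the resulting thread.
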
Year | DOI | Venue |
---|---|---|
2016 | 10.1145/2822907 | ACM TIST |
Keywords | Field | DocType |
Design,Algorithms,Performance,Human Factors,News summarization,topic structure,multimodal,hierarchical latent Dirichlet allocation,maximum spanning tree | Topic structure,Data mining,Multi-document summarization,Automatic summarization,Latent Dirichlet allocation,World Wide Web,Information retrieval,Computer science,Pooling,Spanning tree,Multimedia,The Internet | Journal |
Volume | Issue | ISSN |
7 | 3 | 2157-6904 |
Citations | PageRank | References |
6 | 0.39 | 22 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Zechao Li | 1 | 1375 | 57.59 |
Jinhui Tang | 2 | 5180 | 212.18 |
Xueming Wang | 3 | 13 | 1.60 |
Jing Liu | 4 | 1781 | 88.09 |
Hanqing Lu | 5 | 4620 | 291.38 |