Title | ||
---|---|---|
Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation |
Abstract | ||
---|---|---|
Information of interest to users is often distributed over a set of documents. Users can specify their request for information as a query/topic -- a set of one or more sentences or questions. Producing a good summary of the relevant information relies on understanding the query and linking it with the associated set of documents. To "understand" the query we expand it using encyclopedic knowledge in Wikipedia. The expanded query is linked with its associated documents through spreading activation in a graph that represents words and their grammatical connections in these documents. The topic expanded words and activated nodes in the graph are used to produce an extractive summary. The method proposed is tested on the DUC summarization data. The system implemented ranks high compared to the participating systems in the DUC competitions, confirming our hypothesis that encyclopedic knowledge is a useful addition to a summarization system. |
Year | Venue | Keywords |
---|---|---|
2008 | EMNLP | summarization system,extractive summary,expanded query,encyclopedic knowledge,activated node,good summary,associated document,duc summarization data,duc competition,relevant information,topic-driven multi-document summarization,multi document summarization,spreading activation |
Field | DocType | Volume |
Automatic summarization,Graph,Multi-document summarization,Information retrieval,Computer science,Request for information,Artificial intelligence,Natural language processing | Conference | D08-1 |
Citations | PageRank | References |
56 | 2.23 | 16 |
Authors | ||
1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Vivi Nastase | 1 | 523 | 41.30 |