Title
Document summarization using conditional random fields
Abstract
Many methods, including supervised and unsupervised algorithms, have been developed for extractive document summarization. Most supervised methods consider the summarization task as a two-class classification problem and classify each sentence individually without leveraging the relationship among sentences. The unsupervised methods use heuristic rules to select the most informative sentences into a summary directly, which are hard to generalize. In this paper, we present a Conditional Random Fields (CRF) based framework to keep the merits of the above two kinds of approaches while avoiding their disadvantages. What is more, the proposed framework can take the outcomes of previous methods as features and seamlessly integrate them. The key idea of our approach is to treat the summarization task as a sequence labeling problem. In this view, each document is a sequence of sentences and the summarization procedure labels the sentences by 1 and 0. The label of a sentence depends on the assignment of labels of others. We compared our proposed approach with eight existing methods on an open benchmark data set. The results show that our approach can improve the performance by more than 7.1% and 12.1% over the best supervised baseline and unsupervised baseline respectively in terms of two popular metrics F1 and ROUGE-2. Detailed analysis of the improvement is presented as well.
Year
DOI
Venue
2007
null
IJCAI
Keywords
Field
DocType
conditional random field,supervised method,extractive document summarization,unsupervised algorithm,summarization task,unsupervised method,informative sentence,supervised baseline,summarization procedure,unsupervised baseline,machine learning
Conditional random field,Automatic summarization,Heuristic,Sequence labeling,Computer science,Document summarization,Artificial intelligence,Sentence,Machine learning
Conference
Volume
Issue
ISSN
null
null
null
Citations 
PageRank 
References 
156
4.88
24
Authors
5
Search Limit
100156
Name
Order
Citations
PageRank
Dou Shen1122459.46
Jian-Tao Sun2162974.03
Hua Li357925.22
Qiang Yang417039875.69
Zheng Chen55019256.89