Title
Multi-Task Learning for Abstractive and Extractive Summarization.
Abstract
The abstractive method and extractive method are two main approaches for automatic document summarization. In this paper, to fully integrate the relatedness and advantages of both approaches, we propose a general unified framework for abstractive summarization which incorporates extractive summarization as an auxiliary task. In particular, our framework is composed of a shared hierarchical document encoder, a hierarchical attention mechanism-based decoder, and an extractor. We adopt multi-task learning method to train these two tasks jointly, which enables the shared encoder to better capture the semantics of the document. Moreover, as our main task is abstractive summarization, we constrain the attention learned in the abstractive task with the labels of the extractive task to strengthen the consistency between the two tasks. Experiments on the CNN/DailyMail dataset demonstrate that both the auxiliary task and the attention constraint contribute to improve the performance significantly, and our model is comparable to the state-of-the-art abstractive models. In addition, we cut half number of labels of the extractive task, pretrain the extractor, and jointly train the two tasks using the estimated sentence salience of the extractive task to constrain the attention of the abstractive task. The results do not decrease much compared with using full-labeled data of the auxiliary task.
Year
DOI
Venue
2019
10.1007/s41019-019-0087-7
Data Science and Engineering
Keywords
Field
DocType
Automatic document summarization, Multi-task learning, Attention mechanism
Automatic summarization,Multi-task learning,Computer science,Document summarization,Extractor,Encoder,Artificial intelligence,Salience (language),Sentence,Machine learning,Semantics
Journal
Volume
Issue
ISSN
4
1
2364-1185
Citations 
PageRank 
References 
1
0.37
2
Authors
4
Name
Order
Citations
PageRank
Yangbin Chen120.72
Yun Ma2255.82
Xudong Mao310510.64
Qing Li43222433.87