Data Augmentation For Abstractive Query-Focused Multi-Document Summarization - Citegraph

Paper Info

Title
Data Augmentation For Abstractive Query-Focused Multi-Document Summarization

Abstract
The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient large-scale high-quality training datasets. We present two QMDS training datasets, which we construct using two data augmentation methods: (1) transferring the commonly used single-document CNN/Daily Mail summarization dataset to create the QMDSCNN dataset, and (2) mining search-query logs to create the QMDSIR dataset. These two datasets have complementary properties, i.e., QMDSCNN has real summaries but queries are simulated, while QMDSIR has real queries but simulated summaries. To cover both these real summary and query aspects, we build abstractive end-to-end neural network models on the combined datasets that yield new state-of-the-art transfer results on DUC datasets. We also introduce new hierarchical encoders that enable a more efficient encoding of the query together with multiple documents. Empirical results demonstrate that our data augmentation and encoding methods outperform baseline models on automatic metrics, as well as on human evaluations along multiple attributes.

Year	Venue	DocType
2021	Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence	Conference
Volume	ISSN	Citations
35	2159-5399	0
PageRank	References	Authors
0.34	0	7

Authors (7 rows)

Name	Order	Citations	PageRank
Ramakanth Pasunuru	1	25	3.69
Asli Çelikyilmaz	2	407	39.06
Michel Galley	3	2154	96.04
Chen-Yan Xiong	4	405	30.82
Yizhe Zhang	5	138	19.29
Mohit Bansal	6	871	63.19
Jianfeng Gao	7	5729	296.43
