Title
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation.
Abstract
Assessing the quality of natural language generation (NLG) systems through human annotation is very expensive. Additionally, human annotation campaigns are time-consuming and include non-reusable human labour. In practice, researchers rely on automatic metrics as a proxy of quality. In the last decade, many string-based metrics (e.g., BLEU or ROUGE) have been introduced. However, such metrics usually rely on exact matches and thus, do not robustly handle synonyms. In this paper, we introduce InfoLM a family of untrained metrics that can be viewed as a string-based metric that addresses the aforementioned flaws thanks to a pre-trained masked language model. This family of metrics also makes use of information measures allowing the possibility to adapt InfoLM to different evaluation criteria. Using direct assessment, we demonstrate that InfoLM achieves statistically significant improvement and two figure correlation gains in many configurations compared to existing metrics on both summarization and data2text generation tasks.
Year
Venue
Keywords
2022
AAAI Conference on Artificial Intelligence
Speech & Natural Language Processing (SNLP),Machine Learning (ML)
DocType
ISSN
Citations 
Conference
AAAI 2022
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
Pierre Colombo104.06
Chloe Fan21086.53
Pablo Piantanida338955.41