What is wrong with you?: Leveraging User Sentiment for Automatic Dialog Evaluation - Citegraph

Paper Info

Title
What is wrong with you?: Leveraging User Sentiment for Automatic Dialog Evaluation

Abstract
Accurate automatic evaluation metrics for open-domain dialogs are in high demand. Existing model-based metrics for system response evaluation are trained on human annotated data, which is cumbersome to collect. In this work, we propose to use information that can be automatically extracted from the next user utterance, such as its sentiment or whether the user explicitly ends the conversation, as a proxy to measure the quality of the previous system response. This allows us to train on a massive set of dialogs with weak supervision, without requiring manual system turn quality annotations. Experiments show that our model is comparable to models trained on human annotated data. Furthermore, our model generalizes across both spoken and written opendomain dialog corpora collected from real and paid users.

Year	DOI	Venue
2022	10.18653/v1/2022.findings-acl.331	FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022)
DocType	Volume	Citations
Conference	Findings of the Association for Computational Linguistics: ACL 2022	0
PageRank	References	Authors
0.34	0	5

Authors (5 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Sarik Ghazarian	1	0	2.03
Behnam Hedayatnia	2	0	0.34
Alexandros Papangelis	3	93	18.01
Yang Liu	4	945	70.67
Dilek Hakkani-Tür	5	1024	85.05

1