Topic-based Evaluation for Conversational Bots. - Citegraph

Paper Info

Title
Topic-based Evaluation for Conversational Bots.

Abstract
Dialog evaluation is a challenging problem, especially for non task-oriented dialogs where conversational success is not well-defined. We propose to evaluate dialog quality using topic-based metrics that describe the ability of a conversational bot to sustain coherent and engaging conversations on a topic, and the diversity of topics that a bot can handle. To detect conversation topics per utterance, we adopt Deep Average Networks (DAN) and train a topic classifier on a variety of question and query data categorized into multiple topics. We propose a novel extension to DAN by adding a topic-word attention table that allows the system to jointly capture topic keywords in an utterance and perform topic classification. We compare our proposed topic based metrics with the ratings provided by users and show that our metrics both correlate with and complement human judgment. Our analysis is performed on tens of thousands of real human-bot dialogs from the Alexa Prize competition and highlights user expectations for conversational bots.

Year	Venue	Field
2018	arXiv: Computation and Language	Dialog box,User expectations,Conversation,Computer science,Utterance,Human judgment,Artificial intelligence,Natural language processing,Classifier (linguistics)
DocType	Volume	ISSN
Journal	abs/1801.03622	Nips.Workshop.ConversationalAI 2017-12-08
Citations	PageRank	References
3	0.39	14
Authors
6

Authors (6 rows)

Cited by (3 rows)

References (14 rows)

Name	Order	Citations	PageRank
Fenfei Guo	1	7	1.52
Angeliki Metallinou	2	245	16.39
Chandra Khatri	3	16	3.03
Anirudh Raju	4	14	3.05
Anu Venkatesh	5	3	0.39
Ashwin Ram	6	1087	187.96

1