Title | ||
---|---|---|
GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems |
Abstract | ||
---|---|---|
Over the last few years, there has been a move towards data curation for multilingual taskoriented dialogue (ToD) systems that can serve people speaking different languages. However, existing multilingual ToD datasets either have a limited coverage of languages due to the high cost of data curation, or ignore the fact that dialogue entities barely exist in countries speaking these languages. To tackle these limitations, we introduce a novel data curation method that generates GlobalWoZ - a largescale multilingual ToD dataset globalized from an English ToD dataset for three unexplored use cases of multilingual ToD systems. Our method is based on translating dialogue templates and filling them with local entities in the target-language countries. Besides, we extend the coverage of target languages to 20 languages. We will release our dataset and a set of strong baselines to encourage research on multilingual ToD systems for real use cases.(1) |
Year | DOI | Venue |
---|---|---|
2022 | 10.18653/v1/2022.acl-long.115 | PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS) |
DocType | Volume | Citations |
Conference | Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) | 0 |
PageRank | References | Authors |
0.34 | 0 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bosheng Ding | 1 | 0 | 0.34 |
Junjie Hu | 2 | 112 | 20.50 |
Lidong Bing | 3 | 298 | 39.44 |
Sharifah Mahani Aljunied | 4 | 0 | 0.34 |
Shafiq R. Joty | 5 | 560 | 56.72 |
Luo Si | 6 | 2498 | 169.52 |
Chunyan Miao | 7 | 2307 | 195.72 |