Abstract | ||
---|---|---|
Machine reading comprehension (MRC) is a sub-field in natural language processing or computational linguistics. MRC aims to help computers understand unstructured texts and then answer questions related to them. In this paper, we present a new Vietnamese corpus for conversational machine reading comprehension (ViCoQA), consisting of 10,000 questions with answers over 2,000 conversations about health news articles. We analyze ViCoQA in depth with different linguistic aspects. Then, we evaluate several baseline models about dialogue and reading comprehension on the ViCoQA corpus. The best model obtains an F1 score of 45.27%, which is 30.91 points behind human performance (76.18%), indicating that there is ample room for improvement. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1007/978-3-030-88113-9_44 | ICCCI |
DocType | Citations | PageRank |
Conference | 0 | 0.34 |
References | Authors | |
0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Son T. Luu | 1 | 0 | 2.03 |
Mao Nguyen Bui | 2 | 0 | 0.34 |
Loi Duc Nguyen | 3 | 0 | 0.34 |
Khiem Vinh Tran | 4 | 0 | 1.01 |
Kiet Van Nguyen | 5 | 0 | 3.04 |
Ngan Luu-Thuy Nguyen | 6 | 0 | 4.06 |