Abstract | ||
---|---|---|
We present Neural Network (NN) approaches to the automated assessment of non-native spontaneous speech in a monologic task and a simulated dialogic task. Three attention-based Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Networks (RNN) are employed to learn three dimensions (i.e., delivery, language use, and content) of scoring rubrics for the spoken responses. The prompts or turn history information are encoded to low-dimensional vectors by either a BLSTM-RNN or an end-to-end memory network (MemN2N) and used as the conditions of the inputs of the NN for rating the subscore of content. The three subscores are fused together to generate a holistic score. The experimental results show that our approaches significantly outperform the conventional approaches to speech scoring and the correlations of automatically predicted scores with the reference human scores are higher than human-human agreement levels for both tasks. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/icassp.2019.8683717 | 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
Keywords | Field | DocType |
automated speech scoring, LSTM, RNN, attention, end-to-end memory networks | Dialogic,Rubric,Pattern recognition,Computer science,Recurrent neural network,Speech recognition,Artificial intelligence,Artificial neural network | Conference |
ISSN | Citations | PageRank |
1520-6149 | 0 | 0.34 |
References | Authors | |
0 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Qian Yao | 1 | 527 | 51.55 |
Patrick Lange | 2 | 9 | 8.42 |
Keelan Evanini | 3 | 79 | 20.23 |
Robert Pugh | 4 | 1 | 0.68 |
Rutuja Ubale | 5 | 2 | 3.17 |
Matthew Mulholland | 6 | 24 | 6.79 |
Xinhao Wang | 7 | 57 | 15.23 |