OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES - Citegraph

Paper Info

Title
OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES

Abstract
This paper introduces the sixth Oriental Language Recognition (OLR) 2021 Challenge, which intends to improve the performance of language recognition systems and speech recognition systems within multilingual scenarios. The data profile, four tasks, two baselines, and the evaluation principles are introduced in this paper. In addition to the Language Identification (LID) tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to OLR 2021 Challenge for the first time. The challenge this year focuses on more practical and challenging problems, with four tasks: (1) constrained LID, (2) unconstrained LID, (3) constrained multilingual ASR, (4) unconstrained multilingual ASR. Baselines for LID tasks and multilingual ASR tasks are provided, respectively. The LID baseline system is an extended TDNN x-vector model constructed with Pytorch. A transformer-based end-to-end model is provided as the multilingual ASR baseline system. These recipes will be online published, and available for participants to construct their own LID or ASR systems. The baseline results demonstrate that those tasks are rather challenging and deserve more effort to achieve better performance.

Year	Venue	Keywords
2021	2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC)	language recognition, language identification, multilingual automatic speech recognition, oriental language, OLR 2021 Challenge
DocType	ISSN	Citations
Conference	2309-9402	0
PageRank	References	Authors
0.34	0	10

Authors (10 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Binling Wang	1	0	0.34
Wenxuan Hu	2	0	0.34
Jing Li	3	0	0.34
Yiming Zhi	4	0	0.34
Zheng Li	5	0	1.35
Qingyang Hong	6	0	0.34
Lin Li	7	0	0.34
Dong Wang	8	0	0.34
Liming Song	9	2	0.72
Cheng Yang	10	0	0.34

1