Abstract | ||
---|---|---|
Due to the growing interest in speech recognition technologies, several datasets of speech acquired under uncontrolled conditions have been proposed in recent years. The majority of the datasets available to the community are in English, which reduces the possibility of developing and evaluating recognition technologies in languages other than English. In this paper we try to reduce this language-related gap by proposing a dataset for Arabic language speech recognition. The dataset is made available to the community and contains 100 speakers of both genders. Experiments with some of the latest speaker recognition approaches have been performed both with and without a suitable training on the Arabic language. Results suggest that, to effectively develop recognition technologies in other languages, suitable data for that language are necessary to allow at least a transfer learning approach. In particular, such data is crucial when short utterances are considered. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1007/978-3-031-08421-8_23 | AIxIA 2021 – Advances in Artificial Intelligence |
Keywords | DocType | ISSN |
Speaker recognition, Arabic language, Dataset | Conference | 0302-9743 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Simone Bianco | 1 | 226 | 24.48 |
Celona Luigi | 2 | 0 | 0.34 |
Khalifa Intissar | 3 | 0 | 0.34 |
Napoletano Paolo | 4 | 0 | 0.34 |
Petrovsky Alexey | 5 | 0 | 0.34 |
Piccoli Flavio | 6 | 0 | 0.34 |
Raimondo Schettini | 7 | 1476 | 154.06 |
Shanin Ivan | 8 | 0 | 0.34 |