Title
ArabCeleb: Speaker Recognition in Arabic
Abstract
Due to the growing interest in speech recognition technologies, several datasets of speech acquired under uncontrolled conditions have been proposed in recent years. The majority of the datasets available to the community are in English, which reduces the possibility of developing and evaluating recognition technologies in languages other than English. In this paper we try to reduce this language-related gap by proposing a dataset for Arabic language speech recognition. The dataset is made available to the community and contains 100 speakers of both genders. Experiments with some of the latest speaker recognition approaches have been performed both with and without a suitable training on the Arabic language. Results suggest that, to effectively develop recognition technologies in other languages, suitable data for that language are necessary to allow at least a transfer learning approach. In particular, such data is crucial when short utterances are considered.
Year
DOI
Venue
2021
10.1007/978-3-031-08421-8_23
AIxIA 2021 – Advances in Artificial Intelligence
Keywords
DocType
ISSN
Speaker recognition, Arabic language, Dataset
Conference
0302-9743
Citations 
PageRank 
References 
0
0.34
0
Authors
8
Name
Order
Citations
PageRank
Simone Bianco122624.48
Celona Luigi200.34
Khalifa Intissar300.34
Napoletano Paolo400.34
Petrovsky Alexey500.34
Piccoli Flavio600.34
Raimondo Schettini71476154.06
Shanin Ivan800.34