Phonetically rich and balanced speech corpus for Arabic speaker-independent continuous automatic speech recognition systems - Citegraph

Paper Info

Title
Phonetically rich and balanced speech corpus for Arabic speaker-independent continuous automatic speech recognition systems

Abstract
This paper describes an efficient framework for designing and developing Arabic speaker-independent continuous automatic speech recognition systems based on a phonetically rich and balanced speech corpus. The speech corpus contains 415 sentences recorded by 42 (21 male and 21 female) Arabic native speakers from 11 Arab countries representing three major regions (Levant, Gulf, and Africa). The developed system is based on the Carnegie Mellon University (CMU) Sphinx tools. The Cambridge HTK tools were also used in some testing stages. The speech engine uses 3-emitting state Hidden Markov Models (HMM) for tri-phone based acoustic models. Based on experimental analysis of 4.07 hours of training speech data, the acoustic model used continuous observation's probability model of 16 Gaussian mixture distributions and the state distributions were tied to 400 senons. The language model contains both bi-grams and tri-grams. The system obtained 91.23% and 92.54% correct word recognition with and without diacritical marks respectively.

Year	DOI	Venue
2010	10.1109/ISSPA.2010.5605554	Information Sciences Signal Processing and their Applications
Keywords	Field	DocType
Gaussian processes,hidden Markov models,natural languages,speech processing,speech recognition,statistical distributions,Arabic speaker-independent continuous automatic speech recognition systems,Cambridge HTK tools,Carnegie Mellon University Sphinx tools,Gaussian mixture distributions,balanced speech corpus,continuous observation probability model,hidden Markov models,phonetically rich,state distributions,triphone based acoustic models,Acoustic model,Arabic Continuous Speech Recognition,Arabic speech corpus,Phonetically rich and balanced,Statistical language model	Speech corpus,Speech processing,Computer science,Artificial intelligence,Natural language processing,Language model,Pattern recognition,Word recognition,Speech recognition,Natural language,VoxForge,Hidden Markov model,Acoustic model	Conference
ISBN	Citations	PageRank
978-1-4244-7165-2	2	0.49
References	Authors
8	4

Authors (4 rows)

Cited by (2 rows)

References (8 rows)

Name	Order	Citations	PageRank
Mohammad A. M. Abushariah	1	47	6.02
Ainon, R.N.	2	2	0.49
Zainuddin, R.	3	2	0.49
Moustafa Elshafei	4	97	14.54

1