DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition. - Citegraph

Paper Info

Title
DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition.

Abstract
The main focus of this paper is the development of a large vocabulary Telugu speech database. Telugu is a low-resource language for which no standardized database exists for building an automatic speech recognition (ASR) system. The database, named the IIIT-H Telugu speech corpus, consists of neutral speech samples collected from 100 speakers for building the Telugu ASR system. The speech and text corpus design and the procedure followed for collecting the database are discussed in detail. Preliminary ASR results for models built on this database are reported. The architectural choices of deep neural networks (DNNs) play a crucial role in improving the performance of ASR systems. ASR systems trained with hybrid DNNs (DNN-HMM) with more hidden layers have shown better performance than conventional GMMs (GMM-HMM). The Kaldi toolkit is used to build the acoustic models required for the ASR system.

Year	Venue	Field
2017	MIKE	Speech corpus, Computer science, Text corpus, Speech recognition, Natural language processing, Artificial intelligence, VoxForge, Hidden Markov model, Vocabulary, Telugu, Deep neural networks
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
9	6

Authors (6 rows)

Name	Order	Citations	PageRank
Vishnu Vidyadhara Raju Vegesna	1	1	0.69
Krishna Gurugubelli	2	8	5.45
Hari Krishna Vydana	3	16	4.67
Bhargav Pulugandla	4	0	0.34
Manish Shrivastava	5	19	23.49
Anil Kumar Vuppala	6	27	5.71
