Title
Non-Local Estimation Of Speech Signal For Vowel Onset Point Detection In Varied Environments
Abstract
Vowel onset point (VOP) is an important information extensively employed in speech analysis and synthesis. Detecting the VOPs in a given speech sequence, independent of the text contexts and recording environments, is a challenging area of research. Performance of existing VOP detection methods have not yet been extensively studied in varied environmental conditions. In this paper, we have exploited the non-local means estimation to detect those regions in the speech sequence which are of high signal-to-noise ratio and exhibit periodicity. Mostly, those regions happen to be the vowel regions. This helps in overcoming the ill-effects of environmental degradations. Next, for each short-time frame of estimated speech sequence, we cumulatively sum the magnitude of the corresponding Fourier transform spectrum. The cumulative sum is then used as the feature to detect the VOPs. The experiments conducted on TIMIT database show that the proposed approach provides better results in terms of detection and spurious rate when compared to a few existing methods under clean and noisy test conditions.
Year
DOI
Venue
2017
10.21437/Interspeech.2017-624
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords
Field
DocType
non-local means estimation, vowel region, vowel onset point
Pattern recognition,Vowel onset point,Computer science,Speech recognition,Artificial intelligence
Conference
ISSN
Citations 
PageRank 
2308-457X
1
0.35
References 
Authors
0
3
Name
Order
Citations
PageRank
Avinash Kumar153.79
S. Shahnawazuddin26417.34
G. Pradhan38813.14