Title
Excitation Source Features For Improving The Detection Of Vowel Onset And Offset Points In A Speech Sequence
Abstract
The task of detecting the vowel regions in a given speech signal is a challenging problem. Over the years, several works on accurate detection of vowel regions and the corresponding vowel onset points (VOPs) and vowel end points (VEPs) have been reported. A novel front-end feature extraction technique exploiting the temporal and spectral characteristics of the excitation source information in the speech signal is proposed in this paper to improve the detection of vowel regions, VOPs and VEPs. To do the same, a three-class classifiers (vowel, non vowel and silence) is developed on the TIMIT database using the proposed features as well as mel-frequency cepstral coefficients (MFCC). Statistical modeling based on deep neural network has been employed for learning the parameters. Using the developed three-class classifier, a given speech sample is then forced aligned against the trained acoustic models to detect the vowel regions. The use of proposed feature results in detection of vowel regions quite different from those obtained through the MFCC. Exploiting the differences in the evidences obtained by using the two kinds of features, a technique to combine the evidences is also proposed in order to get a better estimate of the VOPs and VEPs.
Year
DOI
Venue
2017
10.21437/Interspeech.2017-135
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords
Field
DocType
vowel recognition system, vowel onset point, vowel end point
Pattern recognition,Computer science,Speech recognition,Excitation,Artificial intelligence,Vowel,Offset (computer science)
Conference
ISSN
Citations 
PageRank 
2308-457X
2
0.37
References 
Authors
0
3
Name
Order
Citations
PageRank
G. Pradhan18813.14
Avinash Kumar253.79
S. Shahnawazuddin36417.34