Title
Using a Model of the Cochlea Based in the Micro and Macro Mechanical to Find Parameters for Automatic Speech Recognition
Abstract
Parametric representations based on cochlear behavior have recently been used in several studies of Automatic Speech Recognition (ASR), because the cochlea, a key organ of mammalian hearing, is the principal element that transduces the sound pressure received by the ear. In this paper we show how the macro- and micro-mechanical model can be used in ASR tasks. We use the values that Neely found in his work on this model to set the central frequencies of a filter bank, and we obtain speech parameters in a manner similar to the construction of MFCC. We propose a new approach to constructing the filter bank in our parametric representation, and we use the resulting filter distribution to obtain a new representation of speech in the frequency domain. It is important to note that MFCC parameters use the Mel scale to create a filter bank in which the central frequency of each filter is a function of that scale. We instead use the response of Neely's model to set the central frequencies of the filter bank, replacing the Mel scale function with another representation. Using place theory, we reach a performance of 98.5% on a task of isolated digits pronounced by 5 different speakers. Neely's model was chosen because, when a set of cochlear parameters such as mass, damping, and stiffness, among others, is substituted into the model, the response obtained is close to the one that von Békésy proposed in his preliminary work on the principle of cochlear function.
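The idea of swapping the Mel scale for a cochlear place-to-frequency map when choosing filter-bank center frequencies can be illustrated with a minimal sketch. The abstract does not give the values obtained from Neely's model, so the code below uses the Greenwood place-frequency function with its commonly cited human constants as a stand-in; this substitution, the function names, and the 100 to 4000 Hz range are assumptions made for illustration and are not the authors' method.

```python
import math

def hz_to_mel(f_hz):
    # Common Mel-scale formula used when building MFCC filter banks.
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_centers(n_filters, f_min, f_max):
    # Center frequencies equally spaced on the Mel scale.
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_filters)]

# Stand-in place map (ASSUMPTION): Greenwood function, human constants.
# x is the relative position along the cochlea, 0 = apex, 1 = base.
def place_to_hz(x):
    return 165.4 * (10.0 ** (2.1 * x) - 0.88)

def hz_to_place(f_hz):
    return math.log10(f_hz / 165.4 + 0.88) / 2.1

def place_centers(n_filters, f_min, f_max):
    # Center frequencies equally spaced in cochlear position
    # (place theory) instead of equally spaced in Mel.
    lo, hi = hz_to_place(f_min), hz_to_place(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [place_to_hz(lo + step * (i + 1)) for i in range(n_filters)]

if __name__ == "__main__":
    for m, p in zip(mel_centers(8, 100.0, 4000.0),
                    place_centers(8, 100.0, 4000.0)):
        print(f"mel: {m:8.1f} Hz   place: {p:8.1f} Hz")
```

Either list of center frequencies could then drive the same triangular-filter and cepstral steps used for MFCC; only the spacing rule changes.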
Year
2013
DOI
10.1109/MICAI.2013.39
Venue
MICAI (Special Sessions)
Keywords
mel scale function,cochlea behavior,central frequency,model behavior,mel scale,bank filter,parametric representation,macro mechanical,automatic speech recognition,new representation,new approach,micro mechanical model,damping,speech recognition
Field
Frequency domain,Sound pressure,Mel-frequency cepstrum,Pattern recognition,Stiffness,Computer science,Speech recognition,Mel scale,Parametric statistics,Artificial intelligence,Macro,Hidden Markov model
DocType
Conference
Citations
0
PageRank
0.34
References
1
Authors
2