Abstract | ||
---|---|---|
This paper proposes space diversity speech recognition technique using distributed multi-microphone in room, as a new paradigm of speech recognition. The key technology to realize the system is (1) distant-talking speech recognition and (2) the integration method of multiple inputs. In this paper, we propose the use of distant speech model for the distant-talking speech recognition, and feature-based and likelihood-based integration methods for multi-microphone distributed in room. The distant speech model is a set of HMMs learned using speech data convolved with the impulse responses measured at several points in room. The experimental results of simulated distant-talking speech recognition show that the proposed space diversity speech recognition system can attain about 80% in accuracy, while the performances of conventional HMM using close-talking microphone are less than 50%. These results indicate that the space diversity approach is promising for robust speech recognition under the real acoustic environment. |
Year | DOI | Venue |
---|---|---|
2000 | 10.1109/ICASSP.2000.862090 | 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI |
Keywords | Field | DocType |
mel frequency cepstral coefficient,loudspeakers,space technology,speech recognition,hidden markov models,accuracy,room,acoustical engineering,space diversity,hmm,background noise,impulse response,reverberation | Speech processing,Mel-frequency cepstrum,Speech coding,Pattern recognition,Voice activity detection,Computer science,Speech recognition,Speaker recognition,Artificial intelligence,Hidden Markov model,Linear predictive coding,Acoustic model | Conference |
ISSN | Citations | PageRank |
1520-6149 | 12 | 1.41 |
References | Authors | |
1 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yasuhiro Shimizu | 1 | 12 | 1.41 |
Shoji Kajita | 2 | 147 | 21.92 |
Kazuya Takeda | 3 | 1301 | 195.60 |
Fumitada Itakura | 4 | 431 | 67.73 |