Title
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System
Abstract
We describe the latest version of the SRI-ICSI meeting and lecture recognition system, as used in the NIST RT-07 evaluations, highlighting improvements made over the last year. Changes in the acoustic preprocessing include updated beamforming software for processing multiple distant microphones, and various adjustments to the speech segmenter for close-talking microphones. Acoustic models were improved by the combined use of neural-net-estimated phone posterior features, discriminative feature transforms trained with fMPE-MAP, and discriminative Gaussian estimation using MPE-MAP, as well as model adaptation specifically to nonnative and non-American speakers. The net effect of these enhancements was a 14-16% relative error reduction on distant microphones, and a 16-17% error reduction on close-talking microphones. In addition, for the first time, we report results on a new "coffee break" meeting genre, and on a new NIST metric designed to evaluate combined speech diarization and recognition.
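To make the "relative error reduction" figures in the abstract concrete, here is a minimal sketch of how such a figure is usually computed from a baseline and an improved word error rate. The function name and the error rates below are hypothetical illustrations, not values reported by the authors.

```python
def relative_error_reduction(baseline_wer: float, new_wer: float) -> float:
    """Return the error reduction as a fraction of the baseline error rate."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical word error rates in percent (NOT results from the paper).
    baseline = 40.0
    improved = 34.0
    reduction = relative_error_reduction(baseline, improved)
    print(f"{reduction:.1%}")  # prints "15.0%"
```

Under these assumed numbers, an absolute drop of 6 points from a 40% baseline corresponds to a 15% relative reduction, which is the sense in which the abstract's 14-16% range would normally be read.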
Year
2007
DOI
10.1007/978-3-540-68585-2_42
Venue
CLEAR
Keywords
lecture recognition system, discriminative feature, combined speech diarization, sri-icsi spring, nist rt-07 evaluation, acoustic preprocessing, close-talking microphone, distant microphone, sri-icsi meeting, combined use, acoustic model, discriminative gaussian estimation
DocType
Conference
Volume
4625
ISSN
0302-9743
Citations
26
PageRank
1.79
References
12
Authors
8
Name                  Order  Citations  PageRank
Andreas Stolcke       1      6690       712.46
Xavier Anguera        2      624        54.28
Kofi Boakye           3      155        13.64
Özgür Çetin           4      154        14.41
Adam Janin            5      250        34.11
Mathew Magimai-Doss   6      516        54.76
Chuck Wooters         7      404        58.49
Jing Zheng            8      442        43.00