Abstract | ||
---|---|---|
We present a conversational telephone speech data set designed to support research on novel acoustic models. Small vocabulary tasks from 10 words up to 500 words are defined using subsets of the Switchboard-1 corpus; each task has a completely closed vocabulary (an OOV rate of 0%). We justify the need for these tasks, describe the algorithm for selecting them from a large corpus, give a statistical analysis of the data and present baseline whole-word hidden Markov model recognition results. The goal of the paper is to define a common data set and to encourage other researchers to use it. |
Year | Venue | DocType |
---|---|---|
2005 | INTERSPEECH | Conference |
Citations | PageRank | References |
---|---|---|
4 | 0.45 | 4 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Simon King | 1 | 19 | 5.11 |
Chris Bartels | 2 | 77 | 5.96 |
Jeff Bilmes | 3 | 4 | 0.45 |