Title | ||
---|---|---|
The Broadcast Narrow Band Speech Corpus: A New Resource Type For Large Scale Language Recognition |
Abstract | ||
---|---|---|
This paper describes a new resource type, broadcast narrow band speech for use in large scale language recognition research and technology development. After providing the rational for this new resource type, the paper describes the collection, segmentation, auditing procedures and data formats used. Along the way, it addresses issues of defining language and dialect in found data and how ground truth is established for this corpus. |
Year | Venue | Keywords |
---|---|---|
2009 | INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | multilingual speech corpora, language recognition, language identification, language detection, language, dialect, mutual intelligibility, broadcast news, conversational speech |
Field | DocType | Citations |
Speech corpus,Broadcasting,Computer science,Speech recognition,Language recognition,Natural language processing,Artificial intelligence,Narrow band | Conference | 1 |
PageRank | References | Authors |
0.36 | 1 | 8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Christopher Cieri | 1 | 123 | 42.44 |
Linda Brandschain | 2 | 8 | 2.31 |
Abby Neely | 3 | 4 | 1.47 |
David Graff | 4 | 71 | 23.77 |
Kevin Walker | 5 | 65 | 21.51 |
Chris Caruso | 6 | 2 | 1.07 |
Alvin F. Martin | 7 | 1289 | 194.28 |
Craig S. Greenberg | 8 | 44 | 9.33 |