Title
The Broadcast Narrow Band Speech Corpus: A New Resource Type For Large Scale Language Recognition
Abstract
This paper describes a new resource type, broadcast narrow band speech for use in large scale language recognition research and technology development. After providing the rational for this new resource type, the paper describes the collection, segmentation, auditing procedures and data formats used. Along the way, it addresses issues of defining language and dialect in found data and how ground truth is established for this corpus.
Year
Venue
Keywords
2009
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5
multilingual speech corpora, language recognition, language identification, language detection, language, dialect, mutual intelligibility, broadcast news, conversational speech
Field
DocType
Citations 
Speech corpus,Broadcasting,Computer science,Speech recognition,Language recognition,Natural language processing,Artificial intelligence,Narrow band
Conference
1
PageRank 
References 
Authors
0.36
1
8
Name
Order
Citations
PageRank
Christopher Cieri112342.44
Linda Brandschain282.31
Abby Neely341.47
David Graff47123.77
Kevin Walker56521.51
Chris Caruso621.07
Alvin F. Martin71289194.28
Craig S. Greenberg8449.33