Abstract | ||
---|---|---|
There has been renewed interest in the field of automatic language identification over the past two years. The advent of a public-domain ten-language corpus of telephone speech has made the evaluation of different approaches to automatic language identification feasible. In an effort to provide benchmarks for evaluating machine performance, we conducted perceptual experiments on 1-, 2-, 4- and 6-second excerpts of telephone speech excised from spontaneous speech utterances in this corpus. The subject population consisted of 10 native speakers of English and 2 speakers from each of the remaining 9 languages. Statistical analyses of our results indicate that duration of the excerpt, familiarity with the language, and number of languages known are important factors affecting a subject's performance on the identification task. |
Year | DOI | Venue |
---|---|---|
1994 | 10.1109/ICASSP.1994.389288 | Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference |
Keywords | Field | DocType |
natural languages, speech recognition, statistical analysis, automatic language identification, machine performance, perceptual benchmarks, public-domain ten-language corpus, spontaneous speech utterances, statistical analyses, telephone speech | Population, Automatic language identification, Computer science, Speech recognition, NIST, Natural language, Natural language processing, Artificial intelligence, Telephony, Perception, Statistical analysis | Conference |
Volume | ISSN | ISBN |
i | 1520-6149 | 0-7803-1775-0 |
Citations | PageRank | References |
28 | 3.07 | 1 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yeshwant K. Muthusamy | 1 | 136 | 24.25 |
Neena Jain | 2 | 28 | 3.07 |
Ronald A. Cole | 3 | 686 | 187.46 |