Title
Toward Human Parity in Conversational Speech Recognition.
Abstract
Conversational speech recognition has served as a flagship speech recognition task since the release of the Switchboard corpus in the 1990s. In this paper, we measure a human error rate on the widely used NIST 2000 test set for commercial bulk transcription. The error rate of professional transcribers is 5.9% for the Switchboard portion of the data, in which newly acquainted pairs of people discus...
Year
DOI
Venue
2017
10.1109/TASLP.2017.2756440
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Keywords
Field
DocType
Speech recognition,Error analysis,Spatial analysis,Recurrent neural networks,NIST,Acoustics
Transcription (linguistics),Computer science,Word error rate,Recurrent neural network,Speech recognition,Smoothing,Discriminative model,Language model,Acoustic model,Test set
Journal
Volume
Issue
ISSN
25
12
2329-9290
Citations 
PageRank 
References 
11
0.56
41
Authors
8
Name
Order
Citations
PageRank
Wayne Xiong1221.86
Jasha Droppo286168.35
Xuedong Huang31390283.19
frank seide41489101.15
Michael L. Seltzer5102769.42
Andreas Stolcke66690712.46
Dong Yu76264475.73
Geoffrey Zweig83406320.25