Title
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Abstract
We summarize the results of a host of efforts using giant automatic speech recognition (ASR) models pre-trained using large, diverse unlabeled datasets containing approximately a million hours of audio. We find that the combination of pre-training, self-training and scaling up model size greatly increases data efficiency, even for extremely large tasks with tens of thousands of hours of labeled data. In particular, on an ASR task with 34 k hours of labeled data, by fine-tuning an 8 billion parameter pre-trained Conformer model we can match state-of-the-art (SoTA) performance with only 3% of the training data and significantly improve SoTA with the full training set. We also report on the universal benefits gained from using big pre-trained and self-trained models for a large set of downstream tasks that cover a wide range of speech domains and span multiple orders of magnitudes of dataset sizes, including obtaining SoTA performance on many public benchmarks. In addition, we utilize the learned representation of pre-trained networks to achieve SoTA results on non-ASR tasks.
Year
DOI
Venue
2022
10.1109/JSTSP.2022.3182537
IEEE Journal of Selected Topics in Signal Processing
Keywords
DocType
Volume
Giant model,large-scale self-supervisedlearning,self-supervised learning,semisupervised learning,speech recognition
Journal
16
Issue
ISSN
Citations 
6
1932-4553
2
PageRank 
References 
Authors
0.37
25
26
Name
Order
Citations
PageRank
Yu Zhang144241.79
Daniel S. Park2223.46
Wei Han320.37
James Qin4133.68
Anmol Gulati5243.31
Joel Shor6555.47
Lorena Álvarez750436.47
Yuanzhong Xu82249.30
Yanping Huang92109.80
Shibo Wang1020.37
Zongwei Zhou1120.70
Bo Li1220642.46
Min Ma1320.70
William Chan1435724.67
Jiahui Yu1526025.83
Yongqiang Wang1620.37
liangliang cao17181690.71
Khe Chai Sim1830031.13
Bhuvana Ramabhadran191779153.83
Tara N. Sainath203497232.43
Françoise Beaufays2120.37
Zhifeng Chen222747106.75
Quoc V. Le238501366.59
Chung-Cheng Chiu2424828.00
Ruoming Pang25109292.99
Yonghui Wu26106572.78