Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets | 0 | 0.34 | 2022 |
Data-Driven Parametric Text Normalization - Rapidly Scaling Finite-State Transduction Verbalizers to New Languages. | 0 | 0.34 | 2020 |
Building Large-Vocabulary ASR Systems for Languages Without Any Audio Training Data | 1 | 0.36 | 2019 |
Developing Pronunciation Models in New Languages Faster by Exploiting Common Grapheme-to-Phoneme Correspondences Across Languages | 0 | 0.34 | 2019 |
Unified Verbalization for Speech Recognition & Synthesis Across Languages | 0 | 0.34 | 2019 |
Automatic Keyboard Layout Design for Low-Resource Latin-Script Languages. | 0 | 0.34 | 2019 |
Mining Training Data for Language Modeling Across the World's Languages | 1 | 0.63 | 2018 |
Text Normalization Infrastructure that Scales to Hundreds of Language Varieties. | 0 | 0.34 | 2018 |
Building Speech Recognition Systems for Language Documentation: The CoEDL Endangered Language Pipeline and Inference System (ELPIS) | 0 | 0.34 | 2018 |