Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition | 0 | 0.34 | 2022 |
Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems | 0 | 0.34 | 2022 |
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization | 0 | 0.34 | 2022 |
Ask2Mask: Guided Data Selection for Masked Speech Modeling | 0 | 0.34 | 2022 |
MAESTRO: Matched Speech Text Representations through Modality Matching | 0 | 0.34 | 2022 |
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses | 0 | 0.34 | 2022 |
EXTENDING PARROTRON: AN END-TO-END, SPEECH CONVERSION AND SPEECH RECOGNITION MODEL FOR ATYPICAL SPEECH | 0 | 0.34 | 2021 |
MIXTURE OF INFORMED EXPERTS FOR MULTILINGUAL SPEECH RECOGNITION | 0 | 0.34 | 2021 |
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection. | 0 | 0.34 | 2020 |
SCADA - Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR. | 0 | 0.34 | 2020 |
Multilingual Speech Recognition with Self-Attention Structured Parameterization. | 0 | 0.34 | 2020 |
Speech Recognition With Augmented Synthesized Speech | 1 | 0.36 | 2019 |
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. | 0 | 0.34 | 2019 |
Leveraging Language Id In Multilingual End-To-End Speech Recognition | 0 | 0.34 | 2019 |
From Audio to Semantics: Approaches to end-to-end spoken language understanding. | 0 | 0.34 | 2018 |
Transliteration Based Approaches to Improve Code-Switched Speech Recognition Performance. | 0 | 0.34 | 2018 |
Syllable-based acoustic modeling with CTC-SMBR-LSTM | 0 | 0.34 | 2017 |
The geography of university scientific production in Europe: an exploration in the field of Food Science and Technology. | 0 | 0.34 | 2017 |
On the use of deep feedforward neural networks for automatic language identification. | 10 | 0.53 | 2016 |
Towards acoustic model unification across dialects | 1 | 0.39 | 2016 |
High quality agreement-based semi-supervised training data for acoustic modeling | 0 | 0.34 | 2016 |
A Real-Time End-to-End Multilingual Speech Recognition Architecture | 5 | 0.45 | 2015 |
Bringing Contextual Information To Google Speech Recognition | 7 | 0.59 | 2015 |
Frame-by-frame language identification in short utterances using deep neural networks. | 10 | 0.89 | 2015 |
Multi-Dialectical Languages Effect on Speech Recognition: Too Much Choice Can Hurt | 1 | 0.65 | 2015 |
Improved Recognition Of Contact Names In Voice Commands | 5 | 0.65 | 2015 |
Backoff inspired features for maximum entropy language models. | 4 | 0.70 | 2014 |
A big data approach to acoustic model training corpus selection. | 7 | 0.91 | 2014 |
Automatic language identification using long short-term memory recurrent neural networks. | 16 | 0.75 | 2014 |
Automatic language identification using deep neural networks | 39 | 1.64 | 2014 |
Deploying Google Search By Voice In Cantonese | 4 | 0.45 | 2011 |
Efficient and robust music identification with weighted finite-state transducers | 5 | 0.44 | 2010 |
Discriminative Topic Segmentation of Text and Speech | 0 | 0.34 | 2010 |
A factor automaton approach for the forced alignment of long speech recordings | 21 | 1.44 | 2009 |
A New Quality Measure For Topic Segmentation Of Text And Speech | 1 | 0.36 | 2009 |
Audiovisual celebrity recognition in unconstrained web videos | 15 | 1.06 | 2009 |
Supervised Learning of Semantic Classes for Image Annotation and Retrieval | 488 | 16.00 | 2007 |
Robust Music Identification, Detection, and Analysis. | 3 | 0.39 | 2007 |
Query by semantic example | 12 | 0.76 | 2006 |
SVM kernel adaptation in speaker classification and verification | 2 | 0.40 | 2004 |
A New SVM Approach to Speaker Identification and Verification Using Probabilistic Distance Kernels | 20 | 1.24 | 2003 |
A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications | 164 | 16.05 | 2003 |
From multimedia retrieval to knowledge management | 10 | 0.90 | 2002 |
Speechbot: an experimental speech-based search engine for multimedia content on the web | 24 | 1.82 | 2002 |
A boosting approach for confidence scoring | 22 | 1.94 | 2001 |
Topic segmentation with an aspect hidden Markov model | 87 | 4.77 | 2001 |
An experimental study of an audio indexing system for the web | 28 | 2.30 | 2000 |
SpeechBot: a Speech Recognition based Audio Indexing System for the Web | 18 | 1.69 | 2000 |
Indexing Multimedia for the Internet | 5 | 1.83 | 1999 |
Data-driven environmental compensation for speech recognition: a unified approach | 36 | 2.33 | 1998 |