Name
Affiliation
Papers
PEDRO J. MORENO
Department of Electrical and Computer Engineering and School of Computer Science|Carnegie Mellon University
63
Collaborators
Citations 
PageRank 
134
1256
114.37
Referers 
Referees 
References 
2917
681
390
Search Limit
1001000
Title
Citations
PageRank
Year
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition00.342022
Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems00.342022
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization00.342022
Ask2Mask: Guided Data Selection for Masked Speech Modeling00.342022
MAESTRO: Matched Speech Text Representations through Modality Matching00.342022
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses00.342022
EXTENDING PARROTRON: AN END-TO-END, SPEECH CONVERSION AND SPEECH RECOGNITION MODEL FOR ATYPICAL SPEECH00.342021
MIXTURE OF INFORMED EXPERTS FOR MULTILINGUAL SPEECH RECOGNITION00.342021
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection.00.342020
SCADA - Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR.00.342020
Multilingual Speech Recognition with Self-Attention Structured Parameterization.00.342020
Speech Recognition With Augmented Synthesized Speech10.362019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation.00.342019
Leveraging Language Id In Multilingual End-To-End Speech Recognition00.342019
From Audio to Semantics: Approaches to end-to-end spoken language understanding.00.342018
Transliteration Based Approaches to Improve Code-Switched Speech Recognition Performance.00.342018
Syllable-based acoustic modeling with CTC-SMBR-LSTM00.342017
The geography of university scientific production in Europe: an exploration in the field of Food Science and Technology.00.342017
On the use of deep feedforward neural networks for automatic language identification.100.532016
Towards acoustic model unification across dialects10.392016
High quality agreement-based semi-supervised training data for acoustic modeling00.342016
A Real-Time End-to-End Multilingual Speech Recognition Architecture50.452015
Bringing Contextual Information To Google Speech Recognition70.592015
Frame-by-frame language identification in short utterances using deep neural networks.100.892015
Multi-Dialectical Languages Effect on Speech Recognition: Too Much Choice Can Hurt10.652015
Improved Recognition Of Contact Names In Voice Commands50.652015
Backoff inspired features for maximum entropy language models.40.702014
A big data approach to acoustic model training corpus selection.70.912014
Automatic language identification using long short-term memory recurrent neural networks.160.752014
Automatic language identification using deep neural networks391.642014
Deploying Google Search By Voice In Cantonese40.452011
Efficient and robust music identification with weighted finite-state transducers50.442010
Discriminative Topic Segmentation of Text and Speech00.342010
A factor automaton approach for the forced alignment of long speech recordings211.442009
A New Quality Measure For Topic Segmentation Of Text And Speech10.362009
Audiovisual celebrity recognition in unconstrained web videos151.062009
Supervised Learning of Semantic Classes for Image Annotation and Retrieval48816.002007
Robust Music Identification, Detection, and Analysis.30.392007
Query by semantic example120.762006
SVM kernel adaptation in speaker classification and verification20.402004
A New SVM Approach to Speaker Identification and Verification Using Probabilistic Distance Kernels201.242003
A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications16416.052003
From multimedia retrieval to knowledge management100.902002
Speechbot: an experimental speech-based search engine for multimedia content on the web241.822002
A boosting approach for confidence scoring221.942001
Topic segmentation with an aspect hidden Markov model874.772001
An experimental study of an audio indexing system for the web282.302000
SpeechBot: a Speech Recognition based Audio Indexing System for the Web181.692000
Indexing Multimedia for the Internet51.831999
Data-driven environmental compensation for speech recognition: a unified approach362.331998
  • 1
  • 2