Name
Affiliation
Papers
YANMIN QIAN
Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
110
Collaborators
Citations 
PageRank 
215
295
44.44
Referers 
Referees 
References 
690
924
646
Search Limit
100924
Title
Citations
PageRank
Year
Exploring Effective Data Utilization for Low-Resource Speech Recognition.00.342022
MLP-SVNET: A Multi-Layer Perceptrons Based Network for Speaker Verification00.342022
The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021.00.342022
Knowledge Transfer and Distillation from Autoregressive to Non-Autoregessive Speech Recognition00.342022
Time-Domain Audio-Visual Speech Separation on Low Quality Videos00.342022
Local Information Modeling with Self-Attention for Speaker Verification00.342022
Separating Long-Form Speech with Group-wise Permutation Invariant Training.00.342022
Skim: Skipping Memory Lstm for Low-Latency Real-Time Continuous Speech Separation00.342022
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild00.342022
Attentive Feature Fusion for Robust Speaker Verification00.342022
End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party00.342022
Punctuation Prediction for Streaming On-Device Speech Recognition.00.342022
Dual Path Embedding Learning for Speaker Verification with Triplet Attention00.342022
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design00.342022
Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification.00.342022
Optimizing Data Usage for Low-Resource Speech Recognition00.342022
Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification00.342022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding00.342022
Speaker Embedding Augmentation with Noise Distribution Matching00.342021
CONVOLUTIVE TRANSFER FUNCTION INVARIANT SDR TRAINING CRITERIA FOR MULTI-CHANNEL REVERBERANT SPEECH SEPARATION00.342021
Data Augmentation for end-to-end Code-Switching Speech Recognition00.342021
TOWARDS DATA SELECTION ON TTS DATA FOR CHILDREN'S SPEECH RECOGNITION00.342021
Dual-Path Rnn For Long Recording Speech Separation00.342021
END-TO-END DEREVERBERATION, BEAMFORMING, AND SPEECH RECOGNITION WITH IMPROVED NUMERICAL STABILITY AND ADVANCED FRONTEND10.352021
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning00.342021
SELF-SUPERVISED LEARNING BASED DOMAIN ADAPTATION FOR ROBUST SPEAKER VERIFICATION00.342021
The SJTU System for Short-Duration Speaker Verification Challenge 2021.00.342021
Basis-MelGAN - Efficient Neural Vocoder Based on Audio Decomposition.00.342021
UNIT SELECTION SYNTHESIS BASED DATA AUGMENTATION FOR FIXED PHRASE SPEAKER VERIFICATION00.342021
AISPEECH-SJTU ASR SYSTEM FOR THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE00.342021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions10.352021
SYNAUG: SYNTHESIS-BASED DATA AUGMENTATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION00.342021
End-to-End Multi-speaker Speech Recognition with Transformer20.422020
Listen, Watch and Understand at the Cocktail Party - Audio-Visual-Contextual Speech Separation.20.382020
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming10.352020
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection.10.382020
Multi-Modality Matters - A Performance Leap on VoxCeleb.00.342020
Learning Contextual Language Embeddings for Monaural Multi-Talker Speech Recognition.00.342020
Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network.00.342020
Knowledge Distillation For Small Foot-Print Deep Speaker Embedding00.342019
Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training00.342019
Data augmentation using generative adversarial networks for robust speech recognition.20.352019
Margin Matters - Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition.20.382019
Exploring Model Units and Training Strategies for End-to-End Speech Recognition00.342019
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification20.392019
Knowledge Distillation for End-to-End Monaural Multi-Talker ASR System00.342019
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking40.462019
On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction10.342019
MIMO-Speech: End-to-End Multi-Channel Multi-Speaker Speech Recognition40.492019
GANs for Children: A Generative Data Augmentation Strategy for Children Speech Recognition00.342019
  • 1
  • 2