Jiangyan Yi - Citegraph

Author Info

Name	Affiliation	Papers
JIANGYAN YI	Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China	52
Collaborators	Citations	PageRank
61	19	17.99
Referers	Referees	References
74	416	138

Search Limit

100416

Publications (52 rows)

Collaborators (61 rows)

Referers (74 rows)

Referees (100 rows)

Title	Citations	PageRank	Year
Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features.	0	0.34	2022
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition	0	0.34	2022
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation	0	0.34	2022
ADD 2022: the first Audio Deep Synthesis Detection Challenge.	0	0.34	2022
Continual Learning for Fake Audio Detection.	1	0.36	2021
Rnn-transducer With Language Bias For End-to-end Mandarin-English Code-switching Speech Recognition	0	0.34	2021
PROSODY AND VOICE FACTORIZATION FOR FEW-SHOT SPEAKER ADAPTATION IN THE CHALLENGE M2VOC 2021	0	0.34	2021
BI-LEVEL STYLE AND PROSODY DECOUPLING MODELING FOR PERSONALIZED END-TO-END SPEECH SYNTHESIS	0	0.34	2021
Hierarchically Attending Time-Frequency and Channel Features for Improving Speaker Verification	0	0.34	2021
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning	0	0.34	2021
Half-Truth - A Partially Fake Audio Detection Dataset.	0	0.34	2021
DECOUPLING PRONUNCIATION AND LANGUAGE FOR END-TO-END CODE-SWITCHING AUTOMATIC SPEECH RECOGNITION	0	0.34	2021
Text Enhancement for Paragraph Processing in End-to-End Code-switching TTS	0	0.34	2021
FSR - Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization.	0	0.34	2021
PATNET : A PHONEME-LEVEL AUTOREGRESSIVE TRANSFORMER NETWORK FOR SPEECH SYNTHESIS	1	0.37	2021
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition	0	0.34	2021
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition	0	0.34	2020
Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding.	0	0.34	2020
Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis.	0	0.34	2020
Focal Loss for Punctuation Prediction.	0	0.34	2020
Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis.	0	0.34	2020
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition	0	0.34	2020
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-Channel Speech Separation with Deep Embedding Representations.	0	0.34	2020
Bi-Level Speaker Supervision for One-Shot Speech Synthesis.	0	0.34	2020
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations.	0	0.34	2020
Spoken Content and Voice Factorization for Few-Shot Speaker Adaptation.	0	0.34	2020
A Public Chinese Dataset for Language Model Adaptation	0	0.34	2020
Self-Attention Transducers for End-to-End Speech Recognition	1	0.36	2019
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting	0	0.34	2019
Self-Attention Based Model For Punctuation Prediction Using Word And Speech Embeddings	0	0.34	2019
Language-Invariant Bottleneck Features From Adversarial End-To-End Acoustic Models For Low Resource Speech Recognition	0	0.34	2019
Forward–Backward Decoding Sequence for Regularizing End-to-End TTS	1	0.37	2019
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition.	3	0.39	2019
Voice Activity Detection Based on Time-Delay Neural Networks	1	0.35	2019
Distilling Knowledge for Distant Speech Recognition via Parallel Data	0	0.34	2019
Batch Normalization based Unsupervised Speaker Adaptation for Acoustic Models	0	0.34	2019
Focal Loss for End-to-end Short Utterances Chinese Dialect Identification	0	0.34	2019
Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network	0	0.34	2019
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features	2	0.40	2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition	1	0.34	2019
Hypersphere Embedding and Additive Margin for Query-by-example Keyword Spotting	0	0.34	2019
CLMAD: A Chinese Language Model Adaptation Dataset	0	0.34	2018
Research on Dynamic and Static Fusion Polymorphic Gesture Recognition Algorithm for Interactive Teaching Interface.	0	0.34	2018
Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation	0	0.34	2018
Distilling Knowledge Using Parallel Data for Far-field Speech Recognition.	0	0.34	2018
CTC regularized model adaptation for improving LSTM RNN based multi-accent Mandarin speech recognition.	1	0.35	2018
Distilling Knowledge From An Ensemble Of Models For Punctuation Prediction	0	0.34	2017
Continuous Multimodal Emotion Prediction Based on Long Short Term Memory Recurrent Neural Network.	7	0.48	2017
Improving BLSTM RNN based Mandarin speech recognition using accent dependent bottleneck features.	0	0.34	2016
Improving accented Mandarin speech recognition by using recurrent neural network based language model adaptation	0	0.34	2016

1
2
50 / page