A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer | 0 | 0.34 | 2022 |
Deep Segment Model for Acoustic Scene Classification | 0 | 0.34 | 2022 |
End-to-End Audio-Visual Neural Speaker Diarization | 0 | 0.34 | 2022 |
Using Paralinguistic Information To Disambiguate User Intentions For Distinguishing Phrase Structure And Sarcasm In Spoken Dialog Systems | 0 | 0.34 | 2021 |
Speech Emotion Recognition Based on Acoustic Segment Model | 0 | 0.34 | 2021 |
Speech Enhancement Based on Teacher–Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition | 5 | 0.51 | 2019 |
A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models | 0 | 0.34 | 2019 |
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models | 0 | 0.34 | 2019 |
A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition | 1 | 0.35 | 2019 |
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks | 0 | 0.34 | 2018 |
Acoustics-guided evaluation (AGE): a new measure for estimating performance of speech enhancement algorithms for robust ASR | 1 | 0.36 | 2018 |
Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR | 1 | 0.35 | 2018 |
Image region annotation based on segmentation and semantic correlation analysis | 0 | 0.34 | 2018 |
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures | 3 | 0.39 | 2018 |
A Progressive Deep Learning Approach to Child Speech Separation | 0 | 0.34 | 2018 |
An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition | 5 | 0.42 | 2017 |
On Design Of Robust Deep Models For CHiME-4 Multi-Channel Speech Recognition With Multiple Configurations Of Array Microphones | 2 | 0.37 | 2017 |
Joint Training Of Multi-Channel-Condition Dereverberation And Acoustic Modeling Of Microphone Array Speech For Robust Distant Speech Recognition | 0 | 0.34 | 2017 |
A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Networks | 17 | 0.87 | 2017 |
Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation | 2 | 0.37 | 2017 |
A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition | 0 | 0.34 | 2016 |
Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks | 0 | 0.34 | 2016 |
Zero resource anti-spoofing detection for unit selection based synthetic speech using image spectrogram artifacts | 0 | 0.34 | 2016 |
Towards a direct Bayesian adaptation framework for deep models | 0 | 0.34 | 2016 |
Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations | 0 | 0.34 | 2016 |
An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework | 5 | 0.40 | 2015 |
A Unified Speaker-Dependent Speech Separation And Enhancement System Based On Deep Neural Networks | 2 | 0.36 | 2015 |
A regression approach to speech enhancement based on deep neural networks | 174 | 4.83 | 2015 |
A Probabilistic Framework for Representing Dialog Systems and Entropy-Based Dialog Management through Dynamic Stochastic State Evolution | 5 | 0.62 | 2015 |
A Deep Neural Network Approach To Speech Bandwidth Expansion | 12 | 0.68 | 2015 |
A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors | 0 | 0.34 | 2014 |
Attribute based lattice rescoring in spontaneous speech recognition | 0 | 0.34 | 2014 |
Feature space maximum a posteriori linear regression for adaptation of deep neural networks | 7 | 0.49 | 2014 |
Cross-language transfer learning for deep neural network based speech enhancement | 2 | 0.37 | 2014 |
Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers | 16 | 0.70 | 2014 |
An Experimental Study on Speech Enhancement Based on Deep Neural Networks | 56 | 2.27 | 2014 |
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling | 3 | 0.37 | 2014 |
A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling | 4 | 0.39 | 2014 |
A maximal figure-of-merit learning approach to maximizing mean average precision with deep neural network based classifiers | 2 | 0.41 | 2014 |
Cluster-based analysis for characterizing dynamic functional connectivity | 2 | 0.41 | 2014 |
A ridge ensemble empirical mode decomposition approach to clutter rejection for ultrasound color flow imaging | 5 | 0.62 | 2013 |
Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model | 11 | 0.52 | 2013 |
Online whole-word and stroke-based modeling for hand-written letter recognition in in-car environments | 2 | 0.46 | 2013 |
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition | 5 | 0.44 | 2013 |
A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting | 0 | 0.34 | 2012 |
Per-Exemplar Fusion Learning for Video Retrieval and Recounting | 0 | 0.34 | 2012 |
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data | 34 | 1.47 | 2012 |
GENIE TRECVID 2011 multimedia event detection: late-fusion approaches to combine multiple audio-visual features | 3 | 0.38 | 2011 |
Maximum Confidence Measure Based Interaural Phase Difference Estimation For Noise Masking In Dual-Microphone Robust Speech Recognition | 0 | 0.34 | 2011 |
A Kernel Framework for Content-Based Artist Recommendation System in Music | 11 | 0.55 | 2011 |