Naoyuki Kanda - Citegraph

Author Info

Name	Affiliation	Papers
NAOYUKI KANDA	Hitachi Ltd, Cent Res Lab, 1-280 Higashi Koigakubo, Kokubunji, Tokyo 1858601, Japan	46
Collaborators	Citations	PageRank
104	103	19.45
Referers	Referees	References
321	678	238

Search Limit

100678

Publications (46 rows)

Collaborators (100 rows)

Referers (100 rows)

Referees (100 rows)

Title	Citations	PageRank	Year
VarArray: Array-Geometry-Agnostic Continuous Speech Separation.	0	0.34	2022
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition	0	0.34	2022
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End Speaker-Attributed ASR	0	0.34	2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing	7	0.42	2022
A review of speaker diarization: Recent advances with deep learning	2	0.41	2022
All-Neural Beamformer for Continuous Speech Separation.	0	0.34	2022
Streaming End-To-End Multi-Talker Speech Recognition	0	0.34	2021
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer.	1	0.35	2021
Integration Of Speech Separation, Diarization, And Recognition For Multi-Speaker Meetings: System Description, Comparison, And Analysis	0	0.34	2021
Investigation Of End-To-End Speaker-Attributed Asr For Continuous Multi-Talker Recordings	0	0.34	2021
Streaming Multi-Talker Speech Recognition with Joint Speaker Identification.	0	0.34	2021
MINIMUM BAYES RISK TRAINING FOR END-TO-END SPEAKER-ATTRIBUTED ASR	0	0.34	2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition.	1	0.35	2021
End-to-End Speaker-Attributed ASR with Transformer.	0	0.34	2021
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio	0	0.34	2021
Exploring End-To-End Multi-Channel Asr With Bias Information For Meeting Transcription	0	0.34	2021
HYPOTHESIS STITCHER FOR END-TO-END SPEAKER-ATTRIBUTED ASR ON LONG-FORM MULTI-TALKER RECORDINGS	0	0.34	2021
INTERNAL LANGUAGE MODEL TRAINING FOR DOMAIN-ADAPTIVE END-TO-END SPEECH RECOGNITION	0	0.34	2021
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone.	1	0.35	2021
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition	2	0.37	2021
SPEECH-LANGUAGE PRE-TRAINING FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING	1	0.35	2021
Investigation of Practical Aspects of Single Channel Speech Separation for ASR.	1	0.35	2021
MICROSOFT SPEAKER DIARIZATION SYSTEM FOR THE VOXCELEB SPEAKER RECOGNITION CHALLENGE 2020	0	0.34	2021
Serialized Output Training for End-to-End Overlapped Speech Recognition	0	0.34	2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers	0	0.34	2020
Acoustic Modeling For Distant Multi-Talker Speech Recognition With Single- And Multi-Channel Branches	0	0.34	2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition.	0	0.34	2019
Multimodal Response Obligation Detection with Unsupervised Online Domain Adaptation	0	0.34	2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives	7	0.55	2019
End-to-End Neural Speaker Diarization with Self-Attention	3	0.51	2019
Guided Source Separation Meets a Strong ASR Backend - Hitachi/Paderborn University Joint Investigation for Dinner Party ASR.	4	0.48	2019
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models	0	0.34	2019
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR.	0	0.34	2019
Face-Voice Matching using Cross-modal Embeddings.	0	0.34	2018
Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models.	1	0.35	2017
Investigation of lattice-free maximum mutual information-based acoustic models with sequence-level Kullback-Leibler divergence	0	0.34	2017
Combination of multiple acoustic models with unsupervised adaptation for lecture speech transcription.	1	0.39	2016
Training data pseudo-shuffling and direct decoding framework for recurrent neural network based acoustic modeling	1	0.35	2015
The NCT ASR system for IWSLT 2014.	0	0.34	2014
Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier	0	0.34	2013
Elastic Spectral Distortion For Low Resource Speech Recognition With Deep Neural Networks	12	0.86	2013
Voice activity detection based on augmented statistical noise suppression.	0	0.34	2012
A multi-expert model for dialogue and behavior control of conversational robots and agents	11	0.74	2011
Open-vocabulary keyword detection from super-large scale speech database	12	1.07	2008
Multi-domain spoken dialogue system with extensibility and robustness against speech recognition errors	25	1.32	2006
Contextual constraints based on dialogue models in database search task for spoken dialogue systems	10	0.79	2005