Name
Affiliation
Papers
TOMOKI TODA
Nara Institute of Science and Technology|Spoken Language Translation Research Laboratories|ATR Human Information Sciences
265
Collaborators
Citations 
PageRank 
321
1874
167.18
Referers 
Referees 
References 
2329
2199
2020
Search Limit
1001000
Title
Citations
PageRank
Year
Direct Noisy Speech Modeling for Noisy-To-Noisy Voice Conversion.00.342022
Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation.00.342022
Generalization Ability of MOS Prediction Networks00.342022
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language00.342022
Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage.00.342022
HASA-NET: A NON-INTRUSIVE HEARING-AID SPEECH ASSESSMENT NETWORK00.342021
Mandarin Electro-Laryngeal Speech Enhancement based on Statistical Voice Conversion and Manual Tone Control00.342021
Anomalous Sound Detection Using a Binary Classification Model and Class Centroids00.342021
SPEECH RECOGNITION BY SIMPLY FINE-TUNING BERT00.342021
Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling00.342021
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation00.342020
Implementation of low-latency electrolaryngeal speech enhancement based on multi-task CLDNN00.342020
Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment.00.342020
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining10.352020
A Cyclical Post-filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-speech Systems00.342020
Semi-Supervised Self-Produced Speech Enhancement and Suppression Based on Joint Source Modeling of Air- and Body-Conducted Signals Using Variational Autoencoder.00.342020
Development of a Real-time Bionic Voice Generation System based on Statistical Excitation Prediction00.342019
Investigation of Shallow Wavenet Vocoder with Laplacian Distribution Output00.342019
Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion.10.352019
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder20.362019
Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder00.342019
Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds00.342019
An Evaluation of Deep Spectral Mappings and WaveNet Vocoder for Voice Conversion.10.342018
Connectionist Temporal Classification-based Sound Event Encoder for Converting Sound Events into Onomatopoeic Representations.00.342018
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods.70.442018
NU Voice Conversion System for the Voice Conversion Challenge 2018.10.342018
An investigation of how to design control parameters for statistical voice timbre control.00.342017
Physically Constrained Statistical F0 Prediction for Electrolaryngeal Speech Enhancement.00.342017
A Vibration Control Method Of An Electrolarynx Based On Statistical F-0 Pattern Prediction00.342017
Speech Enhancement Using Non-Negative Spectrogram Models With Mel-Generalized Cepstral Regularization00.342017
Duration-Controlled LSTM for Polyphonic Sound Event Detection.110.672017
A Modulation Property Of Time-Frequency Derivatives Of Filtered Phase And Its Application To Aperiodicity And F(O) Estimation00.342017
Speaker-Dependent Wavenet Vocoder100.612017
Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling.00.342017
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis.120.932016
Teaching Social Communication Skills Through Human-Agent Interaction.20.392016
F0 transformation techniques for statistical voice conversion with direct waveform modification with spectral differential.00.342016
Combination Of Two-Dimensional Cochleogram And Spectrogram Features For Deep Learning-Based Asr10.372015
Articulatory Controllable Speech Modification Based On Gaussian Mixture Models With Direct Waveform Modification Using Spectrum Differential10.352015
Pseudogen: A Tool to Automatically Generate Pseudo-Code from Source Code20.372015
Modulation Spectrum-Constrained Trajectory Training Algorithm For Gmm-Based Voice Conversion70.442015
Sas : A Speaker Verification Spoofing Database Containing Diverse Attacks210.772015
The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function20.392015
A Latent Variable Model For Joint Pause Prediction And Dependency Parsing10.372015
Automated Social Skills Trainer40.462015
An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction00.342015
Incremental sentence compression using LSTM recurrent networks00.342015
Modulation Spectrum-Constrained Trajectory Training Algorithm For Hmm-Based Speech Synthesis10.362015
Adaptive Selection From Multiple Response Candidates In Example-Based Dialogue00.342015
Construction and analysis of social-affective interaction corpus in English and Indonesian10.352015
  • 1
  • 2