Title
Vowel based Voice Activity Detection with LSTM Recurrent Neural Network.
Abstract
Voice activity detection (VAD) determines whether the incoming signal segments are speech or noiseand is an important technique in almost all of speech-related applications. In order to improve VAD performance in various noise environments, characterizing the speech feature has been the most crucial issue up to date. Among several proposed speech features, the context information of speech through time and vowel sound characteristics are known to current state-of-the-art speech features. Therefore, in order to reflect both on these merits, we propose vowel based VAD by Long short term memory recurrent neural network (LSTM-RNN). LSTM-RNN is known to the powerful model to capture dynamical context information through time. Moreover, with teaching the LSTM-RNN to only vowel sounds rather than whole speech, LSTM-RNN can learn more effectively because of the reduced manifold of speech. According to our experiments, proposed method shows better accuracy not only in the VAD task compared to LSTM-RNN based VAD but alsoa vowel detection task.
Year
DOI
Venue
2016
10.1145/3015166.3015207
ICSPS
Field
DocType
Citations 
Speech processing,Voice activity detection,Computer science,Recurrent neural network,Long short term memory,Speech recognition,Vowel
Conference
0
PageRank 
References 
Authors
0.34
3
5
Name
Order
Citations
PageRank
Juntae Kim198.72
Jaeseok Kim240558.33
Seunghyung Lee302.37
Jinuk Park422.74
Minsoo Hahn522346.63