Title
Comparison Of Forced-Alignment Speech Recognition And Humans For Generating Reference Vad
Abstract
This present paper aims to answer the question whether forced-alignment speech recognition can be used as an altemative to humans in generating reference Voice Activity Detection (VAD) transcriptions. An investigation of the level of agreement between automatic/manual VAD transcriptions and the reference ones produced by a human expert was carried out. Thereafter, statistical analysis was employed on the automatically produced and the collected manual transcriptions. Experimental results confirmed that forced-alignment speech recognition can provide accurate and consistent VAD labels.
Year
Venue
Keywords
2015
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5
voice activity detection, speech recognition, speech segmentation
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
8
3
Name
Order
Citations
PageRank
Ivan Kraljevski174.00
Zheng-Hua Tan245760.32
Maria Paola Bissiri392.11