Title | ||
---|---|---|
Comparison Of Forced-Alignment Speech Recognition And Humans For Generating Reference Vad |
Abstract | ||
---|---|---|
This present paper aims to answer the question whether forced-alignment speech recognition can be used as an altemative to humans in generating reference Voice Activity Detection (VAD) transcriptions. An investigation of the level of agreement between automatic/manual VAD transcriptions and the reference ones produced by a human expert was carried out. Thereafter, statistical analysis was employed on the automatically produced and the collected manual transcriptions. Experimental results confirmed that forced-alignment speech recognition can provide accurate and consistent VAD labels. |
Year | Venue | Keywords |
---|---|---|
2015 | 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | voice activity detection, speech recognition, speech segmentation |
DocType | Citations | PageRank |
Conference | 0 | 0.34 |
References | Authors | |
8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ivan Kraljevski | 1 | 7 | 4.00 |
Zheng-Hua Tan | 2 | 457 | 60.32 |
Maria Paola Bissiri | 3 | 9 | 2.11 |