Comparison Of Forced-Alignment Speech Recognition And Humans For Generating Reference VAD - Citegraph

Paper Info

Title
Comparison Of Forced-Alignment Speech Recognition And Humans For Generating Reference VAD

Abstract
This paper aims to answer the question of whether forced-alignment speech recognition can be used as an alternative to humans in generating reference Voice Activity Detection (VAD) transcriptions. The level of agreement between the automatic and manual VAD transcriptions and the reference ones produced by a human expert was investigated. Thereafter, statistical analysis was applied to the automatically produced and the collected manual transcriptions. Experimental results confirmed that forced-alignment speech recognition can provide accurate and consistent VAD labels.

Year	Venue	Keywords
2015	16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5	voice activity detection, speech recognition, speech segmentation
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
8	3

Authors (3 rows)

Name	Order	Citations	PageRank
Ivan Kraljevski	1	7	4.00
Zheng-Hua Tan	2	457	60.32
Maria Paola Bissiri	3	9	2.11
