Title
Automatic Close Captioning for Live Hungarian Television Broadcast Speech: A Fast and Resource-Efficient Approach
Abstract
In this paper, the application of LVCSR (Large Vocabulary Continuous Speech Recognition) technology is investigated for real-time, resource-limited broadcast close captioning. The work focuses on transcribing live broadcast conversation speech to make such programs accessible to deaf viewers. Due to computational limitations, real time factor (RTF) and memory requirements are kept low during decoding with various models tailored for Hungarian broadcast speech recognition. Two decoders are compared on the direct transcription task of broadcast conversation recordings, and setups employing re-speakers are also tested. Moreover, the models are evaluated on a broadcast news transcription task as well, and different language models (LMs) are tested in order to demonstrate the performance of our systems in settings when low memory consumption is a less crucial factor.
Year
DOI
Venue
2015
10.1007/978-3-319-23132-7_13
Lecture Notes in Artificial Intelligence
Keywords
Field
DocType
Speech recognition,LVCSR,Broadcast news,Broadcast conversation,GMM,DNN,Hungarian,Kaldi,Limited resources
Real time factor,Transcription (linguistics),Broadcasting,Closed captioning,Conversation,Computer science,Speech recognition,Decoding methods,Vocabulary,Multimedia,Language model
Conference
Volume
ISSN
Citations 
9319
0302-9743
0
PageRank 
References 
Authors
0.34
6
7
Name
Order
Citations
PageRank
Ádam Varga110.73
Balázs Tarján2214.92
Zoltán Tobler341.39
György Szaszák45113.21
Tibor Fegyó56110.46
Csaba Bordás600.34
Péter Mihajlik75810.15