Title | ||
---|---|---|
Automatic Close Captioning for Live Hungarian Television Broadcast Speech: A Fast and Resource-Efficient Approach |
Abstract | ||
---|---|---|
In this paper, the application of LVCSR (Large Vocabulary Continuous Speech Recognition) technology is investigated for real-time, resource-limited broadcast close captioning. The work focuses on transcribing live broadcast conversation speech to make such programs accessible to deaf viewers. Due to computational limitations, real time factor (RTF) and memory requirements are kept low during decoding with various models tailored for Hungarian broadcast speech recognition. Two decoders are compared on the direct transcription task of broadcast conversation recordings, and setups employing re-speakers are also tested. Moreover, the models are evaluated on a broadcast news transcription task as well, and different language models (LMs) are tested in order to demonstrate the performance of our systems in settings when low memory consumption is a less crucial factor. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/978-3-319-23132-7_13 | Lecture Notes in Artificial Intelligence |
Keywords | Field | DocType |
Speech recognition,LVCSR,Broadcast news,Broadcast conversation,GMM,DNN,Hungarian,Kaldi,Limited resources | Real time factor,Transcription (linguistics),Broadcasting,Closed captioning,Conversation,Computer science,Speech recognition,Decoding methods,Vocabulary,Multimedia,Language model | Conference |
Volume | ISSN | Citations |
9319 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 6 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ádam Varga | 1 | 1 | 0.73 |
Balázs Tarján | 2 | 21 | 4.92 |
Zoltán Tobler | 3 | 4 | 1.39 |
György Szaszák | 4 | 51 | 13.21 |
Tibor Fegyó | 5 | 61 | 10.46 |
Csaba Bordás | 6 | 0 | 0.34 |
Péter Mihajlik | 7 | 58 | 10.15 |