Improved recognition of spontaneous Hungarian speech: morphological and acoustic modeling techniques for a less resourced task - Citegraph

Paper Info

Title
Improved recognition of spontaneous Hungarian speech: morphological and acoustic modeling techniques for a less resourced task

Abstract
Various morphological and acoustic modeling techniques are evaluated on a less resourced, spontaneous Hungarian large-vocabulary continuous speech recognition (LVCSR) task. Among morphologically rich languages, Hungarian is known for its agglutinative, inflective nature that increases the data sparseness caused by a relatively small training database. Although Hungarian spelling is considered as simple phonological, a large part of the corpus is covered by words pronounced in multiple, phonemically different ways. Data-driven and language specific knowledge supported vocabulary decomposition methods are investigated in combination with phoneme- and grapheme-based acoustic modeling techniques on the given task. Word baseline and morph-based advanced baseline results are significantly outperformed by using both statistical and grammatical vocabulary decomposition methods. Although the discussed morph-based techniques recognize a significant amount of out of vocabulary words, the improvements are due not to this fact but to the reduction of insertion errors. Applying grapheme-based acoustic models instead of phoneme-based models causes no severe recognition performance deteriorations. Moreover, a fully data-driven acoustic modeling technique along with a statistical morphological modeling approach provides the best performance on the most difficult test set. The overall best speech recognition performance is obtained by using a novel word to morph decomposition technique that combines grammatical and unsupervised statistical segmentation algorithms. The improvement achieved by the proposed technique is stable across acoustic modeling approaches and larger with speaker adaptation.

Year	DOI	Venue
2010	10.1109/TASL.2009.2038807	IEEE Transactions on Audio, Speech & Language Processing
Keywords	Field	DocType
grapheme-based acoustic modeling technique,statistical morphological modeling approach,data-driven acoustic modeling technique,acoustic modeling techniques,acoustic modeling technique,resourced task,hungarian large-vocabulary continuous speech,hungarian spelling,decomposition technique,improved recognition,acoustic modeling approach,best performance,spontaneous hungarian speech,grapheme-based acoustic model,decomposition method,loudspeakers,automatic speech recognition,language model,databases,natural languages,speech recognition,informatics,writing	Computer science,Grapheme,Segmentation,Agglutinative language,Speech recognition,Natural language,Artificial intelligence,Natural language processing,Phonology,Loudspeaker,Vocabulary,Test set	Journal
Volume	Issue	ISSN
18	6	1558-7916
Citations	PageRank	References
9	0.79	31
Authors
5

Authors (5 rows)

Cited by (9 rows)

References (31 rows)

Name	Order	Citations	PageRank
Péter Mihajlik	1	58	10.15
Zoltán Tüske	2	119	17.32
Balázs Tarján	3	21	4.92
Bottyán Németh	4	312	40.04
Tibor Fegyó	5	61	10.46

1