Impact of Word Error Rate on theme identification task of highly imperfect human-human conversations. - Citegraph

Paper Info

Title
Impact of Word Error Rate on theme identification task of highly imperfect human-human conversations.

Abstract
HighlightsReview of the impact of dialogue representations and classification methods.We discuss the impact of discriminative words in terms of transcription accuracy.Original study evaluating the impact of the WER in the LDA topic space. A review is proposed of the impact of word representations and classification methods in the task of theme identification of telephone conversation services having highly imperfect automatic transcriptions. We firstly compare two word-based representations using the classical Term Frequency-Inverse Document Frequency with Gini purity criteria (TF-IDF-Gini) method and the latent Dirichlet allocation (LDA) approach. We then introduce a classification method that takes advantage of the LDA topic space representation, highlighted as the best word representation. To do so, two assumptions about topic representation led us to choose a Gaussian Process (GP) based method. Its performance is compared with a classical Support Vector Machine (SVM) classification method. Experiments showed that the GP approach is a better solution to deal with the multiple theme complexity of a dialogue, no matter the conditions studied (manual or automatic transcriptions) (Morchid et al., 2014). In order to better understand results obtained using different word representation methods and classification approaches, we then discuss the impact of discriminative and non-discriminative words extracted by both word representations methods in terms of transcription accuracy (Morchid et al., 2014). Finally, we propose a novel study that evaluates the impact of the Word Error Rate (WER) in the LDA topic space learning process as well as during the theme identification task. This original qualitative study points out that selecting a small subset of words having the lowest WER (instead of using all the words) allows the system to better classify automatic transcriptions with an absolute gain of 0.9 point, in comparison to the best performance achieved on this dialogue classification task (precision of 83.3%).

Year	DOI	Venue
2016	10.1016/j.csl.2015.12.001	Computer Speech & Language
Keywords	Field	DocType
Speech analytics,Human–human dialogue,Latent Dirichlet allocation,Topic representation,Principal component analysis,Classification performance study	Transcription (linguistics),Latent Dirichlet allocation,Conversation,Computer science,Gaussian process,Natural language processing,Artificial intelligence,Discriminative model,Speech analytics,Support vector machine,Word error rate,Speech recognition,Machine learning	Journal
Volume	Issue	ISSN
38	C	0885-2308
Citations	PageRank	References
2	0.36	26
Authors
3

Authors (3 rows)

Cited by (2 rows)

References (26 rows)

Name	Order	Citations	PageRank
Mohamed Morchid	1	84	22.79
richard dufour	2	98	23.98
georges linar es	3	136	29.55

1