Lessons learned in part-of-speech tagging of conversational speech - Citegraph

Paper Info

Title
Lessons learned in part-of-speech tagging of conversational speech

Abstract
This paper examines tagging models for spontaneous English speech transcripts. We analyze the performance of state-of-the-art tagging models, either generative or discriminative, left-to-right or bidirectional, with or without latent annotations, together with the use of ToBI break indexes and several methods for segmenting the speech transcripts (i.e., conversation side, speaker turn, or human-annotated sentence). Based on these studies, we observe that: (1) bidirectional models tend to achieve better accuracy levels than left-to-right models, (2) generative models seem to perform somewhat better than discriminative models on this task, and (3) prosody improves tagging performance of models on conversation sides, but has much less impact on smaller segments. We conclude that, although the use of break indexes can indeed significantly improve performance over baseline models without them on conversation sides, tagging accuracy improves more by using smaller segments, for which the impact of the break indexes is marginal.

Year	Venue	Keywords
2010	EMNLP	discriminative model,bidirectional model,smaller segment,better accuracy level,break index,conversation side,tobi break index,state-of-the-art tagging model,conversational speech,part-of-speech tagging,generative model,tagging accuracy,english language,dormancy,speech,indexes
Field	DocType	Volume
Prosody,Conversation,English language,Computer science,Part-of-speech tagging,Speech recognition,Natural language processing,Artificial intelligence,Generative grammar,Discriminative model,Sentence	Conference	D10-1
Citations	PageRank	References
5	0.53	22
Authors
3

Authors (3 rows)

Cited by (5 rows)

References (22 rows)

Name	Order	Citations	PageRank
Vladimir Eidelman	1	323	17.61
Zhongqiang Huang	2	217	20.41
Mary Harper	3	258	20.54

1