Title
Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues
Abstract
We investigate automatic approaches to finding "hidden" sponta- neous speech events, such as sentence boundaries and disfluencies, in multi-party meetings. Hidden events are characterized prosod- ically by a large array of automatically extracted energy, dura- tion, and pitch features, and are modeled by decision tree classi- fiers; lexical cues are modeled by N-gram language models. Both sources of information are combined in a hidden Markov model framework. Results show that combined classifiers achieve higher accuracy than either single knowledge source alone. We also study classifiers that use only the preceding context for predicting events, simulating online processing. We find that prosodic features are more robust than are language model features to this constraint. Fi- nally, we examine the effect of automatic word recognition errors, in both training and testing, on classification accuracy. We find that lexical models degrade much more severely than do prosodic mod- els in this case, again showing the relative robustness of prosodic information for hidden-event detection in natural conversation.
Year
Venue
Keywords
2002
INTERSPEECH
decision tree,language model,hidden markov model,word recognition
Field
DocType
Citations 
Pattern recognition,Computer science,Speech recognition,Natural language processing,Artificial intelligence,Punctuation
Conference
26
PageRank 
References 
Authors
3.49
5
3
Name
Order
Citations
PageRank
Don Baron117024.35
Elizabeth Shriberg23057325.64
Andreas Stolcke36690712.46