Title | ||
---|---|---|
Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues |
Abstract | ||
---|---|---|
We investigate automatic approaches to finding "hidden" sponta- neous speech events, such as sentence boundaries and disfluencies, in multi-party meetings. Hidden events are characterized prosod- ically by a large array of automatically extracted energy, dura- tion, and pitch features, and are modeled by decision tree classi- fiers; lexical cues are modeled by N-gram language models. Both sources of information are combined in a hidden Markov model framework. Results show that combined classifiers achieve higher accuracy than either single knowledge source alone. We also study classifiers that use only the preceding context for predicting events, simulating online processing. We find that prosodic features are more robust than are language model features to this constraint. Fi- nally, we examine the effect of automatic word recognition errors, in both training and testing, on classification accuracy. We find that lexical models degrade much more severely than do prosodic mod- els in this case, again showing the relative robustness of prosodic information for hidden-event detection in natural conversation. |
Year | Venue | Keywords |
---|---|---|
2002 | INTERSPEECH | decision tree,language model,hidden markov model,word recognition |
Field | DocType | Citations |
Pattern recognition,Computer science,Speech recognition,Natural language processing,Artificial intelligence,Punctuation | Conference | 26 |
PageRank | References | Authors |
3.49 | 5 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Don Baron | 1 | 170 | 24.35 |
Elizabeth Shriberg | 2 | 3057 | 325.64 |
Andreas Stolcke | 3 | 6690 | 712.46 |