NLP on spoken documents without ASR - Citegraph

Paper Info

Title
NLP on spoken documents without ASR

Abstract
There is considerable interest in interdisciplinary combinations of automatic speech recognition (ASR), machine learning, natural language processing, text classification and information retrieval. Many of these boxes, especially ASR, are often based on considerable linguistic resources. We would like to be able to process spoken documents with few (if any) resources. Moreover, connecting black boxes in series tends to multiply errors, especially when the key terms are out-of-vocabulary (OOV). The proposed alternative applies text processing directly to the speech without a dependency on ASR. The method finds long (~ 1 sec) repetitions in speech, and clusters them into pseudo-terms (roughly phrases). Document clustering and classification work surprisingly well on pseudo-terms; performance on a Switchboard task approaches a baseline using gold standard manual transcriptions.

Year	Venue	Keywords
2010	EMNLP	document clustering,classification work,automatic speech recognition,text classification,natural language processing,text processing,switchboard task,black box,considerable linguistic resource,considerable interest
Field	DocType	Volume
Transcription (linguistics),Computer science,Document clustering,Speech recognition,Natural language processing,Artificial intelligence,Black box,Text processing	Conference	D10-1
Citations	PageRank	References
29	1.54	17
Authors
4

Authors (4 rows)

Cited by (29 rows)

References (17 rows)

Name	Order	Citations	PageRank
Mark Dredze	1	3092	176.22
Lorena Álvarez	2	504	36.47
Glen Coppersmith	3	336	19.39
Ken Church	4	37	3.74

1