Abstract |
---|
In this paper we describe our early exploration of automatic recognition of conversational speech in meetings for use in automatic summarizers and browsers to produce meeting minutes effectively and rapidly. To achieve optimal performance, we started from two different baseline English recognizers, adapted them to meeting conditions, and tested the resulting performance. The data were found to be highly disfluent (conversational human-to-human speech), noisy (due to lapel microphones and environment), and overlapped with background noise, resulting in error rates comparable so far to those on the CallHome conversational database (40-50% WER). A meeting browser is presented that allows the user to search and skim through highlights from a meeting efficiently despite the recognition errors. |
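
The abstract reports results as word error rate (WER). As a minimal sketch of how that metric is conventionally computed (word-level edit distance divided by the number of reference words), the following may help; the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration and are not taken from the paper or its scoring tools.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Illustrative sketch only: tokens are split on whitespace, with no text
    normalization of the kind a real scoring pipeline would apply.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one insertion against a five-word reference.
    print(word_error_rate("the meeting starts at noon",
                          "the meeting start at at noon"))
```

Because insertions count as errors, WER computed this way can exceed 100% when the hypothesis is much longer than the reference.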
Year | DOI | Venue |
---|---|---|
1998 | 10.1109/ICASSP.1998.675416 | PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 |
Keywords | Field | DocType |
automatic summarization,background noise,error rate,performance,human speech,databases,user interfaces,automatic speech recognition,speech recognition,testing | Speech processing,Background noise,Speech analytics,Computer science,Speech recognition,Artificial intelligence,Natural language processing,User interface,Speech technology | Conference |
ISSN | Citations | PageRank |
1520-6149 | 9 | 4.15 |
References | Authors |
5 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hua Yu | 1 | 9 | 4.15 |
Cortis Clark | 2 | 9 | 4.15 |
Robert Malkin | 3 | 85 | 10.30 |
Alex Waibel | 4 | 6343 | 1980.68 |