Abstract | ||
---|---|---|
We present a novel technique for time-compressing speech, semantic compression, which uses an ASR transcript to determine which elements of the speech are presented. We carried out an exploratory user study comparing semantic compression to other novel types of time-compression techniques. We found that users feel they have a greater understanding of recordings compressed using semantic techniques than those compressed using acoustic-based techniques. An approach of using high playback speeds to indicate the location of 'insignificant' speech is not favoured by listeners, who prefer to have such segments removed from recordings. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1109/ICASSP.2005.1415154 | 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING |
Keywords | Field | DocType |
nonlinear distortion,displays,semantic compression,exploratory study,frequency domain analysis,speech processing,automatic speech recognition,data compression,audio recording | Frequency domain,Speech processing,Pattern recognition,Computer science,Speech recognition,Artificial intelligence,Data compression,Nonlinear distortion,Sound recording and reproduction,Exploratory research,Semantic compression | Conference |
ISSN | Citations | PageRank |
1520-6149 | 8 | 0.58 |
References | Authors | |
6 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Simon Tucker | 1 | 187 | 13.18 |
Steve Whittaker | 2 | 5285 | 665.26 |