Title
Temporal Compression Of Speech: An Evaluation
Abstract
Efficient browsing of speech recordings is problematic. The linear nature of speech, coupled with the lack of abstraction that the medium affords, means that listeners have to listen to long segments of a recording to locate points of interest. We explore temporal compression algorithms that attempt to reduce the amount of time users require to listen to speech recordings, while retaining the important content. This paper implements two main approaches to temporal compression: artificial speech rate alteration (speed-up) and unimportant segment removal (excision). We evaluate the effectiveness of these approaches by having listeners rate comprehension and listening effort for different types of temporal compression. For different compression levels, we compare performance of various implementations of speed-up and excision as well as techniques based on semantic features and acoustic features. Our results indicate that listeners prefer low compression levels, excision over speed-up, and algorithms based on semantic rather than acoustic features. Finally, listeners were negative about hybrid algorithms that used speed-up to indicate missing regions in an excised recording.
Year
DOI
Venue
2008
10.1109/TASL.2008.916527
IEEE Transactions on Audio, Speech & Language Processing
Keywords
Field
DocType
different compression level,temporal compression,speech recording,acoustic feature,different type,excised recording,low compression level,listeners rate comprehension,artificial speech rate alteration,temporal compression algorithm,user requirements,user interfaces,displays,information retrieval,compression algorithm,audio recording,data compression,compression algorithms,hybrid algorithm,speech coding,user interface,speech processing,automatic speech recognition,point of interest
Speech processing,Speech coding,Pattern recognition,Computer science,Speech recognition,Artificial intelligence,Semantic feature,Point of interest,Data compression,Sound recording and reproduction,Word processing,Text processing
Journal
Volume
Issue
ISSN
16
4
1558-7916
Citations 
PageRank 
References 
4
0.42
13
Authors
2
Name
Order
Citations
PageRank
Simon Tucker118713.18
Steve Whittaker25285665.26