Abstract | ||
---|---|---|
We present a method for rapidly and robustly extracting audio excerpts without the overhead of speech recognition or speaker segmentation. An immediate application is to automatically augment keyframe-based video summaries with informative audio excerpts associated with the video segments represented by the keyframes. Short audio clips combined with keyframes comprise an extremely lightweight and Web-browsable interface for auditioning video or similar media, without using bandwidth-intensive streaming video or audio. |
Year | DOI | Venue |
---|---|---|
2002 | 10.1109/ICME.2002.1035604 | Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference |
Keywords | Field | DocType |
audio signal processing,feature extraction,image retrieval,image segmentation,video databases,video signal processing,Manga system,Web-browsable interface,audio clips,audio excerpts,automatically augment keyframe-based video summaries,automatically extracted audio excerpts,bandwidth-intensive audio streaming,bandwidth-intensive video streaming,keyframes,multimedia browsing,multimedia documents,video browsing,video media auditioning,video segmentation,video segments | Video browsing,Computer vision,Video processing,Video capture,Audio mining,Computer science,Non-linear editing system,Speech recognition,Video tracking,Smacker video,Artificial intelligence,Audio signal processing | Conference |
Volume | Citations | PageRank |
2 | 2 | 1.00 |
References | Authors | |
8 | 3 | |
Name | Order | Citations | PageRank |
---|---|---|---|
Jonathan Foote | 1 | 1625 | 176.16 |
Matthew Cooper | 2 | 798 | 76.01 |
Lynn Wilcox | 3 | 1330 | 180.16 |