Abstract | ||
---|---|---|
In this paper, we discuss mixed-media access, an information access paradigm for multimedia data in which the media type of a query may differ from that of the data. The types of media considered in this paper are speech, images of text, and full-length text. Some examples of metadata for mixed-media access are locations of keywords in speech and images, identification of speakers, locations of emphasized regions in speech, and locations of topic boundaries in text. Algorithms for automatically generating this metadata are described, including word spotting, speaker segmentation, emphatic speech detection, and subtopic boundary location. We illustrate queries composed of diverse media types in an example of access to recorded meetings, via speaker and keyword location. |
Year | DOI | Venue |
---|---|---|
1998 | 10.1145/190627.190646 | Multimedia Data Management |
Keywords | DocType | Volume |
information access paradigm,mixed-media access,subtopic boundary location,multimedia data,keyword location,speaker segmentation,media type,emphatic speech detection,diverse media type,full-length text,speech detection | Journal | 23 |
Issue | Citations | PageRank |
4 | 10 | 3.90 |
References | Authors | |
15 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Francine Chen | 1 | 1218 | 153.96 |
Marti A. Hearst | 2 | 7014 | 769.93 |
Julian Kupiec | 3 | 1061 | 381.10 |
Jan O. Pedersen | 4 | 6301 | 1177.07 |
Lynn Wilcox | 5 | 1330 | 180.16 |