Title
Metadata for mixed-media access
Abstract
In this paper, we discuss mixed-media access, an information access paradigm for multimedia data in which the media type of a query may differ from that of the data. The types of media considered in this paper are speech, images of text, and full-length text. Some examples of metadata for mixed-media access are locations of keywords in speech and images, identification of speakers, locations of emphasized regions in speech, and locations of topic boundaries in text. Algorithms for automatically generating this metadata are described, including word spotting, speaker segmentation, emphatic speech detection, and subtopic boundary location. We illustrate queries composed of diverse media types in an example of access to recorded meetings, via speaker and keyword location.
Year
DOI
Venue
1998
10.1145/190627.190646
Multimedia Data Management
Keywords
DocType
Volume
information access paradigm,mixed-media access,subtopic boundary location,multimedia data,keyword location,speaker segmentation,media type,emphatic speech detection,diverse media type,full-length text,speech detection
Journal
23
Issue
Citations 
PageRank 
4
10
3.90
References 
Authors
15
5
Name
Order
Citations
PageRank
Francine Chen11218153.96
Marti A. Hearst27014769.93
Julian Kupiec31061381.10
Jan O. Pedersen463011177.07
Lynn Wilcox51330180.16