Title
Multimodal people ID for a multimedia meeting browser
Abstract
A meeting browser is a system that allows users to review a multimedia meeting record from a variety of indexing methods. Identification of meeting participants is essential for creating such a multimedia meeting record. Moreover, knowing who is speaking can enhance the performance of speech recognition and indexing meeting transcription. In this paper, we present an approach that identifies meeting participants by fusing multimodal inputs. We use face ID, speaker ID, color appearance ID, and sound source directional ID to identify and track meeting participants. After describing the different modules in detail, we will discuss a framework for combining the information sources. Integration of the multimodal people ID into the multimedia meeting browser is in its preliminary stage.
Year
1999
DOI
10.1145/319463.319484
Venue
ACM Multimedia (1)
Keywords
indexing meeting transcription,multimodal people,speaker id,meeting participant,track meeting,multimedia meeting record,color appearance id,sound source directional id,face id,multimedia meeting browser,meeting browser,multimedia,data fusion,speech recognition,multimodal,indexation
Field
World Wide Web,Computer science,Search engine indexing,Sensor fusion,Multimedia
DocType
Conference
ISBN
1-58113-151-8
Citations
31
PageRank
3.29
References
14
Authors
6
Name, Order, Citations, PageRank
Jie Yang, 1, 2856, 270.24
Xiaojin Zhu, 2, 3586, 222.74
Ralph Gross, 3, 281, 14.80
John Kominek, 4, 266, 22.37
Yue Pan, 5, 43, 4.10
Alex Waibel, 6, 6343, 1980.68