Title
A Graphical Model for Audiovisual Object Tracking
Abstract
We present a new approach to modeling and processing multimedia data. This approach is based on graphical models that combine audio and video variables. We demonstrate it by developing a new algorithm for tracking a moving object in a cluttered, noisy scene using two microphones and a camera. Our model uses unobserved variables to describe the data in terms of the process that generates them. It is therefore able to capture and exploit the statistical structure of the audio and video data separately, as well as their mutual dependencies. Model parameters are learned from data via an EM algorithm, and automatic calibration is performed as part of this procedure. Tracking is done by Bayesian inference of the object location from data. We demonstrate successful performance on multimedia clips captured in real world scenarios using off-the-shelf equipment.
Year
DOI
Venue
2003
10.1109/TPAMI.2003.1206512
IEEE Trans. Pattern Anal. Mach. Intell.
Keywords
Field
DocType
em algorithm,graphical models,bayesian methods,computer graphics,graphical model,multimedia,variational method,tracking,video,multimodal,speech,background noise,bayesian inference,vision,object tracking,expectation maximization,expectation maximization algorithm,pattern recognition,calibration,probability,indexing terms
Probabilistic inference,Computer vision,Bayesian inference,Expectation–maximization algorithm,Computer science,Exploit,Video tracking,Artificial intelligence,Graphical model,Computer graphics,Calibration
Journal
Volume
Issue
ISSN
25
7
0162-8828
Citations 
PageRank 
References 
61
3.03
20
Authors
3
Name
Order
Citations
PageRank
Matthew J. Beal160064.31
Nebojsa Jojic21397165.68
Hagai Attias385696.36