Title
Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation
Abstract
To support effective multimedia information retrieval, video annotation has become an important topic in video content analysis. Existing video annotation methods focus on either the analysis of low-level features or simple semantic concepts, and they cannot reduce the gap between low-level features and high-level concepts. In this paper, we propose an innovative method for semantic video annotation through integrated mining of visual features, speech features, and frequent semantic patterns existing in the video. The proposed method consists of two main phases: 1) construction of four kinds of predictive annotation models, namely speech-association, visual-association, visual-sequential, and statistical models, from annotated videos; and 2) fusion of these models to annotate un-annotated videos automatically. The main advantage of the proposed method is that visual features, speech features, and semantic patterns are all considered simultaneously. Moreover, the use of high-level rules can effectively compensate for the limitations of statistics-based methods in identifying complex and broad keywords in video annotation. Through empirical evaluation on NIST TRECVID video datasets, the proposed approach is shown to enhance annotation performance substantially in terms of precision, recall, and F-measure.
Year: 2008
DOI: 10.1109/TMM.2007.911832
Venue: IEEE Transactions on Multimedia
Keywords: Predictive models, Speech analysis, Information retrieval, Content based retrieval, Layout, Information analysis, Pattern analysis, NIST, Data mining, Image retrieval
Field: Speech processing, Annotation, Information retrieval, Pattern recognition, Computer science, TRECVID, Image retrieval, Multimedia information retrieval, Information extraction, Video content analysis, Artificial intelligence, Semantics
DocType: Journal
Volume: 10
Issue: 2
ISSN: 1520-9210
Citations: 29
PageRank: 1.09
References: 23
Authors: 4
Name              Order  Citations  PageRank
Vincent S. Tseng  1      2923       161.33
Ja-Hwung Su       2      329        24.53
Jhih-Hong Huang   3      32         1.51
Chih-Jen Chen     4      35         2.40