Abstract | ||
---|---|---|
Semantic event detection is an active research field of video mining in recent years. One of the challenging problems is how to effectively model temporal and multi-modality characteristics of video. In this paper, we employ Conditional Random Fields (CRFs) to fuse temporal multi-modality cues for event detection. CRFs are undirected probabilistic models designed for segmenting and labeling sequence data. Compared with traditional SVM and Hidden Markov Models (HMMs), CRFs based event detection offers several particular advantages including the abilities to relax strong independence assumptions in the state transition and avoid a fundamental limitation of directed graphical models. To detect event, we use a three-level framework based on multi-modality fusion and mid-level keywords. The first level extracts audiovisual features, the mid-level detects semantic keywords, and the high-level infers semantic events from multiple keyword sequences. The experimental results from soccer highlights detection demonstrate that CRFs achieves better performance particularly in slice level measure. |
Year | DOI | Venue |
---|---|---|
2006 | 10.1109/CVPRW.2006.190 | CVPR Workshops |
Field | DocType | Volume |
Conditional random field,Pattern recognition,Computer science,Support vector machine,Artificial intelligence,Decoding methods,Graphical model,Probabilistic logic,Hidden Markov model,Machine learning,CRFS,Bayesian probability | Conference | 2006 |
Issue | ISSN | ISBN |
null | 2160-7508 | 0-7695-2646-2 |
Citations | PageRank | References |
27 | 0.90 | 10 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tao Wang | 1 | 292 | 14.48 |
Jianguo Li | 2 | 377 | 35.38 |
Qian Diao | 3 | 59 | 4.53 |
Wei Hu | 4 | 182 | 14.17 |
Yimin Zhang | 5 | 1536 | 130.17 |
Carole Dulong | 6 | 100 | 6.29 |