Title
Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection
Abstract
In this paper, we deal with the problem of video concept detection to use the concept detection results toward a more effective concept-based video retrieval. The key novelties of this paper are as follows: 1) the use of spatio-temporal video slices (tomographs) in the same way that visual keyframes are typically used in video concept detection schemes. These spatio-temporal slices capture in a compact way motion patterns that are useful for detecting semantic concepts and are used for training a number of base detectors. The latter augment the set of keyframe-based base detectors that can be trained using different frame representations. 2) The introduction of a generic methodology, built upon a genetic algorithm, for controlling which subset of the available base detectors (consequently, which subset of the possible shot representations) should be combined for developing an optimal detector for each specific concept. This methodology is directly applicable to the learning of hundreds of diverse concepts, while diverging from the one-size-fits-all approach that is typically used in problems of this size. The proposed techniques are evaluated on the datasets of the 2011 and 2012 Semantic Indexing Task of TRECVID, each comprising several hundred hours of heterogeneous video clips and ground-truth annotations for tens of concepts that exhibit significant variation in terms of generality, complexity, and human participation. The experimental results manifest the merit of the proposed techniques.
Year
DOI
Venue
2014
10.1109/TCSVT.2014.2302554
Circuits and Systems for Video Technology, IEEE Transactions
Keywords
Field
DocType
video retrieval,video signal processing,base detector selection strategy,generic methodology,ground truth annotations,improving large-scale video concept detection,motion patterns,semantic indexing,spatio temporal slices,spatio temporal video slices,video clips,video retrieval,video tomographs,visual keyframes,Feature extraction,genetic algorithms,image sequence analysis,machine learning algorithms,video concept detection,video signal processing
Computer vision,Video processing,Block-matching algorithm,Video post-processing,Pattern recognition,Computer science,TRECVID,Motion compensation,Multiview Video Coding,Video tracking,Artificial intelligence,Video compression picture types
Journal
Volume
Issue
ISSN
24
7
1051-8215
Citations 
PageRank 
References 
3
0.40
27
Authors
3
Name
Order
Citations
PageRank
Panagiotis Sidiropoulos11188.77
V. Mezaris229316.26
Ioannis Kompatsiaris31404197.36