Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection - Citegraph

Paper Info

Title
Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection

Abstract
In this paper, we deal with the problem of video concept detection to use the concept detection results toward a more effective concept-based video retrieval. The key novelties of this paper are as follows: 1) the use of spatio-temporal video slices (tomographs) in the same way that visual keyframes are typically used in video concept detection schemes. These spatio-temporal slices capture in a compact way motion patterns that are useful for detecting semantic concepts and are used for training a number of base detectors. The latter augment the set of keyframe-based base detectors that can be trained using different frame representations. 2) The introduction of a generic methodology, built upon a genetic algorithm, for controlling which subset of the available base detectors (consequently, which subset of the possible shot representations) should be combined for developing an optimal detector for each specific concept. This methodology is directly applicable to the learning of hundreds of diverse concepts, while diverging from the one-size-fits-all approach that is typically used in problems of this size. The proposed techniques are evaluated on the datasets of the 2011 and 2012 Semantic Indexing Task of TRECVID, each comprising several hundred hours of heterogeneous video clips and ground-truth annotations for tens of concepts that exhibit significant variation in terms of generality, complexity, and human participation. The experimental results manifest the merit of the proposed techniques.

Year	DOI	Venue
2014	10.1109/TCSVT.2014.2302554	Circuits and Systems for Video Technology, IEEE Transactions
Keywords	Field	DocType
video retrieval,video signal processing,base detector selection strategy,generic methodology,ground truth annotations,improving large-scale video concept detection,motion patterns,semantic indexing,spatio temporal slices,spatio temporal video slices,video clips,video retrieval,video tomographs,visual keyframes,Feature extraction,genetic algorithms,image sequence analysis,machine learning algorithms,video concept detection,video signal processing	Computer vision,Video processing,Block-matching algorithm,Video post-processing,Pattern recognition,Computer science,TRECVID,Motion compensation,Multiview Video Coding,Video tracking,Artificial intelligence,Video compression picture types	Journal
Volume	Issue	ISSN
24	7	1051-8215
Citations	PageRank	References
3	0.40	27
Authors
3

Authors (3 rows)

Cited by (3 rows)

References (27 rows)

Name	Order	Citations	PageRank
Panagiotis Sidiropoulos	1	118	8.77
V. Mezaris	2	293	16.26
Ioannis Kompatsiaris	3	1404	197.36

1