Title
Generic Visual Categorization Using Weak Geometry
Abstract
In the first part of this chapter we make a general presentation of the bag-of-keypatches approach to generic visual categorization (GVC). Our approach is inspired by the bag-of-words approach to text categorization. This method is able to identify the object content of natural images while generalizing across variations inherent to the object class. To obtain a visual vocabulary insensitive to viewpoint and illumination, rotation or affine invariant orientation histogram descriptors of image patches are vector quantized. Each image is then represented by one visual word occurrence histogram. To classify the images we use one-against-all SVM classifiers and choose the best ranked category. The main advantages of the method are that it is simple, computationally efficient and intrinsically invariant. We obtained excellent results as well for multi-class categorization as for object detection. In the second part we improve the categorizer by incorporating geometric information. Based on scale, orientation or closeness of the keypatches we can consider a large number of simple geometrical relationships, each of which can be considered as a simplistic classifier. We select from this multitude of classifiers (several millions in our case) and combine them effectively with the original classifier. Results are shown on a new challenging 10 class dataset.
Year
DOI
Venue
2006
10.1007/11957959_11
Lecture Notes in Computer Science
Keywords
Field
DocType
bag of words
Affine transformation,Histogram,Object detection,Categorization,Computer vision,Pattern recognition,Computer science,Support vector machine,Invariant (mathematics),Artificial intelligence,Mixture model,Visual Word
Conference
Volume
ISSN
Citations 
4170
0302-9743
1
PageRank 
References 
Authors
0.34
20
4
Name
Order
Citations
PageRank
Gabriela Csurka197285.08
C.R. Dance293662.87
Florent Perronnin35448291.48
Jutta Willamowski45212.58