Title
Arbitrary Category Classification of Websites Based on Image Content
Abstract
This paper presents a comprehensive methodology for general large-scale image-based classification tasks. It addresses the Big Data challenge in arbitrary image classification and more specifically, filtering of millions of websites with abstract target classes and high levels of label noise. Our approach uses local image features and their color descriptors to build image representations with the help of a modified k-NN algorithm. Image representations are refined into image and website class predictions by a two-stage classifier method suitable for a very large-scale real dataset. A modification of an Extreme Learning Machine is found to be a suitable classifier technique. The methodology is robust to noise and can learn abstract target categories; website classification accuracy surpasses 97% for the most important categories considered in this study.
Year
DOI
Venue
2015
10.1109/MCI.2015.2405317
IEEE Comp. Int. Mag.
Keywords
Field
DocType
learning artificial intelligence,classification,image classification,noise measurement,big data
Automatic image annotation,Pattern recognition,Feature detection (computer vision),Extreme learning machine,Image texture,Feature (computer vision),Computer science,Binary image,Artificial intelligence,Contextual image classification,Digital image processing,Machine learning
Journal
Volume
Issue
ISSN
10
2
1556-603X
Citations 
PageRank 
References 
2
0.37
0
Authors
6
Name
Order
Citations
PageRank
Anton Akusok114310.72
Yoan Miche2105454.56
Juha Karhunen3485.10
Kaj-Mikael Björk420.37
Rui Nian515912.18
Amaury Lendasse61876126.03