Title
Novel Features for Web Spam Detection
Abstract
Recent research on web spam detection has shown promising results, and many new and efficient detection algorithms have been developed. While most research focuses on developing algorithms, our investigation shows that the features used in the algorithms are in fact very important, and different features can lead to very different results. This paper investigates three types of web spam, content-based, link-based and cloaking, and introduces new features for identifying the three types of spam. Our experimental results show that the introduction of new features significantly improves the detection performance.
Year
DOI
Venue
2016
10.1109/ICTAI.2016.0096
2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI)
Keywords
Field
DocType
Web Spam Detection,Content-based Spam Features,Link-based Spam Features,Cloaking Spam Features
Cloaking,Search engine,Web page,Information retrieval,Computer science,Feature extraction,Artificial intelligence,Machine learning,Benchmark (computing),Spamdexing
Conference
ISSN
ISBN
Citations 
1082-3409
978-1-5090-4460-3
0
PageRank 
References 
Authors
0.34
9
3
Name
Order
Citations
PageRank
Santosh Kumar171.86
Xiaoying Gao222032.95
Ian S. Welch312018.53