Title
A paragraph-inserted word salad filtering algorithm
Abstract
Social spam is one type of spam which includes spamming the members of social websites by sending or posting unwanted ads or baiting them to visit particular websites. Word salad in turn is one type of social spam which aims at baiting people to visit particular websites, such as blogs, personal profiles, third-party applications built on social networking websites, etc. A word salad is created by inserting either words or paragraphs within a normal document, where the inserted words or paragraphs have no relevance to the document. The purpose of a word salad is to fool the search engines into assigning high ranks to the document. In this paper, we discuss an algorithm that filters (detects) paragraph-inserted word salads. The algorithm is based on the Singular Value Decomposition (SVD) method and, based on experiments, shows up to 81.3% accuracy.
Year
DOI
Venue
2012
10.1504/IJWGS.2012.046730
IJWGS
Keywords
Field
DocType
singular value decomposition,social networking web,particular web,social spam,baiting people,word salad,social web,paragraph-inserted word salad,normal document,high rank
Social spam,Social network,Computer science,Filter (signal processing),Algorithm,Word salad,Paragraph,Forum spam,Spamming
Journal
Volume
Issue
ISSN
8
1
1741-1106
Citations 
PageRank 
References 
1
0.40
6
Authors
2
Name
Order
Citations
PageRank
Ok-Ran Jeong118122.02
Won Kim234131702.29