Title
Internet Identity Analysis and Similarities Detection
Abstract
Growing popularity of Web 2.0 systems created huge set of publicly available data, which is continuously expanded by users of the Internet. The anonymity of publications in Web systems encourages some users to publish false or illegal statements. Tools for identifying portal users, who publish such posts could result in higher quality of information and could be useful for law enforcement services. In this paper a method for finding similar Internet identities is introduced. Detected similarities can be used for finding several accounts of the same person. The method is based on calculating various measures characterizing forums users. It uses Web crawling system to collect data from forums. A prototype system for finding similar users is described and tests results are presented.
Year
DOI
Venue
2012
10.1007/978-3-642-30721-8_36
Communications in Computer and Information Science
Keywords
Field
DocType
Web crawling,identity analysis,text processing
Publication,Computer vision,World Wide Web,Computer science,Popularity,Artificial intelligence,Anonymity,Law enforcement,Web crawler,The Internet,Text processing,Information quality
Conference
Volume
ISSN
Citations 
287
1865-0929
3
PageRank 
References 
Authors
0.55
9
4
Name
Order
Citations
PageRank
krzysztof wilaszek130.55
tomasz wojcik230.55
Andrzej Opalinski3102.88
Wojciech Turek48420.02