Abstract | ||
---|---|---|
Growing popularity of Web 2.0 systems created huge set of publicly available data, which is continuously expanded by users of the Internet. The anonymity of publications in Web systems encourages some users to publish false or illegal statements. Tools for identifying portal users, who publish such posts could result in higher quality of information and could be useful for law enforcement services. In this paper a method for finding similar Internet identities is introduced. Detected similarities can be used for finding several accounts of the same person. The method is based on calculating various measures characterizing forums users. It uses Web crawling system to collect data from forums. A prototype system for finding similar users is described and tests results are presented. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1007/978-3-642-30721-8_36 | Communications in Computer and Information Science |
Keywords | Field | DocType |
Web crawling,identity analysis,text processing | Publication,Computer vision,World Wide Web,Computer science,Popularity,Artificial intelligence,Anonymity,Law enforcement,Web crawler,The Internet,Text processing,Information quality | Conference |
Volume | ISSN | Citations |
287 | 1865-0929 | 3 |
PageRank | References | Authors |
0.55 | 9 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
krzysztof wilaszek | 1 | 3 | 0.55 |
tomasz wojcik | 2 | 3 | 0.55 |
Andrzej Opalinski | 3 | 10 | 2.88 |
Wojciech Turek | 4 | 84 | 20.02 |