Title
Research on theme crawler based on Shark-Search and PageRank algorithm
Abstract
In theme crawlers, the Shark-Search algorithm gives insufficient consideration to Web pages at the global level. This paper uses the PageRank algorithm to compute each URL's authority to make up for this shortcoming, and proposes the Shark-PageRank algorithm, which measures the value of a URL using the anchor text, the context near the anchor text, and the authority value of the Web page. The experimental results show that the new algorithm improves the speed and accuracy of the query and has good stability and scalability.
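As a rough illustration of the idea in the abstract, the sketch below combines an anchor-text score, a context score, and a PageRank-based authority value into a single URL priority. It is not the paper's formulation: the term-overlap relevance measure, the linear combination, the weights, and all function names (`pagerank`, `term_overlap`, `url_score`) are assumptions made only for this example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


def term_overlap(text, topic_terms):
    """Fraction of topic terms found in the text (a stand-in relevance measure)."""
    if not topic_terms:
        return 0.0
    words = set(text.lower().split())
    return len(words & topic_terms) / len(topic_terms)


def url_score(anchor_text, context_text, authority, topic_terms,
              w_anchor=0.5, w_context=0.3, w_authority=0.2):
    """Hypothetical linear mix of the three signals; the weights are assumed."""
    return (w_anchor * term_overlap(anchor_text, topic_terms)
            + w_context * term_overlap(context_text, topic_terms)
            + w_authority * authority)


# Toy link graph and topic, both invented for the example.
links = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(links)
max_rank = max(ranks.values())
topic = {"hydrology", "data"}

on_topic = url_score("hydrology data portal", "download hydrology records",
                     ranks["c"] / max_rank, topic)
off_topic = url_score("contact us", "office opening hours",
                      ranks["b"] / max_rank, topic)
```

In a crawler, a score of this kind would typically order the frontier queue, so that URLs with relevant anchors on authoritative pages are fetched first. Dividing by the maximum rank keeps the authority term on the same 0 to 1 scale as the text scores.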
Year: 2016
DOI: 10.1109/CCIS.2016.7790267
Venue: 2016 4th International Conference on Cloud Computing and Intelligence Systems (CCIS)
Keywords: Theme crawler, Shark-Search algorithm, PageRank algorithm, Vertical
Field: Information retrieval, Web page, Computer science, PageRank algorithm, Anchor text, Focused crawler, Web crawler, Scalability
DocType: Conference
ISSN: 2376-5933
ISBN: 978-1-5090-1257-2
Citations: 1
PageRank: 0.44
References: 1
Authors: 3
Name | Order | Citations | PageRank
Lei Qiu | 1 | 44 | 10.23
Yuansheng Lou | 2 | 1 | 3.82
Min Chang | 3 | 1 | 0.78