Abstract | ||
---|---|---|
In a theme crawler, the Shark-Search algorithm does not sufficiently consider Web pages from a global perspective. This paper uses the PageRank algorithm to calculate a URL's authority to make up for this shortcoming and proposes the Shark-PageRank algorithm, which measures the value of a URL using the anchor text, the context near the anchor text, and the authority value of the Web page. The experimental results show that the new algorithm improves the speed and accuracy of the query and has good stability and scalability. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/CCIS.2016.7790267 | 2016 4th International Conference on Cloud Computing and Intelligence Systems (CCIS) |
Keywords | Field | DocType |
Theme crawler, Shark-Search algorithm, PageRank algorithm, Vertical | Information retrieval, Web page, Computer science, PageRank algorithm, Anchor text, Focused crawler, Web crawler, Scalability | Conference |
ISSN | ISBN | Citations |
2376-5933 | 978-1-5090-1257-2 | 1 |
PageRank | References | Authors |
0.44 | 1 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Lei Qiu | 1 | 44 | 10.23 |
Yuansheng Lou | 2 | 1 | 3.82 |
Min Chang | 3 | 1 | 0.78 |
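
The abstract describes scoring a URL from three signals: the anchor text, the context near the anchor text, and the PageRank authority of the Web page. The record does not give the paper's actual formula, so the following is only a minimal sketch under stated assumptions: relevance is approximated by cosine similarity over raw term counts, authority comes from a plain power-iteration PageRank scaled by the caller to the range 0 to 1, and the three signals are combined linearly. The function names and the `weights` values are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity between two texts using raw term counts."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}."""
    pages = set(graph) | {t for targets in graph.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = graph.get(page, [])
            if targets:
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


def url_score(topic, anchor_text, context_text, authority,
              weights=(0.4, 0.3, 0.3)):
    """Assumed linear blend of anchor relevance, context relevance, authority.

    `authority` is expected to be scaled to [0, 1] by the caller; the
    weights are illustrative placeholders, not values from the paper.
    """
    w_anchor, w_context, w_auth = weights
    return (w_anchor * cosine_similarity(topic, anchor_text)
            + w_context * cosine_similarity(topic, context_text)
            + w_auth * authority)
```

A crawler built along these lines would presumably keep its frontier in a priority queue ordered by `url_score`, so that links with topical anchor text on well-linked pages are fetched first.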