Abstract | ||
---|---|---|
Refreshing on-line information in time is a main task of incremental crawling, so it is very important to predict the change frequency of web pages. We model the change of page as a Poisson process in this paper. And based on it, we propose a parameter-adjustable algorithm after considering the unbiasedness, efficiency and consistency comprehensively. This algorithm can adjust the parameters in order to estimate the change frequency more effective. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/ICEE.2010.352 | ICEE |
Keywords | Field | DocType |
web page,main task,consistency comprehensively,change frequency,refreshing on-line information,poisson process,parameter-adjustable algorithm,parameter-adjustable estimating method,incremental crawling,web pages,time frequency analysis,information management,mathematical model,prediction algorithms,stochastic processes,history | Data mining,Information management,Crawling,Web page,Computer science,Stochastic process,Prediction algorithms,Time–frequency analysis,Poisson process | Conference |
Citations | PageRank | References |
0 | 0.34 | 3 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shuchen Tan | 1 | 0 | 0.34 |
Xuan Zhang | 2 | 110 | 18.58 |