Abstract | ||
---|---|---|
The amount of web information is increasing rapidly with advanced wireless networks and emergence of diverse smart devices like i-Phone, i-Pad and so on. The information is continuously being produced and updated in anywhere and anytime by means of easy web platforms, and social networks. Now, it is becoming a hot issue how frequently updated web data has to be refreshed in data integration and retrieval domain. In this paper, we propose dynamic web-data crawling methods, which include sensitive checking of web site changes, and dynamic retrieving of web pages from target web sites. Furthermore, we implemented a java-based web crawling application and compared performance between conventional static approaches and our proposed dynamic ones. Our experiment results showed 59% performance benefits compared to static crawling method |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/ICOIN.2012.6164440 | ICOIN |
Keywords | Field | DocType |
updated web data,static crawling method,web page,java-based web,web site change,dynamic web-data,web crawler,dynamic web collection cycle,web information,dynamic retrieving,target web site,easy web platform,web pages,social network,databases,wireless network,internet,search engines,web crawling,data integrity,information retrieval,java,data integration,dynamic scheduling | Static web page,Web API,World Wide Web,Web page,Computer science,Data Web,Web modeling,Web navigation,Database,Distributed web crawling,Web server | Conference |
ISSN | ISBN | Citations |
1976-7684 | 978-1-4673-0251-7 | 0 |
PageRank | References | Authors |
0.34 | 0 | 5 |