Title
An Approach For Focused Crawler To Harvest Digital Academic Documents In Online Digital Libraries
Abstract
With the rapid growth of digital information and user needs, it has become imperative to quickly retrieve relevant domain- or topic-specific documents in response to a user query. A focused crawler plays a vital role in digital libraries: it crawls the web so that researchers can easily explore a domain-specific list of search results and find the desired content for a query. This article proposes a focused crawler for online digital library search engines that uses the meta-data of the query to retrieve the corresponding document or other relevant but missing information (e.g., paid publications from ACM, IEEE, etc.). Different query strategies are built from the meta-data and submitted to different search engines in order to find the relevant information that is missing. The results returned by these search engines are filtered and then used for further crawling of the Web.
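The abstract does not spell out the query strategies or the filtering rule, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the meta-data is a dictionary with `title`, `authors`, and `year` keys, that search results arrive as dictionaries with `title` and `url` keys, and that a result is relevant when its title is sufficiently similar to the target title. The function names, the `filetype:pdf` operator (supported by some search engines only), and the 0.8 threshold are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher


def build_queries(meta):
    """Build several query strings from document meta-data (hypothetical strategies)."""
    title = meta["title"]
    authors = meta.get("authors", [])
    year = meta.get("year")

    queries = [f'"{title}"']  # strategy 1: exact title
    if authors:
        queries.append(f'"{title}" {authors[0]}')  # strategy 2: title plus first author
    if year:
        queries.append(f'"{title}" {year}')  # strategy 3: title plus year
    # strategy 4: title restricted to PDF files (assumes the engine supports this operator)
    queries.append(f'"{title}" filetype:pdf')
    return queries


def title_similarity(a, b):
    """Case-insensitive similarity ratio between two titles, in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def filter_results(results, meta, threshold=0.8):
    """Keep URLs whose result title is close to the target title, skipping duplicate URLs."""
    seen = set()
    kept = []
    for result in results:
        url = result["url"]
        if url in seen:
            continue
        if title_similarity(result["title"], meta["title"]) >= threshold:
            seen.add(url)
            kept.append(url)
    return kept
```

The URLs returned by `filter_results` could then serve as seed URLs for the next crawling step. In practice each query would be sent to a real search engine API, which is deliberately left out here.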
Year
2019
DOI
10.4018/IJIRR.2019070103
Venue
INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH
Keywords
Digital library, Focused Crawler, Search engine, Topic Specific, WWW
DocType
Journal
Volume
9
Issue
3
ISSN
2155-6377
Citations
0
PageRank
0.34
References
0
Authors
3
Name            Order  Citations  PageRank
Sumita Gupta    1      0          0.34
Neelam Duhan    2      0          0.34
Poonam Bansal   3      0          2.03