Title
Architecture of a grid-enabled Web search engine
Abstract
Search Engine for South-East Europe (SE4SEE) is a socio-cultural search engine running on the grid infrastructure. It offers a personalized, on-demand, country-specific, category-based Web search facility. The main goal of SE4SEE is to attack the page freshness problem by performing the search on the original pages residing on the Web, rather than on the previously fetched copies as done in the traditional search engines. SE4SEE also aims to obtain high download rates in Web crawling by making use of the geographically distributed nature of the grid. In this work, we present the architectural design issues and implementation details of this search engine. We conduct various experiments to illustrate performance results obtained on a grid infrastructure and justify the use of the search strategy employed in SE4SEE.
Year: 2007
DOI: 10.1016/j.ipm.2006.10.011
Venue: Inf. Process. Manage.
Keywords: web crawling, traditional search engine, socio-cultural search engine, search engine, search strategy, grid computing, text classification, grid-enabled web search engine, high download rate, south-east europe, category-based web search facility, grid infrastructure, architectural design issue, web search engine
Field: Web search engine, Data mining, Computer science, Search-oriented architecture, Web search query, Metasearch engine, World Wide Web, Information retrieval, Semantic search, Search analytics, Web crawler, Database, Spamdexing
DocType: Journal
Volume: 43
Issue: 3
Journal: Information Processing and Management
Citations: 11
PageRank: 0.60
References: 38
Authors (5)
Name                  Order  Citations  PageRank
B. Barla Cambazoglu   1      735        38.87
Evren Karaca          2      11         0.60
Tayfun Kucukyilmaz    3      56         4.10
Ata Turk              4      74         11.78
Cevdet Aykanat        5      996        84.08