Abstract | ||
---|---|---|
Usually, Web applications such as deep Web crawlers, metasearch engines, and other Web mining systems need to extract information displayed in the form of result records on response pages returned by search engines in response to submitted queries. Extracting such records is challenging as search engines are heterogeneous in displaying their records. In addition, response pages returned by many search engines include other noisy content such as advertisements, suggestion links, etc., which make the extraction task even more complicated. In this paper, we propose a highly effective and efficient algorithm for automatically mining result records from search engine response pages. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1109/ICDM.2005.30 | ICDM |
Keywords | Field | DocType |
result record,suggestion link,extraction task,response page,automatically mining result,search engine,search engine response page,search engine response pages,efficient algorithm,web mining system,metasearch engine,deep web crawler,deep web,search engines,internet,web mining,data mining | Web search engine,Search aggregator,Metasearch engine,Web mining,Information retrieval,Computer science,Search engine indexing,Search analytics,Web crawler,Database,Spamdexing | Conference |
ISBN | Citations | PageRank |
0-7695-2278-5 | 0 | 0.34 |
References | Authors | |
9 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Dheerendranath Mundluru | 1 | 12 | 2.16 |
Jayasimha Reddy Katukuri | 2 | 0 | 0.34 |
Saygin Celebi | 3 | 0 | 0.34 |