Abstract | ||
---|---|---|
To identify deep web query interfaces among web forms accurately, this paper proposes a framework for automatic deep web discovery that comprises four procedures: collecting web pages, extracting forms and their features, filtering forms, and identifying forms. A heuristic rule-based k-nearest neighbor algorithm is introduced to identify the query interfaces. In the experiments, a number of query interfaces and non-query interfaces from different domains are selected for classification. The experimental results demonstrate that the presented algorithm can significantly improve the accuracy of deep web query interface discovery. |

Year | Venue | Keywords |
---|---|---|
2015 | 2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD) | Deep Web, query interface, k-nearest neighbor algorithm |

Field | DocType | Citations |
---|---|---|
Query optimization, Web search query, Query language, Heuristic, Information retrieval, Query expansion, Web page, Computer science, Sargable, Web query classification | Conference | 0 |

PageRank | References | Authors |
---|---|---|
0.34 | 11 | 2 |

Name | Order | Citations | PageRank |
---|---|---|---|
Bo Liu | 1 | 123 | 6.65 |
Zhenxing Li | 2 | 0 | 0.34 |
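
The abstract describes filtering forms with heuristic rules and then identifying query interfaces with a k-nearest neighbor classifier, but this record does not give the rules, the features, or the distance measure. The following is only a minimal sketch of that general idea under stated assumptions: the `FormFeatures` fields, the rules in `passes_heuristics`, the Euclidean distance, the value `k=3`, and the toy `TRAINING` set are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from dataclasses import dataclass
from math import sqrt


@dataclass
class FormFeatures:
    """Hypothetical features extracted from one HTML form (not from the paper)."""
    text_inputs: int
    selects: int
    checkboxes: int
    password_inputs: int
    has_submit: bool

    def vector(self):
        # Numeric vector used for the distance computation.
        return (float(self.text_inputs), float(self.selects), float(self.checkboxes))


def passes_heuristics(form):
    """Assumed filter rules: drop login-like forms and forms that cannot be submitted."""
    if form.password_inputs > 0:
        return False
    if not form.has_submit:
        return False
    return form.text_inputs + form.selects + form.checkboxes > 0


def euclidean(a, b):
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_label(query, training, k=3):
    """Majority vote among the k training forms closest to the query vector."""
    nearest = sorted(training, key=lambda item: euclidean(query, item[0].vector()))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def is_query_interface(form, training, k=3):
    """Apply the heuristic filter first, then classify the survivors with k-NN."""
    if not passes_heuristics(form):
        return False
    return knn_label(form.vector(), training, k) == "query"


# Toy labeled examples, invented purely for illustration.
TRAINING = [
    (FormFeatures(1, 3, 2, 0, True), "query"),
    (FormFeatures(2, 4, 0, 0, True), "query"),
    (FormFeatures(1, 2, 1, 0, True), "query"),
    (FormFeatures(1, 0, 0, 0, True), "non-query"),
    (FormFeatures(3, 0, 1, 0, True), "non-query"),
    (FormFeatures(2, 0, 0, 0, True), "non-query"),
]
```

In this sketch a form with a password field is rejected by the rules before any distance is computed, so the classifier only votes on forms that survive the filter. Whether the paper combines the two steps in this order or in some other way is not stated in the abstract.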