Abstract | ||
---|---|---|
There exist numerous online data sources on the Web. It is desirable to facilitate end-users to build XML-based wrappers from the data sources for further composition and reuse. This paper describes Grubber, a tool that allows end-users to develop XML-based wrappers from these data sources with just a few mouse clicks and keystrokes. An active learning algorithm was proposed and implemented to reduce end-users' effort. Experimental results on real-world sites show that the algorithm can achieve a high degree of effectiveness. Compared with other similar tools, Grubber includes a number of usability improvements to lower the barrier of usage and we believe it is suitable for mass end-users to build situational applications. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1007/978-3-642-00672-2_65 | APWeb/WAIM |
Keywords | Field | DocType |
web data sources,real-world site,high degree,active learning algorithm,xml-based wrapper,mouse click,numerous online data source,mass end-users,data source,xml-based wrappers,similar tool,active learning,xml | World Wide Web,Active learning,XML,End user,Reuse,Computer science,Usability,Data extraction,Database | Conference |
Volume | ISSN | Citations |
5446 | 0302-9743 | 5 |
PageRank | References | Authors |
0.56 | 8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shaohua Yang | 1 | 29 | 4.88 |
Guiling Wang | 2 | 832 | 52.06 |
Yanbo Han | 3 | 500 | 59.74 |