Title
Grubber: Allowing End-Users to Develop XML-Based Wrappers for Web Data Sources
Abstract
There exist numerous online data sources on the Web. It is desirable to facilitate end-users to build XML-based wrappers from the data sources for further composition and reuse. This paper describes Grubber, a tool that allows end-users to develop XML-based wrappers from these data sources with just a few mouse clicks and keystrokes. An active learning algorithm was proposed and implemented to reduce end-users' effort. Experimental results on real-world sites show that the algorithm can achieve a high degree of effectiveness. Compared with other similar tools, Grubber includes a number of usability improvements to lower the barrier of usage and we believe it is suitable for mass end-users to build situational applications.
Year
DOI
Venue
2009
10.1007/978-3-642-00672-2_65
APWeb/WAIM
Keywords
Field
DocType
web data sources,real-world site,high degree,active learning algorithm,xml-based wrapper,mouse click,numerous online data source,mass end-users,data source,xml-based wrappers,similar tool,active learning,xml
World Wide Web,Active learning,XML,End user,Reuse,Computer science,Usability,Data extraction,Database
Conference
Volume
ISSN
Citations 
5446
0302-9743
5
PageRank 
References 
Authors
0.56
8
3
Name
Order
Citations
PageRank
Shaohua Yang1294.88
Guiling Wang283252.06
Yanbo Han350059.74