Title
Transforming Web Tables to a Relational Database
Abstract
HTML tables represent a significant fraction of web data. The often complex headers of such tables are determined accurately using their indexing property. Isolated headers are factored to extract category hierarchies. Web tables are then transformed into a canonical form and imported into a relational database. The proposed processing allows for the formulation of arbitrary SQL queries over the collection of induced relational tables.
Year
DOI
Venue
2014
10.1109/ICPR.2014.479
ICPR
Keywords
Field
DocType
table segmentation, wang categories, header paths, relational table sql queries
SQL,Relational calculus,Conjunctive query,Database model,Relational database,Information retrieval,Computer science,Data definition language,Query by Example,Relational model
Conference
ISSN
Citations 
PageRank 
1051-4651
4
0.42
References 
Authors
15
3
Name
Order
Citations
PageRank
David W. Embley11915480.08
Sharad C. Seth267193.61
George Nagy3913105.94