Title
Towards spreadsheet integration using entity identification driven by a spatial-temporal model.
Abstract
Crucial to answering economic, social and political questions facing our society, data tends to be diverse and distributed through sites across the Internet. The creation of tools to integrate and analyze it is of paramount interest. Yet the automation of these processes continues to be a great challenge. Our work rests on the observation that a high number of public data sources for domains ranging from economic to demographic, although of complex structure, often share key similarities. One of these similarities is the presence of time and location, two core attribute types. Our proposed Data Integration through Object Modelling framework or DIOM tackles this problem of automating data integration from a variety of public websites by abstracting key features of multi-dimensional tables and interpreting them in the context of a spatial and temporal model. Our preliminary experimental results on real world data sets from heterogeneous public data sources show accuracy of over 93% in DIOM's entity identification.
Year
DOI
Venue
2016
10.1145/2851613.2851924
SAC 2016: Symposium on Applied Computing Pisa Italy April, 2016
Field
DocType
ISBN
Data integration,Data science,Data set,Computer science,Object model,Automation,The Internet
Conference
978-1-4503-3739-7
Citations 
PageRank 
References 
1
0.39
6
Authors
3
Name
Order
Citations
PageRank
Ramoza Ahsan163.53
Rodica Neamtu294.26
Elke A. Rundensteiner34076700.65