Title
Definition and Formalization of Entity Resolution Functions for Everyday Information Integration
Abstract
Data integration on a human-manageable scale, by users without database expertise, is a more common activity than integration of large databases. Users often gather fine-grained data and organize it in an entity-centric way, developing tables of information regarding real-world objects, ideas, or people. Often, they do this by copying and pasting bits of data from e-mails, databases, or text files into a spreadsheet. During this process, users evolve their notions of entities and attributes. They combine sets of entities or attributes, split them again, update attribute values, and retract those updates. These functions are neither well supported by current tools, nor formally well understood. Our research seeks to capture and make explicit the data integration decisions made during these activities. In this paper, we formally define entity resolution and de-resolution, and show that these functions behave predictably and intuitively in the presence of attribute value updates.
Year
DOI
Venue
2008
10.1007/978-3-540-88594-8_7
SDKB
Keywords
Field
DocType
entity resolution,common activity,data integration decision,large databases,update attribute value,database expertise,current tool,data integration,attribute value updates,everyday information integration,entity resolution functions,fine-grained data,data integrity,information integration
Data integration,Information integration,Name resolution,Information retrieval,Computer science,Copying,Relational algebra
Conference
Volume
ISSN
Citations 
4925
0302-9743
2
PageRank 
References 
Authors
0.41
10
2
Name
Order
Citations
PageRank
David W. Archer1505.28
Lois M. L. Delcambre2992420.78