Title |
---|
Definition and Formalization of Entity Resolution Functions for Everyday Information Integration |
Abstract |
---|
Data integration on a human-manageable scale, by users without database expertise, is a more common activity than integration of large databases. Users often gather fine-grained data and organize it in an entity-centric way, developing tables of information regarding real-world objects, ideas, or people. Often, they do this by copying and pasting bits of data from e-mails, databases, or text files into a spreadsheet. During this process, users evolve their notions of entities and attributes. They combine sets of entities or attributes, split them again, update attribute values, and retract those updates. These functions are neither well supported by current tools, nor formally well understood. Our research seeks to capture and make explicit the data integration decisions made during these activities. In this paper, we formally define entity resolution and de-resolution, and show that these functions behave predictably and intuitively in the presence of attribute value updates. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1007/978-3-540-88594-8_7 | SDKB |
Keywords | Field | DocType |
entity resolution, common activity, data integration decision, large databases, update attribute value, database expertise, current tool, data integration, attribute value updates, everyday information integration, entity resolution functions, fine-grained data, data integrity, information integration | Data integration, Information integration, Name resolution, Information retrieval, Computer science, Copying, Relational algebra | Conference |
Volume | ISSN | Citations |
4925 | 0302-9743 | 2 |
PageRank | References | Authors |
0.41 | 10 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
David W. Archer | 1 | 50 | 5.28 |
Lois M. L. Delcambre | 2 | 992 | 420.78 |