Title
Carpé data: supporting serendipitous data integration in personal information management
Abstract
The information processing capabilities of humans enable them to opportunistically draw and integrate knowledge from nearly any information source. However, the integration of digital, structured data from diverse sources remains difficult, due to problems of heterogeneity that arise when data modelled separately are brought together. In this paper, we present an investigation of the feasibility of extending Personal Information Management (PIM) tools to support lightweight, user-driven mixing of previously un-integrated data, with the objective of allowing users to take advantage of the emerging ecosystems of structured data currently becoming available. In this study, we conducted an exploratory, sequential, mixed-method investigation, starting with two pre-studies of the data integration needs and challenges, respectively, of Web-based data sources. Observations from these pre-studies led to DataPalette, an interface that introduced simple co-reference and group multi-path-selection mechanisms for working with terminologically and structurally heterogeneous data. Our lab study showed that participants readily understood the new interaction mechanisms which were introduced. Participants made more carefully justified decisions, even while weighing a greater number of factors, moreover expending less effort, during subjective-choice tasks when using DataPalette, than with a control set-up.
Year
DOI
Venue
2013
10.1145/2470654.2481324
CHI
Keywords
Field
DocType
un-integrated data,information source,serendipitous data integration,data integration need,personal information management,heterogeneous data,mixed-method investigation,lab study,structured data,web-based data source,information processing capability
Data integration,Information integration,World Wide Web,Information processing,Personal information management,Computer science,Group information management,Human–computer interaction,Enterprise information integration,Data model
Conference
Citations 
PageRank 
References 
9
0.53
17
Authors
5
Name
Order
Citations
PageRank
Max Van Kleek154258.95
Daniel A. Smith220416.69
Heather S. Packer3325.20
Jim Skinner490.53
Nigel Shadbolt54273321.53