Title
Unified structure and content search for personal information management systems
Abstract
User data stored in personal information systems is growing massively. Simultaneously, this data is increasingly distributed across multiple organizational domains such as email, music databases, and photo albums, some of which are structured automatically by applications. Powerful search tools are needed to help users locate data in these expanding yet fragmented data sets. In this paper, we present a novel fuzzy search approach that considers approximate matches to structure and content query conditions. Our framework uses unified data and query processing models so that structure conditions can be approximately matched by content and vice versa. Our models also unify external structure (e.g., directories) with internal structure (e.g., XML structure), supporting integrated queries matched to a single data domain. We propose indexes and algorithms for efficient query processing. We evaluate our approach using a real data set, showing that it can leverage structure information to significantly improve search accuracy, yet is robust to mistakes in query conditions.
Year
DOI
Venue
2011
10.1145/1951365.1951391
EDBT
Keywords
Field
DocType
xml structure,query path matching,unified structure,personal information management system,data structure,unified data,search tool,structure information,additional information,content information,content search,directory structure,structure condition,unify external structure,query processing,single data domain,content component,personal information system,directory information,personal information search,user data,structure and content search,internal structure,personal information,fragmented data set,file boundary,personal information management,computer science,information system
Query optimization,Web search query,Data mining,Query language,Personal information management,Information retrieval,Query expansion,Computer science,Sargable,Web query classification,Concept search,Database
Conference
Citations 
PageRank 
References 
2
0.36
53
Authors
3
Name
Order
Citations
PageRank
Wei Wang1152.27
Amélie Marian2128077.92
Thu D. Nguyen31518102.53