Abstract | ||
---|---|---|
User data stored in personal information systems is growing massively. Simultaneously, this data is increasingly distributed across multiple organizational domains such as email, music databases, and photo albums, some of which are structured automatically by applications. Powerful search tools are needed to help users locate data in these expanding yet fragmented data sets. In this paper, we present a novel fuzzy search approach that considers approximate matches to structure and content query conditions. Our framework uses unified data and query processing models so that structure conditions can be approximately matched by content and vice versa. Our models also unify external structure (e.g., directories) with internal structure (e.g., XML structure), supporting integrated queries matched to a single data domain. We propose indexes and algorithms for efficient query processing. We evaluate our approach using a real data set, showing that it can leverage structure information to significantly improve search accuracy, yet is robust to mistakes in query conditions. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1145/1951365.1951391 | EDBT |
Keywords | Field | DocType |
xml structure,query path matching,unified structure,personal information management system,data structure,unified data,search tool,structure information,additional information,content information,content search,directory structure,structure condition,unify external structure,query processing,single data domain,content component,personal information system,directory information,personal information search,user data,structure and content search,internal structure,personal information,fragmented data set,file boundary,personal information management,computer science,information system | Query optimization,Web search query,Data mining,Query language,Personal information management,Information retrieval,Query expansion,Computer science,Sargable,Web query classification,Concept search,Database | Conference |
Citations | PageRank | References |
2 | 0.36 | 53 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Wei Wang | 1 | 15 | 2.27 |
Amélie Marian | 2 | 1280 | 77.92 |
Thu D. Nguyen | 3 | 1518 | 102.53 |