Abstract |
---|
Keyword searching, while very successful in narrowing the contents of the Web down to the pertinent subset of information, has two primary drawbacks. First, the accuracy of the search is closely coupled with the choice of keywords. Second, keywords are limited in their expressiveness. In particular, they fail to adequately capture the "contextual information" implicit in most user searches. In this paper we present an approach that efficiently addresses these drawbacks of keyword searching over XML documents. In particular, we present SUSAX, a system for approximate contextual querying over XML documents wherein queries are represented as simple XPaths. A key contribution of our work is the novel algorithm used to match the XPath-like query with similar paths in the repository. The algorithm is based on the sequence alignment algorithms prevalent in the life sciences domain for discovering the similarity between genome and protein sequences. In this paper, we show an adaptation of the sequence alignment algorithm for discovering and cataloging the similarity between |

Year | DOI | Venue |
---|---|---|
2006 | 10.1109/ICDEW.2006.141 | ICDE Workshops |

Keywords | Field | DocType |
---|---|---|
sequence alignment algorithm,context-specific searching,sequence alignment,life sciences domain,xpathlike query,protein sequence,novel algorithm,key contribution,contextual information,xml document,xml documents,approximate contextual,search engines,genomics,data engineering,bioinformatics,xml,proteins | Data mining,Efficient XML Interchange,Information retrieval,XML validation,Computer science,Document Structure Description,XML database,XML schema,Database,XML Schema Editor,XML Catalog,XML Signature | Conference |

ISBN | Citations | PageRank |
---|---|---|
0-7695-2571-7 | 0 | 0.34 |

References | Authors |
---|---|
13 | 2 |

Name | Order | Citations | PageRank |
---|---|---|---|
Krunal Patel | 1 | 0 | 0.34 |
Kajal T. Claypool | 2 | 580 | 64.35 |
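
The abstract describes matching an XPath-like query against similar paths in a repository by adapting sequence alignment. The abstract does not give the SUSAX algorithm itself, so the following is only a minimal sketch of the general idea, assuming a textbook global alignment (Needleman-Wunsch style) over the element steps of simple paths. The function names `split_path`, `alignment_score`, and `rank_paths`, as well as the scoring values, are hypothetical and chosen for illustration.

```python
def split_path(path):
    """Split a simple path such as /book/author/name into its element steps."""
    return [step for step in path.split("/") if step]


def alignment_score(query, candidate, match=2, mismatch=-1, gap=-1):
    """Global alignment score between two simple paths.

    Assumption: steps are compared by exact name equality, and the scoring
    values are illustrative defaults, not values taken from the paper.
    """
    q, c = split_path(query), split_path(candidate)
    rows, cols = len(q) + 1, len(c) + 1
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against an empty path costs one gap per step.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            same = q[i - 1] == c[j - 1]
            diagonal = score[i - 1][j - 1] + (match if same else mismatch)
            skip_query_step = score[i - 1][j] + gap
            skip_candidate_step = score[i][j - 1] + gap
            score[i][j] = max(diagonal, skip_query_step, skip_candidate_step)

    return score[-1][-1]


def rank_paths(query, repository):
    """Order repository paths from most to least similar to the query."""
    return sorted(repository, key=lambda p: alignment_score(query, p), reverse=True)
```

With these illustrative scores, `alignment_score("/book/author/name", "/book/name")` evaluates to 3: two matching steps at 2 each and one gap at -1. A real system would likely weight gaps and mismatches by the structure or semantics of the XML elements rather than use fixed constants.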