Abstract | ||
---|---|---|
XML information items collected from heterogeneous sources often carry similar semantics but are structured in
different ways. These structural variations make it hard to search effectively across multiple data sources.
Our approach aims at a flexible search and processing technique capable of extracting relevant information from a possibly
huge set of XML documents. ApproXML is a software tool that supports approximate pattern-based querying and can locate and extract
XML information while dealing flexibly with differences in structure and tag vocabulary.
Our method relies on representing XML documents as graphs, through a variant of the DOM model. The relevant information is
selected as follows [Dam00a]: first, the user provides an XML pattern, i.e. a partially specified subtree. Then, the XML documents of the target dataset are scanned;
XML fragments are located and sorted according to their similarity to the pattern.
|
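The pattern-then-rank workflow described in the abstract can be illustrated with a minimal Python sketch. It is not ApproXML's implementation: the abstract does not specify the similarity measure, so the scoring rule here (each pattern child may match a descendant at any depth, discounted by that depth) is an assumption made purely for illustration, as are the function names `descendants`, `match_score`, and `rank_fragments`. The sketch uses the standard-library `xml.etree.ElementTree` tree instead of a graph-based DOM variant, and it does not handle differences in tag vocabulary.

```python
import xml.etree.ElementTree as ET


def descendants(node, depth=1):
    """Yield (depth, element) for every descendant of node; children have depth 1."""
    for child in node:
        yield depth, child
        yield from descendants(child, depth + 1)


def match_score(pattern, node):
    """Hypothetical similarity in [0, 1] between a pattern subtree and a node.

    The root tags must be equal. Each pattern child is matched against the
    best-scoring descendant of the node, with the score divided by the depth
    at which that descendant was found (an assumed penalty for extra nesting).
    """
    if pattern.tag != node.tag:
        return 0.0
    pattern_children = list(pattern)
    if not pattern_children:
        return 1.0
    total = 0.0
    for pattern_child in pattern_children:
        best = 0.0
        for depth, candidate in descendants(node):
            best = max(best, match_score(pattern_child, candidate) / depth)
        total += best
    # The matched root counts as one unit, each pattern child as one more.
    return (1.0 + total) / (1 + len(pattern_children))


def rank_fragments(pattern_xml, documents):
    """Scan the documents and return (score, fragment) pairs, best match first."""
    pattern = ET.fromstring(pattern_xml)
    ranked = []
    for doc in documents:
        root = ET.fromstring(doc)
        for node in root.iter(pattern.tag):
            score = match_score(pattern, node)
            if score > 0:
                ranked.append((score, ET.tostring(node, encoding="unicode")))
    ranked.sort(key=lambda pair: pair[0], reverse=True)
    return ranked


if __name__ == "__main__":
    pattern = "<book><title/><author/></book>"
    docs = [
        "<lib><book><info><title>B</title></info>"
        "<authors><author>Y</author></authors></book></lib>",
        "<lib><book><title>A</title><author>X</author></book></lib>",
    ]
    for score, fragment in rank_fragments(pattern, docs):
        print(f"{score:.2f}  {fragment}")
```

In this toy dataset, the fragment whose `title` and `author` elements are direct children of `book` ranks above the one where they sit one level deeper, which mirrors the idea of sorting fragments by their similarity to a partially specified subtree.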
Year | DOI | Venue |
---|---|---|
2002 | 10.1007/3-540-45876-X_52 | Extending Database Technology |
Keywords | Field | DocType |
approxml tool demonstration, xml document | Software tool, Data collection, World Wide Web, Information retrieval, XML, Computer science, Tree structure, Vocabulary, Database, Semantics | Conference |
Volume | ISSN | ISBN |
2287 | 0302-9743 | 3-540-43324-4 |
Citations | PageRank | References |
3 | 0.63 | 2 |
Authors | ||
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ernesto Damiani | 1 | 3911 | 416.18 |
Nico Lavarini | 2 | 5 | 1.35 |
Stefania Marrara | 3 | 171 | 21.05 |
Barbara Oliboni | 4 | 234 | 25.01 |
Daniele Pasini | 5 | 3 | 0.63 |
Letizia Tanca | 6 | 2330 | 590.73 |
Giuseppe Viviani | 7 | 3 | 0.63 |