Title
Extracting Room Prices From Web Tables - An Ontology-Aware Approach
Abstract
The growing amount of semi-structured and unstructured data on tourism Web sites with heterogeneous designs requires information extraction (IE) mechanisms, to create, for instance, tourism portals. In order to build semantic eTourism environments, the acquisition of room prices is of particular interest. Room prices and related information often appear in tabular structures, which still challenge Web information extraction techniques. In this paper, we begin by identifying various price table patterns which are characterized by the position of a number of features that determine a room price. We then describe an extended ontology model for tourism prices. Finally, we present TAINEX, a plug-in for functional and structural analysis and data interpretation of price tables, which extends the existing prototype TourIE, a rule-/ontology-based information extraction system for Web sites with heterogeneous designs.
Year
2010
DOI
10.1007/978-3-211-99407-8_19
Venue
INFORMATION AND COMMUNICATION TECHNOLOGIES IN TOURISM 2010
Keywords
Ontology-based Information Extraction, Table Information Extraction, Price Table Pattern, Tourism Price Ontology, Ontology-aware Price Annotation
Field
Ontology, Information retrieval, Computer science, Data interpretation, Operations research, Tourism, Unstructured data, Information extraction, Web tables, Web information, Marketing
DocType
Conference
Citations
0
PageRank
0.34
References
12
Authors
5
Name                   Order  Citations  PageRank
Christina Buttinger    1      3          0.74
Christina Feilmayr     2      32         5.66
Michael Guttenbrunner  3      0          0.34
Stefan Parzer          4      2          0.69
Birgit Pröll           5      274        29.48