Title
Gathering Services of IHWA from Semi-Structured Web Information Sources
Abstract
Information Harvest WArehouse (IHWA) is a web-based information search system. It is designed using the Component Based Software Engineering (CBSE) paradigm, where applications are to be developed by integrating various software components In this paper, ive describe the development of the meta-information gathering set-vice of IHWA (Meta Gatherer), which collects and extracts information from send-structured or unstructured data sources. Focus is on the development of the information extraction service of Me gatherer from semi-stuctured (DTD-unknown XML data) Internet information sources. The information extraction module implemented provides clean Java program interfaces, so that it can be easily integrated with other applications. Its implementation is an efficient one as well, since it analyzes a source XML file in one path, where most other systems use two paths approach.
Year
Venue
Keywords
2001
IC'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET COMPUTING, VOLS I AND II
CBSE,XML,electronic commerce
Field
DocType
Citations 
World Wide Web,Web intelligence,Information retrieval,Computer science,Web information
Conference
0
PageRank 
References 
Authors
0.34
0
2
Name
Order
Citations
PageRank
Jong-seok Jeong100.34
Oh Dong-ik2434.50