Title
SEWISE: An Ontology-based Web Information Search Engine
Abstract
Since the begin of the 90's, the World Wide Web (WWW) rapidly guides the world into a newly amazing electronic village, where everybody can publish everything in electronic form and find almost all required information. The volume of available information is increasing exponentially in different formats, 80% being text. It remains hard to find interesting information directly from Web sources. SEWISE is an ontology-based Web information system to support Web information description and retrieval. According to domain ontology, SEWISE can map text information from various Web sources into one uniform XML structure and make hidden semantic in text accessible to program. The textual information of interest is automatically extracted by Web Wrappers from various Web sources and then text mining techniques such a s categorization and summarization are used to process retrieved text information. Finally, text descriptions are built in XML format that can be directly queried. SEWISE provides support for topic-centric Web information search. The SEWISE prototype is implemented and has been experimented using French financial Web news from several popular sites.
Year
Venue
Keywords
2003
NLDB
world wide web,text mining
Field
DocType
Citations 
Data mining,World Wide Web,Web mining,Web intelligence,Semantic Web Stack,Information retrieval,Web page,Computer science,Web standards,Web information system,Web modeling,Web navigation
Conference
4
PageRank 
References 
Authors
0.49
12
5
Name
Order
Citations
PageRank
G. Gardarin1900710.49
Huaizhong Kou2152.60
Karine Zeitouni318333.69
Xiaofeng Meng41435128.47
Haiyan Wang5106.38