Title
Automatic Classification of Websites based on Keyword Extraction of Nouns
Abstract
In this paper, an automatic collection system is proposed that can extract unique keywords appearing in websites belonging to a specific category and that can use these keywords to classify websites into tourism-related categories to establish a dynamic tourism-related Internet directory. First, the keyword extraction algorithm is explained and many tourism-related websites are gathered from the directory-based search engine "Yahoo! Japan". Then these sites are classified into categories by applying the proposed algorithm. The experimental results show that the proposed method can classify websites into proper categories with a high degree of precision, and that by setting a threshold evaluation value it can detect unrelated websites not classified in any category.
Year
2006
DOI
10.1007/3-211-32710-X_38
Venue
ENTER
Keywords
website, keyword extraction, web mining, automatic classification, search engine, noun
Field
Keyword density, Web mining, Search engine, Information retrieval, Directory, Keyword extraction, Computer science, Noun, Natural language processing, Artificial intelligence, The Internet
DocType
Conference
Citations
1
PageRank
0.38
References
2
Authors
3
Name                Order    Citations    PageRank
Takatomo Honda      1        1            0.38
Masahito Yamamoto   2        1            0.38
Azuma Ohuchi        3        386          68.99