Automatic Classification of Websites based on Keyword Extraction of Nouns - Citegraph

Paper Info

Title
Automatic Classification of Websites based on Keyword Extraction of Nouns

Abstract
In this paper, an automatic collection system is proposed that can extract unique keywords appearing in websites belonging to a specific category and that can use these keywords to classify websites into tourism-related categories to establish a dynamic tourism-related Internet directory. First, the keyword extraction algorithm is explained and many tourism-related websites are gathered from the directory-based search engine ?Yahoo! Japan?. Then these sites are classified into categories by applying the proposed algorithm. The experimental results show that the proposed method can classify websites into proper categories with a high degree of precision, and that by setting a threshold evaluation value it can detect unrelated websites not classified in any category.

Year	DOI	Venue
2006	10.1007/3-211-32710-X_38	ENTER
Keywords	Field	DocType
website,keyword extraction,web mining,automatic classification.,search engine,noun	Keyword density,Web mining,Search engine,Information retrieval,Directory,Keyword extraction,Computer science,Noun,Natural language processing,Artificial intelligence,The Internet	Conference
Citations	PageRank	References
1	0.38	2
Authors
3

Authors (3 rows)

Cited by (1 rows)

References (2 rows)

Name	Order	Citations	PageRank
Takatomo Honda	1	1	0.38
Masahito Yamamoto	2	1	0.38
Azuma Ohuchi	3	386	68.99

1