Graph-based Approach to Automatic Taxonomy Generation (GraBTax). - Citegraph

Paper Info

Title
Graph-based Approach to Automatic Taxonomy Generation (GraBTax).

Abstract
We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm, GraBTax, incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To automatically generate topic-dependent taxonomies from a large text corpus, GraBTax first extracts topical terms and their relationships from the corpus. The algorithm then constructs a weighted graph representing topics and their associations. A graph partitioning algorithm is then used to recursively partition the topic graph into a taxonomy. For evaluation, we apply GraBTax to articles, primarily computer science, in the CiteSeerX digital library and search engine. The quality of the resulting concept hierarchy is assessed by both human judges and comparison with Wikipedia categories.

Year	Venue	Field
2013	CoRR	Lexical similarity,Data mining,Computer science,Natural language processing,Artificial intelligence,Digital library,Graph partition,Recursion,Graph database,Search engine,Information retrieval,Text corpus,Graph (abstract data type)
DocType	Volume	Citations
Journal	abs/1307.1718	0
PageRank	References	Authors
0.34	15	3

Authors (3 rows)

Cited by (0 rows)

References (15 rows)

Name	Order	Citations	PageRank
Pucktada Treeratpituk	1	177	11.12
Madian Khabsa	2	237	18.81
C. Lee Giles	3	11154	1549.48

1