Extracting research communities from bibliographic data - Citegraph

Paper Info

Title
Extracting research communities from bibliographic data

Abstract
We develop a research community extraction algorithm from large bibliographic data, which was preliminarily reported in Horiike et al. [10] and Nakamura et al. [18]. A research community in bibliographic data is considered to be a set of the linked texts holding a common topic, in other words, it is a dense subgraph embedded in the directed graph. Our method is based on the maximum flow algorithm for finding web communities by Flake et al. [5]. We propose improvements of the algorithm to select community nodes and initial seeds taking account of the restriction that any directed graph is acyclic. We examine the improved algorithm for the list of keywords frequently appearing in the bibliographic data. In addition we propose a simple method to extract characteristic keywords for deciding initial seed nodes. This method is also evaluated by experiments.

Year	DOI	Venue
2012	10.3233/KES-2012-0230	KES Journal
Keywords	Field	DocType
research community,initial seed,maximum flow algorithm,simple method,extracting research community,large bibliographic data,bibliographic data,improved algorithm,research community extraction algorithm,web community,community node	Graph algorithms,Information retrieval,Extraction algorithm,Computer science,Directed graph,Information extraction,Maximum flow problem,Artificial intelligence,Initial Seed,Machine learning	Journal
Volume	Issue	ISSN
16	1	1327-2314
Citations	PageRank	References
0	0.34	18
Authors
4

Authors (4 rows)

Cited by (0 rows)

References (18 rows)

Name	Order	Citations	PageRank
Yushi Nakamura	1	1	0.69
Toshihiko Horiike	2	3	1.06
Tetsuji Kuboyama	3	140	29.36
Hiroshi Sakamoto	4	47	6.63

1