Title
Extracting research communities from bibliographic data
Abstract
We develop a research community extraction algorithm from large bibliographic data, which was preliminarily reported in Horiike et al. [10] and Nakamura et al. [18]. A research community in bibliographic data is considered to be a set of the linked texts holding a common topic, in other words, it is a dense subgraph embedded in the directed graph. Our method is based on the maximum flow algorithm for finding web communities by Flake et al. [5]. We propose improvements of the algorithm to select community nodes and initial seeds taking account of the restriction that any directed graph is acyclic. We examine the improved algorithm for the list of keywords frequently appearing in the bibliographic data. In addition we propose a simple method to extract characteristic keywords for deciding initial seed nodes. This method is also evaluated by experiments.
Year
DOI
Venue
2012
10.3233/KES-2012-0230
KES Journal
Keywords
Field
DocType
research community,initial seed,maximum flow algorithm,simple method,extracting research community,large bibliographic data,bibliographic data,improved algorithm,research community extraction algorithm,web community,community node
Graph algorithms,Information retrieval,Extraction algorithm,Computer science,Directed graph,Information extraction,Maximum flow problem,Artificial intelligence,Initial Seed,Machine learning
Journal
Volume
Issue
ISSN
16
1
1327-2314
Citations 
PageRank 
References 
0
0.34
18
Authors
4
Name
Order
Citations
PageRank
Yushi Nakamura110.69
Toshihiko Horiike231.06
Tetsuji Kuboyama314029.36
Hiroshi Sakamoto4476.63