Title
A General Embedding Framework for Heterogeneous Information Learning in Large-Scale Networks.
Abstract
Network analysis has been widely applied in many real-world tasks, such as gene analysis and targeted marketing. To extract effective features for these analysis tasks, network embedding automatically learns a low-dimensional vector representation for each node, such that the meaningful topological proximity is well preserved. While the embedding algorithms on pure topological structure have attracted considerable attention, in practice, nodes are often abundantly accompanied with other types of meaningful information, such as node attributes, second-order proximity, and link directionality. A general framework for incorporating the heterogeneous information into network embedding could be potentially helpful in learning better vector representations. However, it remains a challenging task to jointly embed the geometrical structure and a distinct type of information due to the heterogeneity. In addition, the real-world networks often contain a large number of nodes, which put demands on the scalability of the embedding algorithms. To bridge the gap, in this article, we propose a general embedding framework named Heterogeneous Information Learning in Large-scale networks (HILL) to accelerate the joint learning. It enables the simultaneous node proximity assessing process to be done in a distributed manner by decomposing the complex modeling and optimization into many simple and independent sub-problems. We validate the significant correlation between the heterogeneous information and topological structure, and illustrate the generalizability of HILL by applying it to perform attributed network embedding and second-order proximity learning. A variation is proposed for link directionality modeling. Experimental results on real-world networks demonstrate the effectiveness and efficiency of HILL.
Year
DOI
Venue
2018
10.1145/3241063
TKDD
Keywords
Field
DocType
Data mining, distributed processing, heterogeneity, network embedding
Generalizability theory,Embedding,Computer science,Targeted marketing,Artificial intelligence,Network analysis,Network embedding,Machine learning,Scalability
Journal
Volume
Issue
ISSN
12
6
1556-4681
Citations 
PageRank 
References 
1
0.35
45
Authors
4
Name
Order
Citations
PageRank
Xiao Huang123614.27
Jundong Li270950.13
Na Zou3102.67
Xia Hu42411110.07