Title
Learning multiple graphs for document recommendations
Abstract
The Web offers rich relational data with different semantics. In this paper, we address the problem of document recommendation in a digital library, where the documents in question are networked by citations and are associated with other entities by various relations. Due to the sparsity of a single graph and noise in graph construction, we propose a new method for combining multiple graphs to measure document similarities, where different factorization strategies are used based on the nature of different graphs. In particular, the new method seeks a single low-dimensional embedding of documents that captures their relative similarities in a latent space. Based on the obtained embedding, a new recommendation framework is developed using semi-supervised learning on graphs. In addition, we address the scalability issue and propose an incremental algorithm. The new incremental method significantly improves the efficiency by calculating the embedding for new incoming documents only. The new batch and incremental methods are evaluated on two real world datasets prepared from CiteSeer. Experiments demonstrate significant quality improvement for our batch method and significant efficiency improvement with tolerable quality loss for our incremental method.
Year
DOI
Venue
2008
10.1145/1367497.1367517
WWW
Keywords
Field
DocType
incremental algorithm,new incremental method,batch method,new recommendation framework,new incoming document,multiple graph,different graph,different factorization strategy,new batch,new method,document recommendation,incremental method,spectral clustering,quality improvement,digital library,semi supervised learning,recommender systems,relational data,social network analysis,recommender system,collaborative filtering
Data mining,Semi-supervised learning,Relational database,Computer science,Artificial intelligence,Recommender system,World Wide Web,Collaborative filtering,Embedding,Information retrieval,Social network analysis,Machine learning,Semantics,Scalability
Conference
Citations 
PageRank 
References 
76
3.52
17
Authors
7
Name
Order
Citations
PageRank
Ding Zhou185337.41
Zhu, Shenghuo22996167.68
Yu, Kai34799255.21
Xiaodan Song473354.42
Belle L. Tseng51539143.03
Hongyuan Zha66703422.09
C. Lee Giles7111541549.48