Hyperdoc2vec: Distributed Representations Of Hypertext Documents - Citegraph

Paper Info

Title
Hyperdoc2vec: Distributed Representations Of Hypertext Documents

Abstract
Hypertext documents, such as web pages and academic papers, are of great importance in delivering information in our daily life. Although being effective on plain documents, conventional text embedding methods suffer from information loss if directly adapted to hyper-documents. In this paper, we propose a general embedding approach for hyper-documents, namely, hyperdoc2vec, along with four criteria characterizing necessary information that hyper-document embedding models should preserve. Systematic comparisons are conducted between hyperdoc2vec and several competitors on two tasks, i.e., paper classification and citation recommendation, in the academic paper domain. Analyses and experiments both validate the superiority of hyperdoc2vec to other models w.r.t. the four criteria.

Year	DOI	Venue
2018	10.18653/v1/p18-1222	Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), Vol. 1
Field	DocType	Volume
Hypertext, Information loss, Embedding, Information retrieval, Web page, Computer science, Citation, Artificial intelligence, Natural language processing, Competitor analysis	Journal	abs/1805.03793
Citations	PageRank	References
0	0.34	26
Authors
5

Authors (5 rows)

Name	Order	Citations	PageRank
Jialong Han	1	97	8.65
Yan Song	2	284	53.62
Wayne Xin Zhao	3	1275	66.73
Shuming Shi	4	620	58.27
Haisong Zhang	5	15	8.00
