Distilled Wasserstein Learning for Word Embedding and Topic Modeling. - Citegraph

Paper Info

Title
Distilled Wasserstein Learning for Word Embedding and Topic Modeling.

Abstract
We propose a novel Wasserstein method with a distillation mechanism, yielding joint learning of word embeddings and topics. The proposed method is based on the fact that the Euclidean distance between word embeddings may be employed as the underlying distance in the Wasserstein topic model. The word distributions of topics, their optimal transports to the word distributions of documents, and the embeddings of words are learned in a unified framework. When learning the topic model, we leverage a distilled underlying distance matrix to update the topic distributions and smoothly calculate the corresponding optimal transports. Such a strategy provides the updating of word embeddings with robust guidance, improving the algorithmic convergence. As an application, we focus on patient admission records, in which the proposed method embeds the codes of diseases and procedures and learns the topics of admissions, obtaining superior performance on clinically-meaningful disease network construction, mortality prediction as a function of admission codes, and procedure recommendation.

Year	Venue	Keywords
2018	ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018)	unified framework,proposed method,optimal transport,word embedding,topic model,topic modeling,euclidean distance
DocType	Volume	ISSN
Conference	31	1049-5258
Citations	PageRank	References
6	0.38	25
Authors
4

Authors (4 rows)

Cited by (6 rows)

References (25 rows)

Name	Order	Citations	PageRank
Hongteng Xu	1	282	27.10
Wenlin Wang	2	51	7.06
Wei Liu	3	4041	204.19
L. Carin	4	4603	339.36

1