Title
Unsupervised Word Embedding Learning By Incorporating Local And Global Contexts
Abstract
Word embedding has benefited a broad spectrum of text analysis tasks by learning distributed word representations that encode word semantics. Word representations are typically learned by modeling the local contexts of words, under the assumption that words sharing similar surrounding words are semantically close. We argue that local contexts can only partially define word semantics in unsupervised word embedding learning. Global contexts, referring to broader semantic units such as the document or paragraph in which a word appears, capture different aspects of word semantics and complement local contexts. We propose two simple yet effective unsupervised word embedding models that jointly model both local and global contexts to learn word representations. We provide theoretical interpretations of the proposed models, assuming a generative relationship between words and contexts, to demonstrate how local and global contexts are jointly modeled. We conduct a thorough evaluation on a wide range of benchmark datasets. Our quantitative analysis and case study show that, despite their simplicity, our two proposed models achieve superior performance on word similarity and text classification tasks.
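To make the abstract's idea concrete, below is a minimal sketch of jointly modeling local and global contexts. It is not the authors' released model or objective: the local term is a standard skip-gram-with-negative-sampling update over a symmetric word window, and the global term treats a learned per-document vector as an extra context that predicts each word in that document, in the spirit of the abstract's description. All names (train_joint, lam, etc.), the mixing weight lam, and the uniform negative sampler are illustrative assumptions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_joint(docs, dim=50, window=2, neg=3, lr=0.025, lam=0.5,
                epochs=5, seed=0):
    """docs: list of tokenized documents (lists of word strings).
    lam is an assumed local/global trade-off weight, not from the paper."""
    rng = np.random.default_rng(seed)
    vocab = sorted({w for doc in docs for w in doc})
    idx = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)
    W = rng.normal(0.0, 0.1, (V, dim))          # word (input) vectors
    C = rng.normal(0.0, 0.1, (V, dim))          # context (output) vectors
    D = rng.normal(0.0, 0.1, (len(docs), dim))  # one global vector per document

    def sgns_update(u_mat, ui, target, weight):
        # One positive target plus `neg` uniformly sampled negatives;
        # gradient of the standard negative-sampling log loss.
        pairs = [(target, 1.0)] + [(int(rng.integers(V)), 0.0)
                                   for _ in range(neg)]
        for ci, label in pairs:
            g = weight * (sigmoid(u_mat[ui] @ C[ci]) - label)
            du, dc = g * C[ci], g * u_mat[ui]   # compute both grads first
            u_mat[ui] -= lr * du
            C[ci] -= lr * dc

    for _ in range(epochs):
        for d_id, doc in enumerate(docs):
            for pos, w in enumerate(doc):
                wi = idx[w]
                # Local context: surrounding words inside a symmetric window.
                lo, hi = max(0, pos - window), min(len(doc), pos + window + 1)
                for j in range(lo, hi):
                    if j != pos:
                        sgns_update(W, wi, idx[doc[j]], 1.0)
                # Global context: the document vector predicts the word,
                # down-weighted by lam.
                sgns_update(D, d_id, wi, lam)
    return vocab, idx, W, D

# Tiny usage example with toy documents:
docs = [["deep", "learning", "for", "text"],
        ["text", "classification", "with", "embeddings"]]
vocab, idx, W, D = train_joint(docs, dim=16, epochs=50)
```

Under this sketch, words that share local windows and words that co-occur in the same documents are both pulled together in the embedding space, which is the complementary effect the abstract argues for.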
Year
2020
DOI
10.3389/fdata.2020.00009
Venue
FRONTIERS IN BIG DATA
Keywords
word embedding, unsupervised learning, word semantics, local contexts, global contexts
DocType
Journal
Volume
3
ISSN
2624-909X
Citations
0
PageRank
0.34
References
0
Authors
6
Name | Order | Citations | PageRank
Yu Meng | 1 | 49 | 11.09
Jiaxin Huang | 2 | 25 | 6.63
Guangyuan Wang | 3 | 16 | 5.01
Zihan Wang | 4 | 0 | 0.34
Chao Zhang | 5 | 939 | 103.66
Jiawei Han | 6 | 43085 | 3824.48