Semantic similarity between Turkish and European languages using word embeddings. - Citegraph

Paper Info

Title
Semantic similarity between Turkish and European languages using word embeddings.

Abstract
Representation of words coming from vocabulary of a language as real vectors in a high dimensional space is called as word embeddings. Word embeddings are proven to be successful in modelling semantic relations between words and numerous natural language processing applications. Although developed mainly for English, word embeddings perform well for many other languages. In this study, semantic similarity between Turkish (two different corpora) and five basic European languages (English, German, French, Spanish, Italian) is calculated using word embeddings over a fixed vocabulary, obtained results are verified using statistical testing. Also, the effect of using different corpora, and additional preprocess steps on the performance of word embeddings on similarity and analogy test sets prepared for Turkish is studied.

Year	Venue	Keywords
2017	Signal Processing and Communications Applications Conference	word embeddings,natural language processing,semantic similarity between languages.
Field	DocType	ISSN
Semantic similarity,Turkish,Word lists by frequency,Computer science,Modeling language,Natural language processing,Artificial intelligence,Analogy,Vocabulary,Semantics,German	Conference	2165-0608
Citations	PageRank	References
0	0.34	8
Authors
4

Authors (4 rows)

Cited by (0 rows)

References (8 rows)

Name	Order	Citations	PageRank
Lutfi Kerem Sjenel	1	0	0.34
Veysel Yücesoy	2	0	1.35
Aykut Koc	3	12	9.01
Tolga Çukur	4	36	8.84

1