Title
Evaluating Resource-Lean Cross-Lingual Embedding Models in Unsupervised Retrieval
Abstract
Cross-lingual embeddings (CLE) facilitate cross-lingual natural language processing and information retrieval. Recently, a wide variety of resource-lean projection-based models for inducing CLEs has been introduced, requiring limited or no bilingual supervision. Despite potential usefulness in downstream IR and NLP tasks, these CLE models have almost exclusively been evaluated on word translation tasks. In this work, we provide a comprehensive comparative evaluation of projection-based CLE models for both sentence-level and document-level cross-lingual Information Retrieval (CLIR). We show that in some settings resource-lean CLE-based CLIR models may outperform resource-intensive models using full-blown machine translation (MT). We hope our work serves as a guideline for choosing the right model for CLIR practitioners.
Year
DOI
Venue
2019
10.1145/3331184.3331324
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
Keywords
Field
DocType
clir evaluation, cross-lingual IR, cross-lingual embeddings
Cross lingual,Embedding,Information retrieval,Computer science,Machine translation
Conference
ISBN
Citations 
PageRank 
978-1-4503-6172-9
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Robert Litschko111.39
Goran Glavaš213931.85
Ivan Vulic346252.59
Laura Dietz433928.86