Title
A synergistic strategy for combining thesaurus-based and corpus-based approaches in building ontology for multilingual search engines
Abstract
Cross-language search engine as a tool for co-learning multilingual ontology.The methodology for building cross-language search engine.Combining thesaurus-based and corpus-based approaches.Application of a query translation to retrieve multilingual documents.How to evaluate a multilingual information retrieval system. In this article we illustrate a methodology for building cross-language search engine. A synergistic approach between thesaurus-based approach and corpus-based approach is proposed. First, a bilingual ontology thesaurus is designed with respect to two languages: English and Spanish, where a simple bilingual listing of terms, phrases, concepts, and subconcepts is built. Second, term vector translation is used - a statistical multilingual text retrieval techniques that maps statistical information about term use between languages (Ontology co-learning). These techniques map sets of t f id f term weights from one language to another. We also applied a query translation method to retrieve multilingual documents with an expansion technique for phrasal translation. Finally, we present our findings.
Year
DOI
Venue
2015
10.1016/j.chb.2015.03.021
Computers in Human Behavior
Keywords
DocType
Volume
Multi-language search engines,Cross-language search engines,Ontologies,Social Networks,Ontology co-learning
Journal
51
Issue
ISSN
Citations 
PB
0747-5632
4
PageRank 
References 
Authors
0.39
31
1
Name
Order
Citations
PageRank
Leyla Zhuhadar114617.53