Title
Natural Language Processing and Big Data - An Ontology-Based Approach for Cross-Lingual Information Retrieval
Abstract
Extracting relevant information in a multilingual context from massive amounts of unstructured, structured and semi-structured data is a challenging task. Various theories have been developed and applied to ease access to multicultural and multilingual resources. This paper describes a methodology for the development of an ontology-based Cross-Language Information Retrieval (CLIR) application and shows how Natural Language (NL) queries in any language can be translated by means of a knowledge-driven approach that semi-automatically maps natural language to formal language, thereby simplifying and improving human-computer interaction and communication. The outlined research activities are based on Lexicon-Grammar (LG), a method devised for natural language formalization, automatic textual analysis and parsing. Thanks to its main characteristics, LG is independent of factors that are critical for other approaches, i.e. interaction type (voice or keyboard-based), length of sentences and propositions, type of vocabulary used and restrictions due to users' idiolects. The feasibility of our knowledge-based methodological framework, which allows mapping both data and metadata, will be tested for CLIR by implementing a domain-specific early prototype system.
Year
2013
DOI
10.1109/SocialCom.2013.108
Venue
Social Computing
Keywords
domain-specific early prototype system,challenging task,big data,semi-structured data,multilingual context,cross-lingual information retrieval,formal language,ontology-based approach,natural language,natural language formalization,automatic textual analysis,multilingual resource,natural language processing,human-computer interaction,grammars,human computer interaction,formal languages,meta data
Field
Question answering,Information retrieval,Computer science,Object language,Natural language programming,Natural language,Information extraction,Natural language processing,Universal Networking Language,Language identification,Artificial intelligence,Vocabulary
DocType
Conference
Citations
0
PageRank
0.34
References
9
Authors
4
Name                 Order  Citations  PageRank
Johanna Monti        1      6          6.25
Mario Monteleone     2      7          3.63
Maria Pia di Buono   3      2          7.15
Federica Marano      4      5          1.86