Open-Domain Question Answering Framework Using Wikipedia. - Citegraph

Paper Info

Title
Open-Domain Question Answering Framework Using Wikipedia.

Abstract
This paper explores the feasibility of implementing a model for an open-domain, automated question answering framework that leverages Wikipedia’s knowledge base. While Wikipedia implicitly contains answers to common questions, the ambiguity of natural language and the difficulty of developing an information retrieval process that produces sufficiently specific answers present significant challenges. However, observational analysis suggests that it is possible to disregard the syntactic and lexical structure of a sentence when a question contains a specific target entity (words that identify a person, location or organisation) and queries a property related to that entity. To investigate this, we implemented an algorithmic process that extracted the target entity from the question using CRF-based named entity recognition (NER) and treated all remaining words as potential properties. Using DBpedia, an ontological database of Wikipedia’s knowledge, we searched for the closest matching property that would produce an answer by applying standard string matching algorithms, including the Levenshtein distance, similar text and Dice’s coefficient. Our experimental results show that using Wikipedia as a knowledge base produces high precision for questions that contain a single unambiguous entity as the subject, but lower accuracy for questions where the entity appears as part of the object.
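The property-matching step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the bigram-based form of Dice's coefficient, the equal-weight average of the two scores and the sample property names are all assumptions made for illustration. The NER step and the DBpedia lookup are omitted, so the question words (with the target entity already removed) and the candidate properties are passed in as plain lists.

```python
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Edit distance computed with a two-row dynamic programme."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def bigrams(s: str) -> list:
    """Overlapping character pairs of a string."""
    return [s[i:i + 2] for i in range(len(s) - 1)]


def dice_coefficient(a: str, b: str) -> float:
    """Dice's coefficient over character bigrams (assumed variant)."""
    x, y = Counter(bigrams(a)), Counter(bigrams(b))
    total = sum(x.values()) + sum(y.values())
    if total == 0:
        # Strings too short to form bigrams: fall back to equality.
        return 1.0 if a == b else 0.0
    overlap = sum((x & y).values())
    return 2 * overlap / total


def similarity(a: str, b: str) -> float:
    """Assumed scoring: mean of normalised Levenshtein and Dice scores."""
    a, b = a.lower(), b.lower()
    longest = max(len(a), len(b))
    lev = 1.0 if longest == 0 else 1 - levenshtein(a, b) / longest
    return (lev + dice_coefficient(a, b)) / 2


def best_property(question_words, properties):
    """Return the candidate property closest to any remaining question word."""
    score, _, prop = max(
        (similarity(w, p), w, p) for w in question_words for p in properties
    )
    return prop, score


# Hypothetical input: question words left after removing the target entity,
# and hypothetical property names for that entity.
words = ["who", "is", "the", "spouse", "of"]
candidates = ["birthPlace", "spouse", "almaMater"]
print(best_property(words, candidates))
```

In this sketch every remaining word is scored against every candidate property, which mirrors the idea of ignoring sentence structure; a real system would also need stop-word handling and a threshold below which no answer is returned.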

Year	Venue	Field
2016	Australasian Conference on Artificial Intelligence	Entity linking, String searching algorithm, Ontology, Question answering, Information retrieval, Computer science, Levenshtein distance, Natural language, Artificial intelligence, Natural language processing, Named-entity recognition, Sentence
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	4

Authors (4 rows)

Name	Order	Citations	PageRank
Saleem Ameen	1	0	0.34
Hyunsuk Chung	2	14	4.23
Soyeon Caren Han	3	30	11.67
Byeong Ho Kang	4	541	72.76
