Abstract | ||
---|---|---|
TREC Genomics proposed a question answering task: given a set of questions, each with a specific entity of interest, a set of passages has to be selected from the provided collection of full-text documents. We used a two-step approach: the first step is recall-oriented retrieval, and the second is an information extraction system intended to provide higher precision. We rely on well-known techniques such as query expansion and resources such as MeSH and UMLS. The information extraction techniques are part of the infrastructure of the Text Mining Group at the European Bioinformatics Institute. Using standard information retrieval techniques was found to be more beneficial than using more complex processing. Having analyzed the results, we find that the performance of query expansion varies across topics. There are several reasons for this. Terminological resources may contain ambiguous synonyms or synonyms whose textual usage patterns differ from those of the original query terms. On the whole, our performance was similar to the mean results on the three performance measures. |
Year | Venue | Keywords |
---|---|---|
2007 | TREC | query expansion,information retrieval,text mining,question answering,information extraction |
Field | DocType | Citations |
Data mining,Query language,Human–computer information retrieval,Information retrieval,Query expansion,Computer science,Information extraction,Relevance (information retrieval),Adversarial information retrieval,Concept search,TREC Genomics | Conference | 4 |
PageRank | References | Authors |
0.53 | 4 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Antonio Jimeno | 1 | 4 | 0.53 |
P Pezik | 2 | 141 | 9.07 |