Abstract | ||
---|---|---|
The high heterogeneity of biomedical vocabulary is a major obstacle for information retrieval in large biomedical collections. Therefore, using biomedical controlled vocabularies is crucial for managing these contents. We investigate the impact of query expansion based on controlled vocabularies to improve the effectiveness of two search engines. Our strategy relies on the enrichment of users' queries with additional terms, directly derived from such vocabularies applied to infectious diseases and chemical patents. We observed that query expansion based on pathogen names resulted in improvements of the top-precision of our first search engine, while the normalization of diseases degraded the top-precision. The expansion of chemical entities, which was performed on the second search engine, positively affected the mean average precision. We have shown that query expansion of some types of biomedical entities has a great potential to improve search effectiveness; therefore a fine-tuning of query expansion strategies could help improving the performances of search engines. |
Year | DOI | Venue |
---|---|---|
2013 | 10.3233/978-1-61499-289-9-1068 | Studies in Health Technology and Informatics |
Keywords | Field | DocType |
Controlled vocabularies,Normalization,Information Retrieval | Web search query,Data mining,Search engine,Normalization (statistics),Query expansion,Information retrieval,Computer science,Controlled vocabulary,Natural language processing,Artificial intelligence,Concept search,Vocabulary | Conference |
Volume | ISSN | Citations |
192 | 0926-9630 | 1 |
PageRank | References | Authors |
0.35 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Emilie Pasche | 1 | 99 | 15.93 |
Julien Gobeill | 2 | 302 | 30.42 |
Dina Vishnyakova | 3 | 113 | 11.16 |
patrick ruch | 4 | 117 | 22.37 |
Christian Lovis | 5 | 349 | 55.53 |