Abstract | ||
---|---|---|
A systematic review (SR) in evidence-based medicine is a literature review that provides a conclusion to a specific clinical question. To ensure credible and reproducible conclusions, SRs are conducted in well-defined steps. One of the key steps, the screening step, is to identify relevant documents from a pool of candidate documents. Typically, about 2000 candidate documents are retrieved from databases using keyword queries for an SR. From these, about 20 relevant documents are manually identified by SR experts, based on detailed relevance conditions or eligibility criteria. Recent studies show that document ranking, or screening prioritization, is a promising way to improve the manual screening process. In this paper, we propose a seed-driven document ranking (SDR) model for effective screening, with the assumption that one relevant document is known, i.e., the seed document. Based on a detailed analysis of the characteristics of relevant documents, SDR represents documents using a bag of clinical terms, rather than the commonly used bag of words. More importantly, we propose a method to estimate the importance of the clinical terms based on their distribution in candidate documents. On the benchmark dataset released by CLEF'17 eHealth Task 2, we show that the proposed SDR outperforms state-of-the-art solutions. Interestingly, we also observe that ranking based on a word embedding representation of documents complements SDR well. The best ranking is achieved by combining the relevance scores estimated by SDR and by word embedding. Additionally, we report results of simulating the manual screening process with SDR.
|
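
The seed-driven idea in the abstract can be illustrated with a minimal sketch. It is not the SDR model from the paper: the smoothed IDF-style weight stands in for the paper's term-importance estimate (which the abstract does not specify), the function names `term_weights` and `rank_by_seed` are hypothetical, and each document is assumed to be already reduced to a list of clinical terms.

```python
import math
from collections import Counter


def term_weights(candidates):
    """Weight each term by how rare it is within the candidate pool.

    Assumption: a smoothed IDF, used here only as a stand-in for the
    paper's distribution-based importance estimate.
    """
    n = len(candidates)
    # Document frequency: number of candidates containing each term.
    df = Counter(term for doc in candidates for term in set(doc))
    return {t: math.log((n + 1) / (c + 1)) + 1.0 for t, c in df.items()}


def rank_by_seed(seed, candidates):
    """Return candidate indices, most similar to the seed document first."""
    weights = term_weights(candidates)
    seed_tf = Counter(seed)

    def score(doc):
        doc_tf = Counter(doc)
        shared = set(seed_tf) & set(doc_tf)
        # Weighted overlap between the seed and the candidate.
        return sum(weights.get(t, 0.0) * min(seed_tf[t], doc_tf[t]) for t in shared)

    return sorted(range(len(candidates)),
                  key=lambda i: score(candidates[i]),
                  reverse=True)


# Toy data (invented for illustration, not from the CLEF'17 benchmark).
seed = ["aspirin", "stroke", "prevention"]
candidates = [
    ["aspirin", "stroke"],
    ["diabetes", "insulin"],
    ["aspirin", "headache"],
]
ranking = rank_by_seed(seed, candidates)
```

In a screening setting, an expert would read candidates in the order given by `ranking`, so that documents sharing rare clinical terms with the seed are reviewed first.
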
Year | DOI | Venue |
---|---|---|
2018 | 10.1145/3209978.3209994 | SIGIR |

Keywords | Field | DocType |
---|---|---|
Document Ranking, Systematic Reviews, Seed Document | Bag-of-words model, Information retrieval, Ranking, Systematic review, Computer science, Prioritization, eHealth, Word embedding, Evidence-based medicine | Conference |

ISBN | Citations | PageRank |
---|---|---|
978-1-4503-5657-2 | 2 | 0.38 |

References | Authors |
---|---|
15 | 2 |

Name | Order | Citations | PageRank |
---|---|---|---|
Grace E. Lee | 1 | 2 | 0.72 |
Aixin Sun | 2 | 3071 | 156.89 |