Title
Extraction of gene-disease relations from Medline using domain dictionaries and machine learning.
Abstract
We describe a system that extracts disease-gene relations from Medline. We constructed a dictionary for disease and gene names from six public databases and extracted relation candidates by dictionary matching. Since dictionary matching produces a large number of false positives, we developed a method of machine learning-based named entity recognition (NER) to filter out false recognitions of disease/gene names. We found that the performance of relation extraction is heavily dependent upon the performance of NER filtering and that the filtering improves the precision of relation extraction by 26.7% at the cost of a small reduction in recall.
Year
Venue
Keywords
2006
Pacific Symposium on Biocomputing
machine learning,false positive,relation extraction
Field
DocType
ISSN
Informatics,Research center,Computer science,Artificial intelligence,MEDLINE,Machine learning
Conference
2335-6936
Citations 
PageRank 
References 
45
2.29
6
Authors
7
Name
Order
Citations
PageRank
Hong-woo Chun116812.86
Yoshimasa Tsuruoka2136488.95
Jin-Dong Kim3170592.21
Rie Shiba4593.15
Naoki Nagata5593.15
Teruyoshi Hishiki69210.99
Jun-ichi Tsujii71973219.85