Title
Combining biological databases and text mining to support new bioinformatics applications
Abstract
A large amount of biological knowledge today is only available from full-text research papers. Since neither manual database curators nor users can keep up with the rapidly expanding volume of scientific literature, natural language processing approaches are becoming increasingly important for bioinformatic projects. In this paper, we go beyond simply extracting information from full-text articles by describing an architecture that supports targeted access to information from biological databases using the results derived from text mining of research papers, thereby integrating information from both sources within a biological application. The described architecture is currently being used to extract information about protein mutations from full-text research papers. Text mining results drive the retrieval of sequence information from protein databases and the employment of algorithmic sequence analysis tools, which facilitate further data access from protein structure databases. Complex mapping of NLP derived text annotations to protein structures allows the rendering, with 3D structure visualization, of information not available in databases of mutation annotations.
Year
DOI
Venue
2005
10.1007/11428817_28
NLDB
Keywords
Field
DocType
protein structure databases,sequence information,biological databases,new bioinformatics application,full-text research paper,text mining,full-text article,protein structure,protein mutation,biological application,protein databases,biological knowledge,biological database,natural language processing,data access,sequence analysis
Information system,Scientific literature,World Wide Web,Protein structure database,Information retrieval,Computer science,Visualization,Information access,Biological database,Information extraction,Data access,Distributed computing
Conference
Volume
ISSN
ISBN
3513
0302-9743
3-540-26031-5
Citations 
PageRank 
References 
10
1.01
8
Authors
2
Name
Order
Citations
PageRank
René Witte117216.93
christopher j o baker232930.96