Title
Architecture of a mediator for a bioinformatics database federation.
Abstract
Developments in our ability to integrate and analyze data held in existing heterogeneous data resources can lead to an increase in our understanding of biological function at all levels. However, supporting ad hoc queries across multiple data resources and correlating data retrieved from these is still difficult. To address this, we are building a mediator based on the functional data model database, P/FDM, which integrates access to heterogeneous distributed biological databases. Our architecture makes use of the existing search capabilities and indexes of the underlying databases, without infringing on their autonomy. Central to our design philosophy is the use of schemas. We have adopted a federated architecture with a five-level schema, arising from the use of the ANSI-SPARC three-level schema to describe both the existing autonomous data resources and the mediator itself. We describe the use of mapping functions and list comprehensions in query splitting, producing execution plans, code generation, and result fusion. We give an example of cross-database querying involving data held locally in P/FDM systems and external data in SRS.
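The abstract names list comprehensions as the vehicle for query splitting and result fusion. The following is a minimal sketch of that general idea only, not the mediator's actual implementation: the two in-memory lists stand in for a local store and an external resource, and every name in it (`LOCAL_PROTEINS`, `EXTERNAL_ENTRIES`, `local_subquery`, `external_subquery`, `fuse`, `cross_database_query`) is hypothetical.

```python
# Hypothetical stand-ins for two autonomous data resources.
LOCAL_PROTEINS = [  # imagined as data held in a local database
    {"id": "P1", "name": "lysozyme", "length": 129},
    {"id": "P2", "name": "myoglobin", "length": 153},
    {"id": "P3", "name": "insulin", "length": 51},
]
EXTERNAL_ENTRIES = [  # imagined as records returned by an external resource
    {"protein_id": "P1", "keyword": "hydrolase"},
    {"protein_id": "P2", "keyword": "oxygen transport"},
    {"protein_id": "P3", "keyword": "hormone"},
]


def local_subquery(min_length):
    """Sub-query 1: the part of the query answerable by the local source."""
    return [p for p in LOCAL_PROTEINS if p["length"] >= min_length]


def external_subquery(ids):
    """Sub-query 2: the part delegated to the external source."""
    wanted = set(ids)
    return [e for e in EXTERNAL_ENTRIES if e["protein_id"] in wanted]


def fuse(proteins, entries):
    """Result fusion: join the two partial results on the shared identifier."""
    return [
        (p["name"], e["keyword"])
        for p in proteins
        for e in entries
        if e["protein_id"] == p["id"]
    ]


def cross_database_query(min_length):
    """Split the query, run each part against its source, then fuse."""
    proteins = local_subquery(min_length)
    entries = external_subquery([p["id"] for p in proteins])
    return fuse(proteins, entries)


if __name__ == "__main__":
    print(cross_database_query(100))
```

In this sketch the local result is evaluated first so that only the matching identifiers are passed to the external sub-query; a real mediator would choose the order as part of producing an execution plan.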
Year
2002
DOI
10.1109/TITB.2002.1006298
Venue
IEEE Transactions on Information Technology in Biomedicine: a publication of the IEEE Engineering in Medicine and Biology Society
Keywords
query splitting, mapping functions, bioinformatics database federation, mediator architecture, code generation, list comprehensions, indexes, multiple data resources, scientific information systems, result fusion, autonomous data resources, data analysis, execution plans, heterogeneous distributed biological databases, P/FDM functional data model database, ad hoc queries, heterogeneous data resources, biology computing, data models, biological function, cross-database querying, ANSI-SPARC three-level schema, data correlation, five-level schema, data integration, distributed databases, query processing, program compilers, search
Field
Data science, Query optimization, Data architecture, Data modeling, Data mining, Computer science, Biological database, Code generation, Prolog, Distributed database, Database, The Internet
DocType
Journal
Volume
6
Issue
2
ISSN
1089-7771
Citations
8
PageRank
0.64
References
16
Authors
3
Name
Order
Citations
PageRank
Graham J L Kemp1131.14
Nicos Angelopoulos25311.48
Peter M. D. Gray3560243.64