Title
A metadata framework for interoperating heterogeneous genome data using XML.
Abstract
The rapid advances in the Human Genome Project and genomic technologies have produced massive amounts of data populated in a large number Of network-accessible databases. These technological advances and the associated data can have a great impact on biomedicine and healthcare. To answer many of the biologically or medically important questions, researchers often need to integrate data from a number of independent but related genome databases. One common practice is to download data sets (text files) from various genome Web sites and process them by some local programs. One main problem with this approach is that these programs are written on a case-by-case basis because the data sets involved are heterogeneous in structure. To address this problem, we define metadata that maps these heterogeneously structured files into a common eXtensible Markup Language (XML) structure to facilitate data interoperation. We illustrate this approach by interoperating two sets of essential yeast genes that are stored in two yeast genome databases (MIPS and YPD).
Year
Venue
Keywords
2001
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION
internet,programming languages
Field
DocType
Issue
Genome,Metadata,Data set,World Wide Web,XML,Computer science,Interoperation,Biomedicine,Human genome,Database,The Internet
Conference
SUPnan
ISSN
Citations 
PageRank 
1067-5027
1
0.40
References 
Authors
0
8
Name
Order
Citations
PageRank
Kei-hoi Cheung166460.65
Aniruddha M. Deshpande2255.91
Nick Tosches3133.68
S. Nath410.73
A. Agrawal562.70
P L Miller644593.86
Anuj Kumar72311.02
Michael Snyder813826.15