Abstract | ||
---|---|---|
In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1371/journal.pcbi.1004274 | PLOS COMPUTATIONAL BIOLOGY |
Field | DocType | Volume |
Genome,Management information systems,Data mining,Data processing,Biology,Exome,Genomics,Genome evolution,Software,Bioinformatics,Genetics,Data management | Journal | 11 |
Issue | ISSN | Citations |
7 | 1553-734X | 5 |
PageRank | References | Authors |
0.79 | 18 | 64 |