Title
Optimizing high performance computing workflow for protein functional annotation
Abstract
Functional annotation of newly sequenced genomes is one of the major challenges in modern biology. With modern sequencing technologies, the protein sequence universe is rapidly expanding. Newly sequenced bacterial genomes alone contain over 7.5 million proteins. The rate of data generation has far surpassed that of protein annotation. The volume of protein data makes manual curation
Year
DOI
Venue
2014
10.1002/cpe.3264
Concurrency and Computation: Practice & Experience
Keywords
Field
DocType
psu,cog,XSEDE,BLAST,psi-blast,COG,hspp-blast,sequence similarity,data-enabled life sciences,xsede,protein sequence universe,petascale,computational bioinformatics,science gateways,protein annotation,blast,HSPp-BLAST,PSI-BLAST,PS
Data science,Genome,Annotation,Protein sequencing,Computer science,Protein Annotation,Computational biology,Petascale computing,Workflow,Bacterial genome size,Test data generation,Distributed computing
Journal
Volume
Issue
ISSN
26
13
1532-0626
Citations 
PageRank 
References 
2
0.37
11
Authors
9
Name
Order
Citations
PageRank
Larissa Stanberry1295.14
Bhanu Rekepalli2133.23
Yuan Liu320.71
Paul Giblock4101.12
Roger Higdon5436.96
Elizabeth Montague620.37
William Broomall7294.13
Natali Kolker8294.46
Eugene Kolker95410.90