Title
Chloroplastdb: The Chloroplast Genome Database
Abstract
The Chloroplast Genome Database (ChloroplastDB) is an interactive, web-based database for fully sequenced plastid genomes, containing genomic, protein, DNA and RNA sequences, gene locations, RNA-editing sites, putative protein families and alignments (http://chloroplast.cbio.psu.edu/). With recent technical advances, the rate of generating new organelle genomes has increased dramatically. However, the established ontology for chloroplast genes and gene features has not been uniformly applied to all chloroplast genomes available in the sequence databases. For example, annotations for some published genome sequences have not evolved with gene naming conventions. ChloroplastDB provides unified annotations, gene name search, BLAST and download functions for chloroplast encoded genes and genomic sequences. A user can retrieve all orthologous sequences with one search regardless of gene names in GenBank. This feature alone greatly facilitates comparative research on sequence evolution including changes in gene content, codon usage, gene structure and post-transcriptional modifications such as RNA editing. Orthologous protein sets are classified by TribeMCL and each set is assigned a standard gene name. Over the next few years, as the number of sequenced chloroplast genomes increases rapidly, the tools available in ChloroplastDB will allow researchers to easily identify and compile target data for comparative analysis of chloroplast genes and genomes.
Year
DOI
Venue
2006
10.1093/nar/gkj055
NUCLEIC ACIDS RESEARCH
Keywords
Field
DocType
comparative analysis,database management systems,genome sequence,rna editing,internet,protein family,codon usage,comparative research,genomics,chloroplasts,gene structure
Genome,Protein family,Gene,Biology,RNA editing,Genomics,Comparative genomics,Genetics,GenBank,Molecular biology,Gene nomenclature
Journal
Volume
Issue
ISSN
34
Database issue
0305-1048
Citations 
PageRank 
References 
8
0.71
5
Authors
8