Title
GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies.
Abstract
Since proteins evolve by divergent evolution, proteins with distant homology to each other may or may not bear similar functions. Improved computational approaches are required to recognize distant homologues that are functionally similar. One of the methods of assigning function to sequences is to use profiles derived from sequences of known structure. We describe an update of the Genomic Distribution of protein structural domain Superfamilies (GenDiS) database, namely GenDiS+, which provides a projection of SCOP superfamily members on the sequence space (NR database, NCBI). The sequences are validated using structure-based sequence alignment profiles and domain and full-length sequence alignments. GenDiS+ is a 'tour de force' for detecting homologues within around 160 000 taxonomic identifiers, starting from nearly 11 000 domains of known structure. Features, like full-sequence alignment and phylogeny, domain sequence alignment and phylogeny, list of associated structural and sequence domains with strength of interactions, links to databases like Pfam, UniProt and ModBase and list of sequences with a PDB structure, are provided.
Year
2019
DOI
10.1093/database/baz042
Venue
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION
Field
Data mining, Protein domain, Computer science, Computational biology, Homologous Sequences
DocType
Journal
Volume
2019
ISSN
1758-0463
Citations
0
PageRank
0.34
References
0
Authors
4
Name | Order | Citations | PageRank
Meenakshi S. Iyer | 1 | 0 | 0.34
Kartik Bhargava | 2 | 0 | 0.34
Murugavel Pavalam | 3 | 0 | 0.34
Ramanathan Sowdhamini | 4 | 215 | 21.20