Title
RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures.
Abstract
RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high-quality annotations for ~5400 protein structures. RepeatsDB 2.0 features information on start and end positions for the repeat regions and units for all entries. The extensive growth of repeat unit characterization was made possible by applying the novel ReUPred annotation method over the entire Protein Data Bank, with data quality guaranteed by an extensive manual validation for >60% of the entries. The updated web interface includes a new search engine for complex queries and a fully re-designed entry page for a better overview of structural data. It is now possible to compare unit positions, together with secondary structure, fold information and Pfam domains. Moreover, a new classification level has been introduced on top of the existing scheme as an independent layer for sequence similarity relationships at 40%, 60% and 90% identity.
Year
2017
DOI
10.1093/nar/gkw1136
Venue
NUCLEIC ACIDS RESEARCH
Field
Text mining, Search engine, Annotation, Information retrieval, Biology, Visualization, Bioinformatics, Genetics, Protein structure
DocType
Journal
Volume
45
Issue
D1
ISSN
0305-1048
Citations
2
PageRank
0.45
References
13
Authors
6
Name, Order, Citations, PageRank
Lisanna Paladin, 1, 22, 5.40
Layla Hirsh, 2, 14, 2.33
Damiano Piovesan, 3, 58, 8.54
Miguel A. Navarro, 4, 15, 3.85
Andrey V Kajava, 5, 48, 5.00
Silvio C E Tosatto, 6, 435, 37.12