Title
The InterPro Protein Families Database: The Classification Resource After 15 Years
Abstract
The InterPro database (http://www.ebi.ac.uk/interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures, capitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro as it enters its 15th year of operation, and give an overview of new developments with the database and its associated Web interfaces and software. In particular, the new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined. We also discuss the challenges faced by the resource given the explosive growth in sequence data in recent years. InterPro (version 48.0) contains 36 766 member database signatures integrated into 26 238 InterPro entries, an increase of over 3993 entries (5081 signatures), since 2012.
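The counts quoted at the end of the abstract can be turned into a small worked calculation. The following is a minimal sketch, not part of the paper: it assumes the quoted increases are exact (the abstract says "over 3993 entries", so the derived 2012 baselines are approximate) and that every counted signature is integrated into an entry, as the abstract's wording suggests. The function and variable names are illustrative only.

```python
# Figures quoted in the abstract for InterPro version 48.0.
SIGNATURES_V48 = 36_766
ENTRIES_V48 = 26_238

# Increases since 2012 as quoted in the abstract. The text says "over 3993
# entries", so anything derived from these values is approximate.
ENTRY_INCREASE = 3_993
SIGNATURE_INCREASE = 5_081


def implied_baseline(current: int, increase: int) -> int:
    """Return the count implied before the quoted increase."""
    return current - increase


def relative_growth(current: int, increase: int) -> float:
    """Return the increase as a fraction of the implied baseline."""
    return increase / implied_baseline(current, increase)


# Implied 2012 counts (assumption: the quoted increases are exact).
entries_2012 = implied_baseline(ENTRIES_V48, ENTRY_INCREASE)
signatures_2012 = implied_baseline(SIGNATURES_V48, SIGNATURE_INCREASE)

# Mean number of member database signatures per InterPro entry
# (assumption: all counted signatures are integrated into entries).
signatures_per_entry = SIGNATURES_V48 / ENTRIES_V48

if __name__ == "__main__":
    print(f"Implied 2012 entries:    {entries_2012}")
    print(f"Implied 2012 signatures: {signatures_2012}")
    print(f"Entry growth:     {relative_growth(ENTRIES_V48, ENTRY_INCREASE):.2%}")
    print(f"Signature growth: {relative_growth(SIGNATURES_V48, SIGNATURE_INCREASE):.2%}")
    print(f"Signatures per entry (v48.0): {signatures_per_entry:.2f}")
```

Under these assumptions, entries and signatures grew at broadly similar rates over the period, and each entry groups a little more than one signature on average.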
Year
2015
DOI
10.1093/nar/gku1243
Venue
NUCLEIC ACIDS RESEARCH
Keywords
bacteria, tertiary, databases, proteins, protein structure, sequence analysis
DocType
Journal
Volume
43
Issue
D1
ISSN
0305-1048
Citations
84
PageRank
3.64
References
26
Authors
36
Name                    Order  Citations  PageRank
Alex Mitchell           1      1120       115.01
Hsin-Yu Chang           2      205        9.28
Louise Daugherty        3      566        49.37
Matthew Fraser          4      377        18.54
Sarah Hunter            5      617        51.21
Rodrigo Lopez           6      4865       744.83
Craig McAnulla          7      609        50.49
Conor McMenamin         8      225        11.99
Gift Nuka               9      182        9.14
Sebastien Pesseat       10     336        17.47
Amaia Sangrador-Vegas   11     517        23.09
Maxim Scheremetjew      12     340        18.25
Claudia Rato            13     84         3.64
Siew-Yit Yong           14     359        17.55
Alex Bateman            15     5461       1054.58
Marco Punta             16     1709       194.79
Teresa K Attwood        17     972        94.32
Christian J A Sigrist   18     1850       342.60
N Redaschi              19     2104       283.38
Catherine Rivoire       20     416        39.23
Ioannis Xenarios        21     2301       293.04
Daniel Kahn             22     1296       312.44
Dominique Guyot         23     84         3.64
Peer Bork               24     4451       694.12
Ivica Letunic           25     1650       255.45
Julian Gough            26     718        70.93
Matt E. Oates           27     122        6.23
Daniel H. Haft          28     1144       230.24
Hongzhan Huang          29     2479       346.17
Darren A. Natale        30     2771       408.32
Cathy H. Wu             31     4169       508.88
Christine A. Orengo     32     1344       159.31
Ian Sillitoe            33     821        93.94
Huaiyu Mi               34     84         3.64
Paul D Thomas           35     904        82.65
Robert D Finn           36     4179       636.56