The CATH hierarchy revisited-structural divergence in domain superfamilies and the continuity of fold space.

PubWeight™: 1.62‹?› | Rank: Top 4%

🔗 View Article (PMC 2741583)

Published in Structure on August 12, 2009

Authors

Alison Cuff1, Oliver C Redfern, Lesley Greene, Ian Sillitoe, Tony Lewis, Mark Dibley, Adam Reid, Frances Pearl, Tim Dallman, Annabel Todd, Richard Garratt, Janet Thornton, Christine Orengo

Author Affiliations

1: Institute of Structural and Molecular Biology, University College London, London, UK. cuff@biochem.ucl.ac.uk

Articles citing this

The RCSB Protein Data Bank: redesigned web site and web services. Nucleic Acids Res (2010) 7.68

Extending CATH: increasing coverage of the protein structure universe and linking structure with function. Nucleic Acids Res (2010) 2.93

Pre-calculated protein structure alignments at the RCSB PDB website. Bioinformatics (2010) 2.28

CATH: comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res (2014) 2.03

Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently. Chem Soc Rev (2015) 1.74

Gene3D: a domain-based resource for comparative genomics, functional annotation and protein network analysis. Nucleic Acids Res (2011) 1.65

Further evidence for the likely completeness of the library of solved single domain protein structures. J Phys Chem B (2012) 1.07

A galaxy of folds. Protein Sci (2010) 1.05

Detailed analysis of function divergence in a large and diverse domain superfamily: toward a refined protocol of function classification. Structure (2010) 1.03

The Proteome Folding Project: proteome-scale prediction of structure and function. Genome Res (2011) 0.98

From protein sequences to 3D-structures and beyond: the example of the UniProt knowledgebase. Cell Mol Life Sci (2009) 0.95

Trends in structural coverage of the protein universe and the impact of the Protein Structure Initiative. Proc Natl Acad Sci U S A (2014) 0.92

Why not consider a spherical protein? Implications of backbone hydrogen bonding for protein structure and function. Phys Chem Chem Phys (2011) 0.91

Protein folds and protein folding. Protein Eng Des Sel (2010) 0.85

The CATH database. Hum Genomics (2010) 0.84

Longitudinal genomic surveillance of Plasmodium falciparum malaria parasites reveals complex genomic architecture of emerging artemisinin resistance. Genome Biol (2017) 0.82

Biophysical constraints on the evolution of tissue structure and function. J Physiol (2014) 0.80

An Algebro-topological description of protein domain structure. PLoS One (2011) 0.79

Rebelling for a reason: protein structural "outliers". PLoS One (2013) 0.78

Diversity in protein domain superfamilies. Curr Opin Genet Dev (2015) 0.77

The history of the CATH structural classification of protein domains. Biochimie (2015) 0.76

PDB-Explorer: a web-based interactive map of the protein data bank in shape space. BMC Bioinformatics (2015) 0.76

Development of a motif-based topology-independent structure comparison method to identify evolutionarily related folds. Proteins (2016) 0.75

Extending Protein Domain Boundary Predictors to Detect Discontinuous Domains. PLoS One (2015) 0.75

Protein structural motifs in prediction and design. Curr Opin Struct Biol (2017) 0.75

Articles cited by this

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet (2000) 336.52

SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol (1995) 74.88

CATH--a hierarchic classification of protein domain structures. Structure (1997) 29.95

The relation between the divergence of sequence and structure in proteins. EMBO J (1986) 16.66

Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation. Bioinformatics (2003) 9.47

The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Res (2004) 8.79

COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. J Mol Biol (2003) 8.35

HOMSTRAD: a database of protein structure alignments for homologous families. Protein Sci (1998) 5.62

Protein superfamilies and domain superfolds. Nature (1994) 5.11

The SUPERFAMILY database in 2007: families and functions. Nucleic Acids Res (2006) 4.27

Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures. J Mol Biol (2005) 4.02

The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution. Nucleic Acids Res (2006) 3.84

Fold change in evolution of protein structures. J Struct Biol (2001) 3.46

Quantifying the similarities within fold space. J Mol Biol (2002) 2.65

Structural similarity of DNA-binding domains of bacteriophage repressors and the globin core. Curr Biol (1993) 2.53

Protein families and their evolution-a structural perspective. Annu Rev Biochem (2005) 2.45

Protein structure comparison: implications for the nature of 'fold space', and structure and function prediction. Curr Opin Struct Biol (2006) 2.37

Phylogeny determined by protein domain content. Proc Natl Acad Sci U S A (2005) 2.37

Exploring structural homology of proteins. J Mol Biol (1976) 2.23

Progress of structural genomics initiatives: an analysis of solved target structures. J Mol Biol (2005) 2.22

Structural diversity of domain superfamilies in the CATH database. J Mol Biol (2006) 2.10

On the evolution of protein folds: are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world? J Struct Biol (2001) 1.90

Implications of structural genomics target selection strategies: Pfam5000, whole genome, and random approaches. Proteins (2005) 1.71

Alpha plus beta folds revisited: some favoured motifs. Structure (1993) 1.57

Identification and distribution of protein families in 120 completed genomes using Gene3D. Proteins (2005) 1.54

CATHEDRAL: a fast and effective algorithm to predict folds and domain boundaries from multidomain protein structures. PLoS Comput Biol (2007) 1.51

Structural drift: a possible path to protein fold change. Bioinformatics (2004) 1.46

Recognizing the fold of a protein structure. Bioinformatics (2003) 1.45

Progress towards mapping the universe of protein folds. Genome Biol (2004) 1.41

3Dee: a database of protein structural domains. Bioinformatics (2001) 1.33

The structure of protein evolution and the evolution of protein structure. Curr Opin Struct Biol (2008) 1.32

Gene3D: structural assignment for whole genes and genomes using the CATH domain structure database. Genome Res (2002) 1.29

SCOP, Structural Classification of Proteins database: applications to evaluation of the effectiveness of sequence alignment methods and statistics of protein structural data. Acta Crystallogr D Biol Crystallogr (1998) 1.29

Predicting protein function with hierarchical phylogenetic profiles: the Gene3D Phylo-Tuner method applied to eukaryotic genomes. PLoS Comput Biol (2007) 1.29

A discrete view on fold space. Bioinformatics (2008) 1.18

Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res (2006) 1.17

Protein structural domains: analysis of the 3Dee domains database. Proteins (2001) 1.06

Methods of remote homology detection can be combined to increase coverage by 10% in the midnight zone. Bioinformatics (2007) 1.06

Structural similarity of loops in protein families: toward the understanding of protein evolution. BMC Evol Biol (2005) 1.01

What are the baselines for protein fold recognition? Bioinformatics (2001) 0.98

Fragnostic: walking through protein structure space. Nucleic Acids Res (2005) 0.97

Articles by these authors

InterPro: the integrative protein signature database. Nucleic Acids Res (2008) 25.07

InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res (2011) 13.45

New developments in the InterPro database. Nucleic Acids Res (2007) 12.49

Prepublication data sharing. Nature (2009) 12.24

The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis. Nucleic Acids Res (2005) 5.59

The European Bioinformatics Institute's data resources: towards systems biology. Nucleic Acids Res (2005) 4.90

A large-scale evaluation of computational protein function prediction. Nat Methods (2013) 4.61

The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution. Nucleic Acids Res (2006) 3.84

The European dimension for the mouse genome mutagenesis program. Nat Genet (2004) 3.84

Predicting protein function from sequence and structure. Nat Rev Mol Cell Biol (2007) 3.37

Extending CATH: increasing coverage of the protein structure universe and linking structure with function. Nucleic Acids Res (2010) 2.93

The CATH classification revisited--architectures reviewed and new ways to characterize structural divergence in superfamilies. Nucleic Acids Res (2008) 2.92

Gene3D: comprehensive structural and functional annotation of genomes. Nucleic Acids Res (2007) 2.75

Construction, visualisation, and clustering of transcription networks from microarray expression data. PLoS Comput Biol (2007) 2.73

Quantifying the similarities within fold space. J Mol Biol (2002) 2.65

The European Bioinformatics Institute's data resources. Nucleic Acids Res (2009) 2.55

European regulation on orphan medicinal products: 10 years of experience and future perspectives. Nat Rev Drug Discov (2011) 2.46

Gene3D: merging structure and function for a Thousand genomes. Nucleic Acids Res (2009) 2.34

Outcome of a workshop on archiving structural models of biological macromolecules. Structure (2006) 2.32

Gene3D: modelling protein structure, function and evolution. Nucleic Acids Res (2006) 2.30

Protein function annotation by homology-based inference. Genome Biol (2009) 2.20

Structural diversity of domain superfamilies in the CATH database. J Mol Biol (2006) 2.10

Transient protein-protein interactions: structural, functional, and network properties. Structure (2010) 2.07

PSI-2: structural genomics to cover protein domain family space. Structure (2009) 1.92

Structural and chemical profiling of the human cytosolic sulfotransferases. PLoS Biol (2007) 1.90

Gene3D: a domain-based resource for comparative genomics, functional annotation and protein network analysis. Nucleic Acids Res (2011) 1.65

Genome3D: a UK collaborative project to annotate genomic sequences with predicted 3D structures based on SCOP and CATH domains. Nucleic Acids Res (2012) 1.63

The CCPN project: an interim report on a data model for the NMR community. Nat Struct Biol (2002) 1.58

Conformational changes observed in enzyme crystal structures upon substrate binding. J Mol Biol (2004) 1.56

Identification and distribution of protein families in 120 completed genomes using Gene3D. Proteins (2005) 1.54

Structural genomics is the largest contributor of novel structural leverage. J Struct Funct Genomics (2009) 1.52

CATHEDRAL: a fast and effective algorithm to predict folds and domain boundaries from multidomain protein structures. PLoS Comput Biol (2007) 1.51

GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains. Nucleic Acids Res (2009) 1.50

Consensus clustering and functional interpretation of gene-expression data. Genome Biol (2004) 1.48

The Pain in Neuropathy Study (PiNS): a cross-sectional observational study determining the somatosensory phenotype of painful and painless diabetic neuropathy. Pain (2016) 1.47

Recognizing the fold of a protein structure. Bioinformatics (2003) 1.45

Minimum information about a bioactive entity (MIABE). Nat Rev Drug Discov (2011) 1.44

Diapause-associated metabolic traits reiterated in long-lived daf-2 mutants in the nematode Caenorhabditis elegans. Mech Ageing Dev (2006) 1.41

Progress towards mapping the universe of protein folds. Genome Biol (2004) 1.41

Detection of 3D atomic similarities and their use in the discrimination of small molecule protein-binding sites. Bioinformatics (2008) 1.36

The CATH protein family database: a resource for structural and functional annotation of genomes. Proteomics (2002) 1.33

Public health value of next-generation DNA sequencing of enterohemorrhagic Escherichia coli isolates from an outbreak. J Clin Microbiol (2012) 1.31

Towards fully automated structure-based function prediction in structural genomics: a case study. J Mol Biol (2007) 1.26

Retrograde signaling is regulated by the dynamic interaction between Rtg2p and Mks1p. Mol Cell (2003) 1.24

Enteroaggregative E. coli O104 from an outbreak of HUS in Germany 2011, could it happen again? J Infect Dev Ctries (2011) 1.23

Gene3D: Multi-domain annotations for protein sequence and comparative genome analysis. Nucleic Acids Res (2013) 1.23

Assessing strategies for improved superfamily recognition. Protein Sci (2005) 1.21

A fast and automated solution for accurately resolving protein domain architectures. Bioinformatics (2010) 1.20

Trimethylaminuria and a human FMO3 mutation database. Hum Mutat (2003) 1.20

RTG-dependent mitochondria-to-nucleus signaling is regulated by MKS1 and is linked to formation of yeast prion [URE3]. Mol Biol Cell (2002) 1.18

FunTree: a resource for exploring the functional evolution of structurally defined enzyme superfamilies. Nucleic Acids Res (2011) 1.17

The European Bioinformatics Institute's data resources 2014. Nucleic Acids Res (2013) 1.17

ENFIN--A European network for integrative systems biology. C R Biol (2009) 1.17

Retrograde response to mitochondrial dysfunction is separable from TOR1/2 regulation of retrograde gene expression. J Biol Chem (2005) 1.16

CXCL5 mediates UVB irradiation-induced pain. Sci Transl Med (2011) 1.15

Comparison of dorsal root ganglion gene expression in rat models of traumatic and HIV-associated neuropathic pain. Eur J Pain (2008) 1.12

ReadqPCR and NormqPCR: R packages for the reading, quality checking and normalisation of RT-qPCR quantification cycle (Cq) data. BMC Genomics (2012) 1.11

Identification of new herpesvirus gene homologs in the human genome. Genome Res (2002) 1.10

FLORA: a novel method to predict protein function from structure in diverse superfamilies. PLoS Comput Biol (2009) 1.09

Exploring the evolution of novel enzyme functions within structurally defined protein superfamilies. PLoS Comput Biol (2012) 1.09

Activation of the SPS amino acid-sensing pathway in Saccharomyces cerevisiae correlates with the phosphorylation state of a sensor component, Ptr3. Mol Cell Biol (2007) 1.07

Gene3D: structural assignments for the biologist and bioinformaticist alike. Nucleic Acids Res (2003) 1.07

The little elongation complex regulates small nuclear RNA transcription. Mol Cell (2011) 1.06

Detailed analysis of function divergence in a large and diverse domain superfamily: toward a refined protocol of function classification. Structure (2010) 1.03

Exploiting structural classifications for function prediction: towards a domain grammar for protein function. Curr Opin Struct Biol (2009) 1.02

The metastasis-promoting phosphatase PRL-3 shows activity toward phosphoinositides. Biochemistry (2011) 1.00

Annotations for all by all - the BioSapiens network. Genome Biol (2009) 1.00

Severity of children's ADHD symptoms and parenting stress: a multiple mediation model of self-regulation. J Abnorm Child Psychol (2011) 0.97

A comparison of RNA-seq and exon arrays for whole genome transcription profiling of the L5 spinal nerve transection model of neuropathic pain in the rat. Mol Pain (2014) 0.95

Cancer-associated mutations are preferentially distributed in protein kinase functional sites. Proteins (2009) 0.94

The new science of ageing. Philos Trans R Soc Lond B Biol Sci (2011) 0.94

A novel degron-mediated degradation of the RTG pathway regulator, Mks1p, by SCFGrr1. Mol Biol Cell (2005) 0.94