Volume changes in protein evolution.

PubWeight™: 12.07‹?› | Rank: Top 0.1% | All-Time Top 10000

🔗 View Article (PMID 8120887)

Published in J Mol Biol on March 04, 1994

Authors

M Gerstein1, E L Sonnhammer, C Chothia

Author Affiliations

1: MRC Laboratory of Molecular Biology, Cambridge, U.K.

Articles citing this

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics (2004) 50.89

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res (2001) 22.33

Rfam: updates to the RNA families database. Nucleic Acids Res (2008) 11.61

The distributions, mechanisms, and structures of metabolite-binding riboswitches. Genome Biol (2007) 3.80

Identification of 22 candidate structured RNAs in bacteria using the CMfinder comparative genomics pipeline. Nucleic Acids Res (2007) 3.59

Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins. Protein Sci (1998) 2.60

The structural alignment between two proteins: is there a unique answer? Protein Sci (1996) 2.50

6S RNA is a widespread regulator of eubacterial RNA polymerase that resembles an open promoter. RNA (2005) 2.28

Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome. Genome Res (2007) 1.82

PSI-BLAST pseudocounts and the minimum description length principle. Nucleic Acids Res (2008) 1.68

A phylogenomic profile of globins. BMC Evol Biol (2006) 1.64

R2R--software to speed the depiction of aesthetic consensus RNA secondary structures. BMC Bioinformatics (2011) 1.60

The construction and use of log-odds substitution scores for multiple sequence alignment. PLoS Comput Biol (2010) 1.54

Predicting N-terminal myristoylation sites in plant proteins. BMC Genomics (2004) 1.53

Riboswitches in eubacteria sense the second messenger c-di-AMP. Nat Chem Biol (2013) 1.47

Quantitative assessment of protein function prediction from metagenomics shotgun sequences. Proc Natl Acad Sci U S A (2007) 1.47

A structural census of the current population of protein sequences. Proc Natl Acad Sci U S A (1997) 1.44

Structural alignment of proteins by a novel TOPOFIT method, as a superimposition of common volumes at a topomax point. Protein Sci (2004) 1.39

Improved profile HMM performance by assessment of critical algorithmic features in SAM and HMMER. BMC Bioinformatics (2005) 1.33

CRASP: a program for analysis of coordinated substitutions in multiple alignments of protein sequences. Nucleic Acids Res (2004) 1.32

Identification of candidate structured RNAs in the marine organism 'Candidatus Pelagibacter ubique'. BMC Genomics (2009) 1.21

Bioinformatics assessment of beta-myosin mutations reveals myosin's high sensitivity to mutations. Trends Cardiovasc Med (2008) 1.20

Alignment of 700 globin sequences: extent of amino acid substitution and its correlation with variation in volume. Protein Sci (1995) 1.14

Sequence variation in G-protein-coupled receptors: analysis of single nucleotide polymorphisms. Nucleic Acids Res (2005) 1.12

The weighted-volume derivative of a space-filling diagram. Proc Natl Acad Sci U S A (2003) 1.11

Cavities and atomic packing in protein structures and interfaces. PLoS Comput Biol (2008) 1.08

Optimization algorithms for functional deimmunization of therapeutic proteins. BMC Bioinformatics (2010) 1.08

Constructing a meaningful evolutionary average at the phylogenetic center of mass. BMC Bioinformatics (2007) 1.02

LPFC: an Internet library of protein family core structures. Protein Sci (1997) 1.02

Augmented training of hidden Markov models to recognize remote homologs via simulated evolution. Bioinformatics (2009) 0.97

Design and analysis of immune-evading enzymes for ADEPT therapy. Protein Eng Des Sel (2012) 0.95

Modeling an evolutionary conserved circadian cis-element. PLoS Comput Biol (2008) 0.93

Proteins: form and function. Bioeng Bugs (2012) 0.92

Thermodynamics of protein destabilization in live cells. Proc Natl Acad Sci U S A (2015) 0.91

Folding simulations of alanine-based peptides with lysine residues. Biophys J (1995) 0.88

Pathway analysis of genome-wide data improves warfarin dose prediction. BMC Genomics (2013) 0.87

Recognition of beta-structural motifs using hidden Markov models trained with simulated evolution. Bioinformatics (2010) 0.87

An assessment of substitution scores for protein profile-profile comparison. Bioinformatics (2011) 0.85

Computational identification of novel amino-acid interactions in HIV Gag via correlated evolution. PLoS One (2012) 0.83

Widespread occurrence of organelle genome-encoded 5S rRNAs including permuted molecules. Nucleic Acids Res (2014) 0.83

CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation. Nucleic Acids Res (2012) 0.82

Identification of specificity determining residues in peptide recognition domains using an information theoretic approach applied to large-scale binding maps. BMC Biol (2011) 0.82

A statistical test for conserved RNA structure shows lack of evidence for structure in lncRNAs. Nat Methods (2016) 0.81

Predicting drug-target interactions using drug-drug interactions. PLoS One (2013) 0.77

Dissecting the roles of local packing density and longer-range effects in protein sequence evolution. Proteins (2016) 0.77

Ambivalent covariance models. BMC Bioinformatics (2015) 0.77

IRBIS: a systematic search for conserved complementarity. RNA (2014) 0.76

A sequence sub-sampling algorithm increases the power to detect distant homologues. Nucleic Acids Res (2005) 0.75

Identification of Position-Specific Correlations between DNA-Binding Domains and Their Binding Sites. Application to the MerR Family of Transcription Factors. PLoS One (2016) 0.75

The conserved characteristics of DNA-binding domains belonging to the homeodomain class that are associated with coadaptive substitutions of amino acid residues. Dokl Biochem Biophys (2001) 0.75

Articles by these authors

SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol (1995) 74.88

Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol (2001) 66.87

The Pfam protein families database. Nucleic Acids Res (2000) 42.28

Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature (2002) 28.79

Determinants of a protein fold. Unique features of the globin amino acid sequences. J Mol Biol (1987) 18.58

The relation between the divergence of sequence and structure in proteins. EMBO J (1986) 16.66

Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J Mol Biol (2001) 16.47

Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol (2001) 15.97

SCOP: a structural classification of proteins database. Nucleic Acids Res (2000) 14.14

The atomic structure of protein-protein recognition sites. J Mol Biol (1999) 12.63

Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins. Nucleic Acids Res (1999) 11.64

Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. J Mol Biol (1998) 9.09

Proteins. One thousand families for the molecular biologist. Nature (1992) 7.83

Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. Proc Natl Acad Sci U S A (1998) 7.18

Principles of protein-protein recognition. Nature (1975) 6.50

Structural patterns in globular proteins. Nature (1976) 5.95

The nature of the accessible and buried surfaces in proteins. J Mol Biol (1976) 5.92

How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. J Mol Biol (1980) 5.85

Hydrophobic bonding and accessible surface area in proteins. Nature (1974) 5.69

Structural invariants in protein folding. Nature (1975) 5.60

Interior and surface of monomeric proteins. J Mol Biol (1987) 5.42

Structural mechanisms for domain movements in proteins. Biochemistry (1994) 5.36

Surface, subunit interfaces and interior of oligomeric proteins. J Mol Biol (1988) 4.80

The structure of protein-protein recognition sites. J Biol Chem (1990) 4.55

Helix to helix packing in proteins. J Mol Biol (1981) 4.24

Understanding protein structure: using scop for fold interpretation. Methods Enzymol (1996) 4.22

Principles that determine the structure of proteins. Annu Rev Biochem (1984) 4.15

Haemoglobin: the structural changes related to ligand binding and its allosteric mechanism. J Mol Biol (1979) 3.79

NIFAS: visual analysis of domain evolution in proteins. Bioinformatics (2001) 3.60

Many of the immunoglobulin superfamily domains in cell adhesion molecules and surface receptors belong to a new structural set which is close to that containing variable domains. J Mol Biol (1994) 3.57

Volume changes on protein folding. Structure (1994) 3.54

The packing density in proteins: standard radii and volumes. J Mol Biol (1999) 3.48

Standard conformations for the canonical structures of immunoglobulins. J Mol Biol (1997) 3.42

Conformation of twisted beta-pleated sheets in proteins. J Mol Biol (1973) 3.15

Intermediate sequences increase the detection of homology between sequences. J Mol Biol (1997) 2.98

The accessible surface area and stability of oligomeric proteins. Nature (1987) 2.77

Cadherin superfamily proteins in Caenorhabditis elegans and Drosophila melanogaster. J Mol Biol (2001) 2.64

Structure of proteins: packing of alpha-helices and pleated sheets. Proc Natl Acad Sci U S A (1977) 2.60

RSDB: representative protein sequence databases have high information content. Bioinformatics (2000) 2.55

Evolution of proteins formed by beta-sheets. II. The core of the immunoglobulin domains. J Mol Biol (1982) 2.47

beta-Trefoil fold. Patterns of structure and sequence in the Kunitz inhibitors interleukins-1 beta and 1 alpha and fibroblast growth factors. J Mol Biol (1992) 2.45

SCOP: a structural classification of proteins database. Nucleic Acids Res (1997) 2.44

SCOP: a Structural Classification of Proteins database. Nucleic Acids Res (1999) 2.20

The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. J Mol Biol (2001) 2.15

Advances in structural genomics. Curr Opin Struct Biol (1999) 2.09

Structural assignments to the Mycoplasma genitalium proteins show extensive gene duplications and domain rearrangements. Proc Natl Acad Sci U S A (1998) 2.08

Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL. Bioinformatics (2000) 2.04

Sequence of the human immunoglobulin diversity (D) segment locus: a systematic analysis provides no evidence for the use of DIR segments, inverted D segments, "minor" D segments or D-D recombination. J Mol Biol (1997) 2.03

Domain association in immunoglobulin molecules. The packing of variable domains. J Mol Biol (1985) 2.03

Population statistics of protein structures: lessons from structural classifications. Curr Opin Struct Biol (1997) 1.98

Conformations of the third hypervariable region in the VH domain of immunoglobulins. J Mol Biol (1998) 1.96

FAT: a novel domain in PIK-related kinases. Trends Biochem Sci (2000) 1.96

Packing at the protein-water interface. Proc Natl Acad Sci U S A (1996) 1.91

Structural principles of alpha/beta barrel proteins: the packing of the interior of the sheet. Proteins (1989) 1.78

Evolution of proteins formed by beta-sheets. I. Plastocyanin and azurin. J Mol Biol (1982) 1.76

The structure of a PKD domain from polycystin-1: implications for polycystic kidney disease. EMBO J (1999) 1.75

Determination of protein function, evolution and interactions by structural genomics. Curr Opin Struct Biol (2001) 1.75

The predicted structure of immunoglobulin D1.3 and its comparison with the crystal structure. Science (1986) 1.68

Principles determining the structure of beta-sheet barrels in proteins. I. A theoretical analysis. J Mol Biol (1994) 1.66

Domain closure in adenylate kinase. Joints on either side of two helices close like neighboring fingers. J Mol Biol (1993) 1.61

Role of hydrophobicity in the binding of coenzymes. Appendix. Translational and rotational contribution to the free energy of dissociation. Biochemistry (1978) 1.61

Domain closure in lactoferrin. Two hinges produce a see-saw motion between alternative close-packed interfaces. J Mol Biol (1993) 1.55

Orthogonal packing of beta-pleated sheets in proteins. Biochemistry (1982) 1.55

Framework residue 71 is a major determinant of the position and conformation of the second hypervariable region in the VH domains of immunoglobulins. J Mol Biol (1990) 1.53

Molecular structure of a new family of ribonucleases. Nature (1982) 1.52

Outline structure of the human L1 cell adhesion molecule and the sites where mutations cause neurological disorders. EMBO J (1996) 1.51

Analysis of protein loop closure. Two types of hinges produce one motion in lactate dehydrogenase. J Mol Biol (1991) 1.50

Helix movements and the reconstruction of the haem pocket during the evolution of the cytochrome c family. J Mol Biol (1985) 1.49

Transmission of conformational change in insulin. Nature (1983) 1.48

Domain closure in mitochondrial aspartate aminotransferase. J Mol Biol (1992) 1.46

Comparative analysis of the polycystic kidney disease 1 (PKD1) gene reveals an integral membrane glycoprotein with multiple evolutionary conserved domains. Hum Mol Genet (1997) 1.45

Widespread eukaryotic sequences, highly similar to bacterial DNA polymerase I, looking for functions. Curr Biol (1997) 1.45

Elbow motion in the immunoglobulins involves a molecular ball-and-socket joint. Nature (1988) 1.39

Alignment of the amino acid sequences of distantly related proteins using variable gap penalties. Protein Eng (1989) 1.34

Structure and stability of an immunoglobulin superfamily domain from twitchin, a muscle protein of the nematode Caenorhabditis elegans. J Mol Biol (1996) 1.33

Principles determining the structure of beta-sheet barrels in proteins. II. The observed structures. J Mol Biol (1994) 1.33

SCOP, Structural Classification of Proteins database: applications to evaluation of the effectiveness of sequence alignment methods and statistics of protein structural data. Acta Crystallogr D Biol Crystallogr (1998) 1.29

The structural repertoire of the human V kappa domain. EMBO J (1995) 1.26

A comparison of sequence and structure protein domain families as a basis for structural genomics. Bioinformatics (1999) 1.25

Gene duplications in H. influenzae. Nature (1995) 1.24

Role of subunit interfaces in the allosteric mechanism of hemoglobin. Proc Natl Acad Sci U S A (1976) 1.23

Mechanisms of domain closure in proteins. J Mol Biol (1984) 1.22

The imprint of somatic hypermutation on the repertoire of human germline V genes. J Mol Biol (1996) 1.21

Serpin tertiary structure transformation. J Mol Biol (1991) 1.21

Conservation of folding and stability within a protein family: the tyrosine corner as an evolutionary cul-de-sac. J Mol Biol (2000) 1.20

Protein evolution. How far can sequences diverge? Nature (1997) 1.20

Domains in proteins: definitions, location, and structural principles. Methods Enzymol (1985) 1.20

Packing of alpha-helices onto beta-pleated sheets and the anatomy of alpha/beta proteins. J Mol Biol (1980) 1.17

Stability and specificity of protein-protein interactions: the case of the trypsin-trypsin inhibitor complexes. J Mol Biol (1976) 1.15

Structural determinants of the conformations of medium-sized loops in proteins. Proteins (1989) 1.15

Antibody structure, prediction and redesign. Biophys Chem (1997) 1.10

Immunoglobulin superfamily proteins in Caenorhabditis elegans. J Mol Biol (2000) 1.09

Small-molecule metabolism: an enzyme mosaic. Trends Biotechnol (2001) 1.08

Coiling of beta-pleated sheets. J Mol Biol (1983) 1.03

Canonical structures for the hypervariable regions of T cell alphabeta receptors. J Mol Biol (2000) 1.00

MEDUSA: large scale automatic selection and visual assessment of PCR primer pairs. Bioinformatics (2001) 0.95

Members of the immunoglobulin superfamily in bacteria. Protein Sci (1996) 0.94