Selection of representative protein data sets.

PubWeight™: 15.62 | Rank: Top 0.1% | All-Time Top 10000

Article: PMC 2142204

Published in Protein Sci on March 01, 1992

Authors

U Hobohm (1), M Scharf, R Schneider, C Sander

Author Affiliations

1: European Molecular Biology Laboratory, Heidelberg, Germany.

Articles citing this

(truncated to the top 100)

RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res (2007) 85.81

The ASTRAL compendium for protein structure and sequence analysis. Nucleic Acids Res (2000) 11.38

STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res (2014) 10.79

Automated main-chain model building by template matching and iterative fragment extension. Acta Crystallogr D Biol Crystallogr (2002) 10.34

ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites. Protein Sci (1999) 9.82

The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res (1998) 9.59

The EMBL data library. Nucleic Acids Res (1993) 8.24

Enlarged representative set of protein structures. Protein Sci (1994) 7.98

Genome-wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organisms. Protein Sci (1998) 7.88

Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction. Protein Sci (2002) 7.40

The European Bioinformatics Institute (EBI) databases. Nucleic Acids Res (1996) 7.19

Cation-pi interactions in structural biology. Proc Natl Acad Sci U S A (1999) 6.88

Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci (2003) 6.85

EasyGene--a prokaryotic gene finder that ranks ORFs by statistical significance. BMC Bioinformatics (2003) 6.63

MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison. Protein Sci (2002) 5.47

A database of protein structure families with common folding motifs. Protein Sci (1992) 4.77

NetMHCpan, a method for quantitative predictions of peptide binding to any HLA-A and -B locus protein of known sequence. PLoS One (2007) 3.99

Improved prediction of protein secondary structure by use of sequence profiles and neural networks. Proc Natl Acad Sci U S A (1993) 3.92

A series of PDB related databases for everyday needs. Nucleic Acids Res (2010) 3.60

A generic method for assignment of reliability scores applied to solvent accessibility predictions. BMC Struct Biol (2009) 3.56

PISCES: recent improvements to a PDB sequence culling server. Nucleic Acids Res (2005) 3.55

Automated side-chain model building and sequence assignment by template matching. Acta Crystallogr D Biol Crystallogr (2002) 3.48

Best alpha-helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information. Protein Sci (2004) 3.40

Dali/FSSP classification of three-dimensional protein folds. Nucleic Acids Res (1997) 3.34

NetOglyc: prediction of mucin type O-glycosylation sites based on sequence context and surface accessibility. Glycoconj J (1998) 3.19

Intrinsic disorder in transcription factors. Biochemistry (2006) 3.06

Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field. Proteins (2012) 2.94

Experimentally observed conformation-dependent geometry and hidden strain in proteins. Protein Sci (1996) 2.90

Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method. BMC Bioinformatics (2007) 2.83

PRED-TMBB: a web server for predicting the topology of beta-barrel outer membrane proteins. Nucleic Acids Res (2004) 2.76

Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins. Protein Sci (1998) 2.60

Peptide binding predictions for HLA DR, DP and DQ molecules. BMC Bioinformatics (2010) 2.53

NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction. BMC Bioinformatics (2009) 2.50

Quantitative predictions of peptide binding to any HLA-DR molecule of known sequence: NetMHCIIpan. PLoS Comput Biol (2008) 2.48

The HSSP database of protein structure-sequence alignments. Nucleic Acids Res (1997) 2.44

Sequence conserved for subcellular localization. Protein Sci (2002) 2.36

STITCH 4: integration of protein-chemical interactions with user data. Nucleic Acids Res (2013) 2.32

Transmembrane helix predictions revisited. Protein Sci (2002) 2.28

Insufficiently dehydrated hydrogen bonds as determinants of protein interactions. Proc Natl Acad Sci U S A (2002) 2.20

Evaluation and comparison of mammalian subcellular localization prediction methods. BMC Bioinformatics (2006) 2.11

Probing metagenomics by rapid cluster analysis of very large datasets. PLoS One (2008) 2.11

The FSSP database: fold classification based on structure-structure alignment of proteins. Nucleic Acids Res (1996) 2.10

CPHmodels-3.0--remote homology modeling using structure-guided sequence profiles. Nucleic Acids Res (2010) 1.94

Fast pairwise structural RNA alignments by pruning of the dynamical programming matrix. PLoS Comput Biol (2007) 1.90

A Hidden Markov Model method, capable of predicting and discriminating beta-barrel outer membrane proteins. BMC Bioinformatics (2004) 1.90

Coupled prediction of protein secondary and tertiary structure. Proc Natl Acad Sci U S A (2003) 1.90

Structural analysis based on state-space modeling. Protein Sci (1993) 1.89

Digging for dead genes: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome. Nucleic Acids Res (2001) 1.85

Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion. Nucleic Acids Res (2012) 1.81

PDBselect 1992-2009 and PDBfilter-select. Nucleic Acids Res (2009) 1.76

An accurate, residue-level, pair potential of mean force for folding and binding based on the distance-scaled, ideal-gas reference state. Protein Sci (2004) 1.74

The EMBL Nucleotide Sequence Database. Nucleic Acids Res (1997) 1.71

The HSSP database of protein structure-sequence alignments. Nucleic Acids Res (1996) 1.70

Sequence-similar, structure-dissimilar protein pairs in the PDB. Proteins (2008) 1.68

Cleaning the GenBank Arabidopsis thaliana data set. Nucleic Acids Res (1996) 1.62

Contact order and ab initio protein structure prediction. Protein Sci (2002) 1.59

DIAL: a web server for the pairwise alignment of two RNA three-dimensional structures using nucleotide, dihedral angle and base-pairing similarities. Nucleic Acids Res (2007) 1.57

Accurate and efficient loop selections by the DFIRE-based all-atom statistical potential. Protein Sci (2004) 1.53

CODA: a combined algorithm for predicting the structurally variable regions of protein models. Protein Sci (2001) 1.53

Are proteins ideal mixtures of amino acids? Analysis of energy parameter sets. Protein Sci (1995) 1.53

NetMHCIIpan-3.0, a common pan-specific MHC class II prediction method including all three human MHC class II isotypes, HLA-DR, HLA-DP and HLA-DQ. Immunogenetics (2013) 1.49

Three-dimensional profiles from residue-pair preferences: identification of sequences with beta/alpha-barrel fold. Proc Natl Acad Sci U S A (1993) 1.45

Ab initio folding of terminal segments with secondary structures reveals the fine difference between two closely related all-atom statistical energy functions. Protein Sci (2008) 1.45

A structural census of the current population of protein sequences. Proc Natl Acad Sci U S A (1997) 1.44

MUFOLD: A new solution for protein 3D structure prediction. Proteins (2010) 1.42

Prediction of enzyme function based on 3D templates of evolutionarily important amino acids. BMC Bioinformatics (2008) 1.41

Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method. BMC Bioinformatics (2005) 1.41

Sequence-structure matching in globular proteins: application to supersecondary and tertiary structure determination. Proc Natl Acad Sci U S A (1992) 1.40

Characterization and prediction of protein nucleolar localization sequences. Nucleic Acids Res (2010) 1.40

Structural alignment of proteins by a novel TOPOFIT method, as a superimposition of common volumes at a topomax point. Protein Sci (2004) 1.39

Protein secondary structure assignment revisited: a detailed analysis of different assignment methods. BMC Struct Biol (2005) 1.38

Hierarchical classification of protein folds using a novel ensemble classifier. PLoS One (2013) 1.36

Improved amino acid flexibility parameters. Protein Sci (2003) 1.34

BETAWRAP: successful prediction of parallel beta-helices from primary sequence reveals an association with many microbial pathogens. Proc Natl Acad Sci U S A (2001) 1.34

From fold predictions to function predictions: automation of functional site conservation analysis for functional genome predictions. Protein Sci (1999) 1.33

Correction for phylogeny, small number of observations and data redundancy improves the identification of coevolving amino acid pairs using mutual information. Bioinformatics (2009) 1.30

Systematic identification of proteins that elicit drug side effects. Mol Syst Biol (2013) 1.28

Structural analysis of B-cell epitopes in antibody:protein complexes. Mol Immunol (2012) 1.28

Interaction preferences across protein-protein interfaces of obligatory and non-obligatory components are different. BMC Struct Biol (2005) 1.27

Amino acid empirical contact energy definitions for fold recognition in the space of contact maps. BMC Bioinformatics (2003) 1.24

HECTAR: a method to predict subcellular targeting in heterokonts. BMC Bioinformatics (2008) 1.22

Associative memory hamiltonians for structure prediction without homology: alpha-helical proteins. Proc Natl Acad Sci U S A (2000) 1.21

Annotation of tertiary interactions in RNA structures reveals variations and correlations. RNA (2008) 1.21

The Shannon information entropy of protein sequences. Biophys J (1996) 1.21

De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features. PLoS One (2008) 1.20

NNAlign: a web-based prediction method allowing non-expert end-user discovery of sequence motifs in quantitative peptide data. PLoS One (2011) 1.20

pvSOAR: detecting similar surface patterns of pocket and void surfaces of amino acid residues on proteins. Nucleic Acids Res (2004) 1.20

Evidence for nonrandom hydrophobicity structures in protein chains. Proc Natl Acad Sci U S A (1996) 1.19

Secreted protein prediction system combining CJ-SPHMM, TMHMM, and PSORT. Mamm Genome (2003) 1.19

A virulent parent with probiotic progeny: comparative genomics of Escherichia coli strains CFT073, Nissle 1917 and ABU 83972. Mol Genet Genomics (2010) 1.17

Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res (2006) 1.17

Predicting the topology of transmembrane helical proteins using mean burial propensity and a hidden-Markov-model-based method. Protein Sci (2003) 1.17

Enhancing the stability and solubility of TEV protease using in silico design. Protein Sci (2007) 1.17

Evolutionary trace annotation of protein function in the structural proteome. J Mol Biol (2009) 1.16

kClust: fast and sensitive clustering of large protein sequence databases. BMC Bioinformatics (2013) 1.15

ArchDB: automated protein loop classification as a tool for structural genomics. Nucleic Acids Res (2004) 1.12

Improving the performance of DomainParser for structural domain partition using neural network. Nucleic Acids Res (2003) 1.12

Thermodynamics of beta-sheet formation in polyglutamine. Biophys J (2009) 1.07

Chloroplast transit peptide prediction: a peek inside the black box. Nucleic Acids Res (2001) 1.07

The dependence of all-atom statistical potentials on structural training database. Biophys J (2004) 1.06

Articles by these authors

Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers (1983) 99.69

Protein structure comparison by alignment of distance matrices. J Mol Biol (1993) 34.74

Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins (1991) 32.50

Errors in protein structures. Nature (1996) 18.70

Touring protein fold space with Dali/FSSP. Nucleic Acids Res (1998) 18.00

Prediction of protein secondary structure at better than 70% accuracy. J Mol Biol (1993) 17.53

Dali: a network tool for protein structure comparison. Trends Biochem Sci (1995) 14.13

Automated genome sequence analysis and annotation. Bioinformatics (1999) 13.92

Mapping the protein universe. Science (1996) 13.72

Combining evolutionary information and neural networks to predict protein secondary structure. Proteins (1994) 9.97

The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res (1998) 9.59

New structure--novel fold? Structure (1997) 9.19

Rb targets histone H3 methylation and HP1 to promoters. Nature (2001) 8.26

Enlarged representative set of protein structures. Protein Sci (1994) 7.98

PHD--an automatic mail server for protein secondary structure prediction. Comput Appl Biosci (1994) 6.71

An ATPase domain common to prokaryotic cell cycle proteins, sugar kinases, actin, and hsp70 heat shock proteins. Proc Natl Acad Sci U S A (1992) 6.18

Positioning hydrogen atoms by optimizing hydrogen-bond networks in protein structures. Proteins (1996) 5.83

Completeness in structural genomics. Nat Struct Biol (2001) 5.36

Correlated mutations and residue contacts in proteins. Proteins (1994) 5.20

A yeast gene encoding a protein homologous to the human c-has/bas proto-oncogene product. Nature (1984) 5.19

Transmembrane helices predicted at 95% accuracy. Protein Sci (1995) 5.05

The FSSP database of structurally aligned protein fold families. Nucleic Acids Res (1994) 4.94

A database of protein structure families with common folding motifs. Protein Sci (1992) 4.77

The primary structure of transcription factor TFIIIA has 12 consecutive repeats. FEBS Lett (1985) 4.73

Protein normal-mode dynamics: trypsin inhibitor, crambin, ribonuclease and lysozyme. J Mol Biol (1985) 4.56

MView: a web-compatible database search or multiple alignment viewer. Bioinformatics (1998) 4.50

The immunoglobulin fold. Structural classification, sequence patterns and common core. J Mol Biol (1994) 4.43

An evolutionary treasure: unification of a broad set of amidohydrolases related to urease. Proteins (1997) 4.42

Conservation and prediction of solvent accessibility in protein families. Proteins (1994) 4.29

Ubiquitination of a new form of alpha-synuclein by parkin from human brain: implications for Parkinson's disease. Science (2001) 4.18

An integrated genomic analysis of lung cancer reveals loss of DUSP4 in EGFR-mutant tumors. Oncogene (2009) 4.10

On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations. Proc Natl Acad Sci U S A (1984) 4.08

T-cell receptor V beta use predicts reactivity and tolerance to Mlsa-encoded antigens. Nature (1988) 4.04

Improved prediction of protein secondary structure by use of sequence profiles and neural networks. Proc Natl Acad Sci U S A (1993) 3.92

Searching protein structure databases has come of age. Proteins (1994) 3.87

Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace: application to model building and detection of co-ordinate errors. J Mol Biol (1991) 3.81

Removing near-neighbour redundancy from large protein sequence collections. Bioinformatics (1998) 3.47

Bioinformatics: from genome data to biological knowledge. Curr Opin Biotechnol (1997) 3.44

Dictionary of recurrent domains in protein structures. Proteins (1998) 3.40

Differences in genotypes of Helicobacter pylori from different human populations. J Bacteriol (2000) 3.39

Protein folds and families: sequence and structure alignments. Nucleic Acids Res (1999) 3.38

The ras protein family: evolutionary tree and role of conserved amino acids. Biochemistry (1991) 3.38

Heart-rate turbulence after ventricular premature beats as a predictor of mortality after acute myocardial infarction. Lancet (1999) 3.38

Dali/FSSP classification of three-dimensional protein folds. Nucleic Acids Res (1997) 3.34

A method to predict functional residues in proteins. Nat Struct Biol (1995) 3.33

Parser for protein folding units. Proteins (1994) 3.25

Convergent evolution of similar enzymatic function on different protein folds: the hexokinase, ribokinase, and galactokinase families of sugar kinases. Protein Sci (1993) 3.25

Concerns about game ranching. Can Vet J (1990) 3.07

A sequence property approach to searching protein databases. J Mol Biol (1995) 2.86

Yeast chromosome III: new gene functions. EMBO J (1994) 2.85

Fast and simple Monte Carlo algorithm for side chain optimization in proteins: application to model building by homology. Proteins (1992) 2.69

Roles of topoisomerases in maintaining steady-state DNA supercoiling in Escherichia coli. J Biol Chem (2000) 2.58

Neurotrophin-3 enhances sprouting of corticospinal tract during development and after adult spinal cord lesion. Nature (1994) 2.57

Dipoles of the alpha-helix and beta-sheet: their role in protein folding. Nature (1981) 2.56

Detection of common three-dimensional substructures in proteins. Proteins (1991) 2.53

Evaluation of protein models by atomic solvation preference. J Mol Biol (1992) 2.53

Protein fold recognition by prediction-based threading. J Mol Biol (1997) 2.51

Molecular cloning of YPT1/SEC4-related cDNAs from an epithelial cell line. Mol Cell Biol (1990) 2.51

CAST: an iterative algorithm for the complexity analysis of sequence tracts. Complexity analysis of sequence tracts. Bioinformatics (2000) 2.50

The PDBFINDER database: a summary of PDB, DSSP and HSSP information with added value. Comput Appl Biosci (1996) 2.46

A sensitive and rapid gel retention assay for nuclear factor I and other DNA-binding proteins in crude nuclear extracts. Nucleic Acids Res (1986) 2.45

The HSSP database of protein structure-sequence alignments. Nucleic Acids Res (1997) 2.44

DNA polymerase beta belongs to an ancient nucleotidyltransferase superfamily. Trends Biochem Sci (1995) 2.41

A novel RNA-binding motif in omnipotent suppressors of translation termination, ribosomal proteins and a ribosome modification enzyme? Nucleic Acids Res (1994) 2.37

Fluconazole prophylaxis prevents intra-abdominal candidiasis in high-risk surgical patients. Crit Care Med (1999) 2.35

Progress in protein structure prediction? Trends Biochem Sci (1993) 2.34

Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations? Protein Eng (1994) 2.30

Hepatitis C virus core protein binds to the cytoplasmic domain of tumor necrosis factor (TNF) receptor 1 and enhances TNF-induced apoptosis. J Virol (1998) 2.28

Predicting protein structure using hidden Markov models. Proteins (1997) 2.20

The use of a token system in project Head Start. J Appl Behav Anal (1970) 2.20

The FSSP database: fold classification based on structure-structure alignment of proteins. Nucleic Acids Res (1996) 2.10

Survey of animal neoplasms in Alameda and Contra Costa Counties, California. II. Cancer morbidity in dogs and cats from Alameda County. J Natl Cancer Inst (1968) 2.08

How good are predictions of protein secondary structure? FEBS Lett (1983) 2.08

The GeneQuiz web server: protein functional analysis through the Web. Trends Biochem Sci (2000) 2.07

Bacterial community associated with Pfiesteria-like dinoflagellate cultures. Environ Microbiol (2001) 2.06

MID1, mutated in Opitz syndrome, encodes an ubiquitin ligase that targets phosphatase 2A for degradation. Nat Genet (2001) 2.04

The ten helical twist angles of B-DNA. Nucleic Acids Res (1982) 2.02

Correlation between the structure and biochemical activities of FtsA, an essential cell division protein of the actin family. EMBO J (1994) 2.01

First pass annotation of promoters on human chromosome 22. Genome Res (2001) 1.99

Genome sequences and great expectations. Genome Biol (2000) 1.98

A supernova origin for dust in a high-redshift quasar. Nature (2004) 1.97

The HSSP database of protein structure-sequence alignments. Nucleic Acids Res (1994) 1.96

Clinical features and outcome of pediatric Wegener's granulomatosis. Arthritis Rheum (2007) 1.91

TFIIB, an evolutionary link between the transcription machineries of archaebacteria and eukaryotes. Cell (1992) 1.91

Analysis of 27 mammalian and 9 avian PrPs reveals high conservation of flexible regions of the prion protein. J Mol Biol (1999) 1.90

Homeopathic proving symptoms: result of a local, non-local, or placebo process? A blinded, placebo-controlled pilot study. Homeopathy (2004) 1.88

Deletion of self-reactive T cells before entry into the thymus medulla. Nature (1988) 1.88

Positive selection of CD4+ thymocytes controlled by MHC class II gene products. Nature (1988) 1.86

Gluten subfractions in coeliac disease. Lancet (1972) 1.86

Redefining the goals of protein secondary structure prediction. J Mol Biol (1994) 1.84

Decision support system for the evolutionary classification of protein structures. Proc Int Conf Intell Syst Mol Biol (1997) 1.80

Osteoid osteomas of the femoral neck: report of four cases evaluated with isotopic bone scanning, CT, and MR imaging. Radiology (1993) 1.76