Published in Structure on June 10, 2009
Extending CATH: increasing coverage of the protein structure universe and linking structure with function. Nucleic Acids Res (2010) 2.93
Classification of intrinsically disordered regions and proteins. Chem Rev (2014) 2.48
The high-throughput protein sample production platform of the Northeast Structural Genomics Consortium. J Struct Biol (2010) 1.90
Assignment of protein sequences to existing domain and family classification systems: Pfam and the PDB. Bioinformatics (2012) 1.82
Preparation of protein samples for NMR structure, function, and small-molecule screening studies. Methods Enzymol (2011) 1.75
Improving the chances of successful protein structure determination with a random forest classifier. Acta Crystallogr D Biol Crystallogr (2014) 1.50
GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains. Nucleic Acids Res (2009) 1.50
The Protein Structure Initiative: achievements and visions for the future. F1000 Biol Rep (2012) 1.38
Expansion of the protein repertoire in newly explored environments: human gut microbiome specific protein families. PLoS Comput Biol (2010) 1.15
Unmet challenges of structural genomics. Curr Opin Struct Biol (2010) 1.06
Functional site plasticity in domain superfamilies. Biochim Biophys Acta (2013) 1.00
Protein domains of unknown function are essential in bacteria. MBio (2013) 0.99
Structural genomics plucks high-hanging membrane proteins. Curr Opin Struct Biol (2012) 0.98
The Proteome Folding Project: proteome-scale prediction of structure and function. Genome Res (2011) 0.98
RepeatsDB: a database of tandem repeat protein structures. Nucleic Acids Res (2013) 0.97
Sequence-based prediction of protein crystallization, purification and production propensity. Bioinformatics (2011) 0.95
From protein sequences to 3D-structures and beyond: the example of the UniProt knowledgebase. Cell Mol Life Sci (2009) 0.95
Structural analysis of heme proteins: implications for design and prediction. BMC Struct Biol (2011) 0.95
Improved protein surface comparison and application to low-resolution protein structure data. BMC Bioinformatics (2010) 0.95
Structural characteristics of novel protein folds. PLoS Comput Biol (2010) 0.94
High-throughput expression and purification of membrane proteins. J Struct Biol (2010) 0.93
DisMeta: a meta server for construct design and optimization. Methods Mol Biol (2014) 0.92
Crystal structure of cytomegalovirus IE1 protein reveals targeting of TRIM family member PML via coiled-coil interactions. PLoS Pathog (2014) 0.91
Binding of protein kinase inhibitors to synapsin I inferred from pair-wise binding site similarity measurements. PLoS One (2010) 0.90
Target highlights in CASP9: Experimental target structures for the critical assessment of techniques for protein structure prediction. Proteins (2011) 0.89
Revisiting gap locations in amino acid sequence alignments and a proposal for a method to improve them by introducing solvent accessibility. Proteins (2011) 0.89
Utilization of protein intrinsic disorder knowledge in structural proteomics. Biochim Biophys Acta (2012) 0.89
Mathematical model for empirically optimizing large scale production of soluble protein domains. BMC Bioinformatics (2010) 0.87
Towards structural systems pharmacology to study complex diseases and personalized medicine. PLoS Comput Biol (2014) 0.87
1,000 structures and more from the MCSG. BMC Struct Biol (2011) 0.86
Structural representative of the protein family PF14466 has a new fold and establishes links with the C2 and PLAT domains from the widely distant Pfams PF00168 and PF01477. Protein Sci (2013) 0.86
Rigid-body ligand recognition drives cytotoxic T-lymphocyte antigen 4 (CTLA-4) receptor triggering. J Biol Chem (2010) 0.86
An estimated 5% of new protein structures solved today represent a new Pfam family. Acta Crystallogr D Biol Crystallogr (2013) 0.85
Development of a full-length human protein production pipeline. Protein Sci (2014) 0.84
A new approach to assess and predict the functional roles of proteins across all known structures. J Struct Funct Genomics (2011) 0.83
Assessing energetic contributions to binding from a disordered region in a protein-protein interaction. Biochemistry (2010) 0.82
Computational reconstruction of multidomain proteins using atomic force microscopy data. Structure (2012) 0.82
Computational protein design: validation and possible relevance as a tool for homology searching and fold recognition. PLoS One (2010) 0.81
The impact of structural genomics: the first quindecennial. J Struct Funct Genomics (2016) 0.79
Solution NMR structures reveal a distinct architecture and provide first structures for protein domain family PF04536. J Struct Funct Genomics (2011) 0.79
New variants of known folds: do they bring new biology? Acta Crystallogr Sect F Struct Biol Cryst Commun (2010) 0.79
Advances in protein NMR provided by the NIGMS Protein Structure Initiative: impact on drug discovery. Curr Opin Drug Discov Devel (2010) 0.79
High-throughput computational structure-based characterization of protein families: START domains and implications for structural genomics. J Struct Funct Genomics (2010) 0.79
Solution NMR structure of Alr2454 from Nostoc sp. PCC 7120, the first structural representative of Pfam domain family PF11267. J Struct Funct Genomics (2012) 0.78
High-throughput cloning and expression of integral membrane proteins in Escherichia coli. Curr Protoc Protein Sci (2013) 0.78
Solution NMR and X-ray crystal structures of Pseudomonas syringae Pspto_3016 from protein domain family PF04237 (DUF419) adopt a "double wing" DNA binding motif. J Struct Funct Genomics (2012) 0.78
Prediction of DNA binding motifs from 3D models of transcription factors; identifying TLX3 regulated genes. Nucleic Acids Res (2014) 0.78
A more structured metabolome. Nat Struct Mol Biol (2009) 0.78
Computational approaches for rational design of proteins with novel functionalities. Comput Struct Biotechnol J (2012) 0.78
Disease risk of missense mutations using structural inference from predicted function. Curr Protein Pept Sci (2010) 0.77
A community resource of experimental data for NMR / X-ray crystal structure pairs. Protein Sci (2015) 0.77
Internal organization of large protein families: relationship between the sequence, structure, and function-based clustering. Proteins (2011) 0.77
Computational approaches to selecting and optimising targets for structural biology. Methods (2011) 0.77
High-throughput structural biology of metabolic enzymes and its impact on human diseases. J Inherit Metab Dis (2011) 0.77
Structure and computational analysis of a novel protein with metallopeptidase-like and circularly permuted winged-helix-turn-helix domains reveals a possible role in modified polysaccharide biosynthesis. BMC Bioinformatics (2014) 0.76
Quantification of the impact of PSI:Biology according to the annotations of the determined structures. BMC Struct Biol (2013) 0.75
Analyses of the general rule on residue pair frequencies in local amino acid sequences of soluble, ordered proteins. Protein Sci (2013) 0.75
Solution NMR structures provide first structural coverage of the large protein domain family PF08369 and complementary structural coverage of dark operative protochlorophyllide oxidoreductase complexes. J Struct Funct Genomics (2013) 0.75
Structural and functional characterization of DUF1471 domains of Salmonella proteins SrfN, YdgH/SssB, and YahO. PLoS One (2014) 0.75
A global comparison of the human and T. brucei degradomes gives insights about possible parasite drug targets. PLoS Negl Trop Dis (2012) 0.75
Solution NMR structure of the helicase associated domain BVU_0683(627-691) from Bacteroides vulgatus provides first structural coverage for protein domain family PF03457 and indicates domain binding to DNA. J Struct Funct Genomics (2012) 0.75
Solution NMR structure of CD1104B from pathogenic Clostridium difficile reveals a distinct α-helical architecture and provides first structural representative of protein domain family PF14203. J Struct Funct Genomics (2013) 0.75
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31
Basic local alignment search tool. J Mol Biol (1990) 659.07
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res (1994) 392.47
Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet (2000) 336.52
The Protein Data Bank. Nucleic Acids Res (2000) 187.10
KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res (2000) 117.00
SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol (1995) 74.88
NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res (2006) 48.10
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics (2006) 43.68
The Pfam protein families database. Nucleic Acids Res (2007) 30.53
Metagenomic analysis of the human distal gut microbiome. Science (2006) 29.76
The universal protein resource (UniProt). Nucleic Acids Res (2007) 16.33
The TIGRFAMs database of protein families. Nucleic Acids Res (2003) 13.59
Comparative metagenomics revealed commonly enriched gene sets in human gut microbiomes. DNA Res (2007) 13.08
Hidden Markov models. Curr Opin Struct Biol (1996) 11.56
The ProDom database of protein domain families: more emphasis on 3D. Nucleic Acids Res (2005) 7.66
Evolution of function in protein superfamilies, from a structural perspective. J Mol Biol (2001) 6.63
Metagenomics: genomic analysis of microbial communities. Annu Rev Genet (2004) 6.27
Completeness in structural genomics. Nat Struct Biol (2001) 5.36
The impact of structural genomics: expectations and outcomes. Science (2006) 5.30
E-MSD: an integrated data resource for bioinformatics. Nucleic Acids Res (2005) 4.25
Divergent evolution of enzymatic function: mechanistically diverse superfamilies and functionally distinct suprafamilies. Annu Rev Biochem (2001) 4.17
Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures. J Mol Biol (2005) 4.02
The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution. Nucleic Acids Res (2006) 3.84
A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. Curr Opin Struct Biol (2005) 3.69
SSAP: sequential structure alignment program for protein structure comparison. Methods Enzymol (1996) 3.25
100,000 protein structures for the biologist. Nat Struct Biol (1998) 3.19
Gene3D: comprehensive structural and functional annotation of genomes. Nucleic Acids Res (2007) 2.75
Leveraging enzyme structure-function relationships for functional inference and experimental design: the structure-function linkage database. Biochemistry (2006) 2.65
Progress of structural genomics initiatives: an analysis of solved target structures. J Mol Biol (2005) 2.22
Structural diversity of domain superfamilies in the CATH database. J Mol Biol (2006) 2.10
The natural history of protein domains. Annu Rev Biophys Biomol Struct (2001) 2.09
The protein structure initiative structural genomics knowledgebase. Nucleic Acids Res (2008) 2.03
Structural and chemical profiling of the human cytosolic sulfotransferases. PLoS Biol (2007) 1.90
Arrangements in the modular evolution of proteins. Trends Biochem Sci (2008) 1.78
Monophyly of class I aminoacyl tRNA synthetase, USPA, ETFP, photolyase, and PP-ATPase nucleotide-binding domains: implications for protein evolution in the RNA. Proteins (2002) 1.67
Novel leverage of structural genomics. Nat Biotechnol (2007) 1.57
Identification and distribution of protein families in 120 completed genomes using Gene3D. Proteins (2005) 1.54
From the first protein structures to our current knowledge of protein folding: delights and scepticisms. Nat Rev Mol Cell Biol (2008) 1.44
Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint. BMC Bioinformatics (2007) 1.34
The structure of protein evolution and the evolution of protein structure. Curr Opin Struct Biol (2008) 1.32
Towards fully automated structure-based function prediction in structural genomics: a case study. J Mol Biol (2007) 1.26
Exploring the structure and function paradigm. Curr Opin Struct Biol (2008) 1.26
Sequence clustering strategies improve remote homology recognitions while reducing search times. Protein Eng (2002) 1.25
Update on the protein structure initiative. Structure (2007) 1.23
CHOP proteins into structural domain-like fragments. Proteins (2004) 1.21
Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res (2006) 1.17
Protein superfamily evolution and the last universal common ancestor (LUCA). J Mol Evol (2006) 1.17
Origins and impact of constraints in evolution of gene families. Genome Res (2006) 1.16
Target selection for structural genomics: an overview. Methods Mol Biol (2008) 1.07
A Protein Structure (or Function ?) Initiative. Structure (2007) 0.89
New dimensions of structural proteomics: exploring chemical and biological space. Structure (2007) 0.80
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics (2006) 43.68
InterPro: the integrative protein signature database. Nucleic Acids Res (2008) 25.07
SNAP predicts effect of mutations on protein function. Bioinformatics (2008) 15.39
The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families. PLoS Biol (2007) 13.99
InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res (2011) 13.45
New developments in the InterPro database. Nucleic Acids Res (2007) 12.49
A combined computational-experimental approach predicts human microRNA targets. Genes Dev (2004) 9.82
SNAP: predict effect of non-synonymous polymorphisms on function. Nucleic Acids Res (2007) 8.46
Protein tyrosine phosphatases in the human genome. Cell (2004) 8.09
The importance of alignment accuracy for molecular replacement. Acta Crystallogr D Biol Crystallogr (2004) 6.49
Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline. Proc Natl Acad Sci U S A (2002) 6.00
Human embryonic stem cells derived by somatic cell nuclear transfer. Cell (2013) 5.91
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis. Nucleic Acids Res (2005) 5.59
Tolerating some redundancy significantly speeds up clustering of large protein databases. Bioinformatics (2002) 5.46
Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics (2003) 5.46
Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles. Proteins (2002) 5.29
FFAS03: a server for profile--profile sequence alignments. Nucleic Acids Res (2005) 4.99
The NLR gene family: a standard nomenclature. Immunity (2008) 4.77
A large-scale evaluation of computational protein function prediction. Nat Methods (2013) 4.61
MODBASE, a database of annotated comparative protein structure models, and associated resources. Nucleic Acids Res (2004) 4.54
A mutation in VPS35, encoding a subunit of the retromer complex, causes late-onset Parkinson disease. Am J Hum Genet (2011) 4.52
A primer on metagenomics. PLoS Comput Biol (2010) 4.40
Financial expectations of first-year veterinary students. J Am Vet Med Assoc (2015) 4.38
S-nitrosylation of Drp1 mediates beta-amyloid-related mitochondrial fission and neuronal injury. Science (2009) 4.25
Protein production and purification. Nat Methods (2008) 3.97
Successful transfer of open surgical skills to a laparoscopic environment using a robotic interface: initial experience with laparoscopic radical prostatectomy. J Urol (2003) 3.93
Towards germline gene therapy of inherited mitochondrial diseases. Nature (2012) 3.73
Tools for comparative protein structure modeling and analysis. Nucleic Acids Res (2003) 3.68
Genome-scale analysis of in vivo spatiotemporal promoter activity in Caenorhabditis elegans. Nat Biotechnol (2007) 3.65
Shotgun metaproteomics of the human distal gut microbiota. ISME J (2008) 3.43
Exploration of uncharted regions of the protein universe. PLoS Biol (2009) 3.41
Predicting protein function from sequence and structure. Nat Rev Mol Cell Biol (2007) 3.37
New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res (2012) 3.36
A Blueprint for HIV Vaccine Discovery. Cell Host Microbe (2012) 3.27
Critical assessment of methods of protein structure prediction-Round VII. Proteins (2007) 3.23
High-throughput in vivo analysis of gene expression in Caenorhabditis elegans. PLoS Biol (2007) 3.22
Mimicking cellular sorting improves prediction of subcellular localization. J Mol Biol (2005) 3.21
Novel surveillance network for norovirus gastroenteritis outbreaks, United States. Emerg Infect Dis (2011) 3.06
The amphioxus genome illuminates vertebrate origins and cephalochordate biology. Genome Res (2008) 3.04
Three-dimensional structures of membrane proteins from genomic sequencing. Cell (2012) 3.02
Analysing six types of protein-protein interfaces. J Mol Biol (2003) 2.96
Gene3D: comprehensive structural and functional annotation of genomes. Nucleic Acids Res (2007) 2.75
The CATH extended protein-family database: providing structural annotations for genome sequences. Protein Sci (2002) 2.69
Quantifying the similarities within fold space. J Mol Biol (2002) 2.65
Critical assessment of methods of protein structure prediction (CASP)--round 6. Proteins (2005) 2.58
Alignments grow, secondary structure prediction improves. Proteins (2002) 2.56
EVA: Evaluation of protein structure prediction servers. Nucleic Acids Res (2003) 2.50
FATCAT: a web server for flexible structure comparison and structure similarity searching. Nucleic Acids Res (2004) 2.49
Mitochondrial fission in apoptosis, neurodegeneration and aging. Curr Opin Cell Biol (2003) 2.44
CutDB: a proteolytic event database. Nucleic Acids Res (2006) 2.41
Reliability of assessment of protein structure prediction methods. Structure (2002) 2.39
Sequence conserved for subcellular localization. Protein Sci (2002) 2.36
Automatic target selection for structural genomics on eukaryotes. Proteins (2004) 2.34
Gene3D: merging structure and function for a Thousand genomes. Nucleic Acids Res (2009) 2.34
Three-dimensional structural view of the central metabolic network of Thermotoga maritima. Science (2009) 2.33
Outcome of a workshop on archiving structural models of biological macromolecules. Structure (2006) 2.32
The protein target list of the Northeast Structural Genomics Consortium. Proteins (2004) 2.31
Gene3D: modelling protein structure, function and evolution. Nucleic Acids Res (2006) 2.30
Transmembrane helix predictions revisited. Protein Sci (2002) 2.28
Pre-calculated protein structure alignments at the RCSB PDB website. Bioinformatics (2010) 2.28
XtalPred: a web server for prediction of protein crystallizability. Bioinformatics (2007) 2.26
TOPS++FATCAT: fast flexible structural alignment using constraints derived from TOPS+ Strings Model. BMC Bioinformatics (2008) 2.26
Critical assessment of methods of protein structure prediction - Round VIII. Proteins (2009) 2.25
The PAAD/PYRIN-only protein POP1/ASC2 is a modulator of ASC-mediated nuclear-factor-kappa B and pro-caspase-1 regulation. Biochem J (2003) 2.25
Protein function annotation by homology-based inference. Genome Biol (2009) 2.20
The domains of apoptosis: a genomics perspective. Sci STKE (2004) 2.15
Protein-protein interaction hotspots carved into sequences. PLoS Comput Biol (2007) 2.15
Case of yellow fever vaccine--associated viscerotropic disease with prolonged viremia, robust adaptive immune responses, and polymorphisms in CCR5 and RANTES genes. J Infect Dis (2008) 2.14
Probing metagenomics by rapid cluster analysis of very large datasets. PLoS One (2008) 2.11
Electron paramagnetic resonance oxygen images correlate spatially and quantitatively with Oxylite oxygen measurements. Clin Cancer Res (2006) 2.10
Automatic detection of subsystem/pathway variants in genome analysis. Bioinformatics (2005) 2.10
Transient protein-protein interactions: structural, functional, and network properties. Structure (2010) 2.07
Outcome of a workshop on applications of protein models in biomedical research. Structure (2009) 2.05
PROFbval: predict flexible and rigid residues in proteins. Bioinformatics (2006) 2.04
The protein structure initiative structural genomics knowledgebase. Nucleic Acids Res (2008) 2.03
Domains, motifs and clusters in the protein universe. Curr Opin Chem Biol (2003) 2.02
Microvesicle entry into marrow cells mediates tissue-specific changes in mRNA by direct delivery of mRNA and induction of transcription. Exp Hematol (2010) 2.01
Predicted protein-protein interaction sites from local sequence information. FEBS Lett (2003) 2.00
The retinitis pigmentosa GTPase regulator (RPGR)-interacting protein: subserving RPGR function and participating in disk morphogenesis. Proc Natl Acad Sci U S A (2003) 1.96
Loopy proteins appear conserved in evolution. J Mol Biol (2002) 1.96
Insights into the mechanism of microtubule stabilization by Taxol. Proc Natl Acad Sci U S A (2006) 1.96
Comparative analysis of protein domain organization. Genome Res (2004) 1.94
Identification and characterization of a novel bacterial virulence factor that shares homology with mammalian Toll/interleukin-1 receptor family proteins. Infect Immun (2006) 1.91
Between order and disorder in protein structures: analysis of "dual personality" fragments in proteins. Structure (2007) 1.88
Protein names precisely peeled off free text. Bioinformatics (2004) 1.87
The JCSG high-throughput structural biology pipeline. Acta Crystallogr Sect F Struct Biol Cryst Commun (2010) 1.87
Cancer incidence in first generation U.S. Hispanics: Cubans, Mexicans, Puerto Ricans, and new Latinos. Cancer Epidemiol Biomarkers Prev (2009) 1.85
Genotypic and epidemiologic trends of norovirus outbreaks in the United States, 2009 to 2013. J Clin Microbiol (2013) 1.84