Automated assembly of protein blocks for database searching.

PubWeight™: 10.84‹?› | Rank: Top 0.1%

🔗 View Article (PMC 329220)

Published in Nucleic Acids Res on December 11, 1991

Authors

S Henikoff1, J G Henikoff

Author Affiliations

1: Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, WA 98104.

Articles citing this

Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A (1992) 61.33

Predicting deleterious amino acid substitutions. Genome Res (2001) 28.95

PANTHER: a library of protein families and subfamilies indexed by function. Genome Res (2003) 21.64

GenDB--an open source genome annotation system for prokaryote genomes. Nucleic Acids Res (2003) 18.88

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks. Proc Natl Acad Sci U S A (1994) 18.46

Embedding strategies for effective use of information from multiple sequence alignments. Protein Sci (1997) 11.25

Recognition of related proteins by iterative template refinement (ITR). Protein Sci (1994) 9.67

Modular arrangement of proteins as inferred from analysis of homology. Protein Sci (1994) 9.38

Increased coverage of protein families with the blocks database servers. Nucleic Acids Res (2000) 9.18

The PROSITE database, its status in 1997. Nucleic Acids Res (1997) 8.12

Finding flexible patterns in unaligned protein sequences. Protein Sci (1995) 7.27

Gibbs motif sampling: detection of bacterial outer membrane protein repeats. Protein Sci (1995) 5.96

Searching databases of conserved sequence regions by aligning protein multiple-alignments. Nucleic Acids Res (1996) 5.68

The PROSITE database, its status in 1995. Nucleic Acids Res (1996) 5.53

PROSITE: recent developments. Nucleic Acids Res (1994) 4.46

Histone acetyltransferase activity of yeast Gcn5p is required for the activation of target genes in vivo. Genes Dev (1998) 4.23

Escherichia coli FtsH is a membrane-bound, ATP-dependent protease which degrades the heat-shock transcription factor sigma 32. EMBO J (1995) 3.18

The lin-15 locus encodes two negative regulators of Caenorhabditis elegans vulval development. Mol Biol Cell (1994) 3.04

Recent enhancements to the Blocks Database servers. Nucleic Acids Res (1997) 3.01

PRINTS--a database of protein motif fingerprints. Nucleic Acids Res (1994) 2.86

A general method for identifying recessive diploid-specific mutations in Saccharomyces cerevisiae, its application to the isolation of mutants blocked at intermediate stages of meiotic prophase and characterization of a new gene SAE2. Genetics (1997) 2.72

Regulators of aerobic and anaerobic respiration in Bacillus subtilis. J Bacteriol (1996) 2.63

Molecular cloning and sequence analysis of expansins--a highly conserved, multigene family of proteins that mediate cell wall extension in plants. Proc Natl Acad Sci U S A (1995) 2.47

Cloning and characterization of a gene whose product is a trans-activator of anthrax toxin synthesis. J Bacteriol (1993) 2.46

Genes required for cellulose synthesis in Agrobacterium tumefaciens. J Bacteriol (1995) 2.42

Characterization of the desulfurization genes from Rhodococcus sp. strain IGTS8. J Bacteriol (1994) 2.10

The I/LWEQ module: a conserved sequence that signifies F-actin binding in functionally diverse proteins from yeast to mammals. Proc Natl Acad Sci U S A (1997) 2.06

Microbial relatives of the seed storage proteins of higher plants: conservation of structure and diversification of function during evolution of the cupin superfamily. Microbiol Mol Biol Rev (2000) 2.03

Sequencing and analysis of the prolate-headed lactococcal bacteriophage c2 genome and identification of the structural genes. Appl Environ Microbiol (1995) 1.95

Requirement for genes with homology to ABC transport systems for attachment and virulence of Agrobacterium tumefaciens. J Bacteriol (1996) 1.93

HCP-4, a CENP-C-like protein in Caenorhabditis elegans, is required for resolution of sister centromeres. J Cell Biol (2001) 1.81

Transient, meiosis-induced expression of the rec6 and rec12 genes of Schizosaccharomyces pombe. Genetics (1994) 1.79

Identification and characterization of a sphere organelle protein. J Cell Biol (1993) 1.78

Superior performance in protein homology detection with the Blocks Database servers. Nucleic Acids Res (1998) 1.77

AtVPS34, a phosphatidylinositol 3-kinase of Arabidopsis thaliana, is an essential protein with homology to a calcium-dependent lipid binding domain. Proc Natl Acad Sci U S A (1994) 1.72

Evolutionary relationship between K(+) channels and symporters. Biophys J (1999) 1.63

Expression of a polygalacturonase associated with tomato seed germination. Plant Physiol (1999) 1.59

PolyDoms: a whole genome database for the identification of non-synonymous coding SNPs with the potential to impact disease. Nucleic Acids Res (2006) 1.58

Cloning and characterization of PRA1, a gene encoding a novel pH-regulated antigen of Candida albicans. J Bacteriol (1998) 1.58

Mitochondrial morphological and functional defects in yeast caused by yme1 are suppressed by mutation of a 26S protease subunit homologue. Mol Biol Cell (1994) 1.57

Emerging themes in IFN-gamma-induced macrophage immunity by the p47 and p65 GTPase families. Immunobiology (2007) 1.54

Recognition of nascent RNA by the human La antigen: conserved and divergent features of structure and function. Mol Cell Biol (2001) 1.43

A novel family of TRF (DNA topoisomerase I-related function) genes required for proper nuclear segregation. Nucleic Acids Res (1996) 1.41

The Blocks database--a system for protein classification. Nucleic Acids Res (1996) 1.38

Cloning of the Schizosaccharomyces pombe gene encoding diadenosine 5',5"'-P1,P4-tetraphosphate (Ap4A) asymmetrical hydrolase: sequence similarity with the histidine triad (HIT) protein family. Biochem J (1995) 1.38

BadR, a new MarR family member, regulates anaerobic benzoate degradation by Rhodopseudomonas palustris in concert with AadR, an Fnr family member. J Bacteriol (1999) 1.28

The lonS gene regulates swarmer cell differentiation of Vibrio parahaemolyticus. J Bacteriol (1997) 1.27

Regulation of the sol locus genes for butanol and acetone formation in Clostridium acetobutylicum ATCC 824 by a putative transcriptional repressor. J Bacteriol (1999) 1.25

Discovering active motifs in sets of related protein sequences and using them for classification. Nucleic Acids Res (1994) 1.24

Functional analysis of the early steps of carotenoid biosynthesis in tobacco. Plant Physiol (2002) 1.22

Having a BLAST with bioinformatics (and avoiding BLASTphemy). Genome Biol (2001) 1.15

Phylogeny of protein-folding trajectories reveals a unique pathway to native structure. Proc Natl Acad Sci U S A (2004) 1.10

Discovering structural correlations in alpha-helices. Protein Sci (1994) 1.10

The SBASE protein domain library, release 3.0: a collection of annotated protein sequence segments. Nucleic Acids Res (1994) 1.08

The SBASE protein domain library, release 2.0: a collection of annotated protein sequence segments. Nucleic Acids Res (1993) 1.07

Drosophila genomic sequence annotation using the BLOCKS+ database. Genome Res (2000) 1.04

Attachment of Agrobacterium tumefaciens to carrot cells and Arabidopsis wound sites is correlated with the presence of a cell-associated, acidic polysaccharide. J Bacteriol (1997) 1.04

Progress with the PRINTS protein fingerprint database. Nucleic Acids Res (1996) 1.03

Substitution of Asp-309 by Asn in the Arg-Asp-Pro (RDP) motif of Acetobacter diazotrophicus levansucrase affects sucrose hydrolysis, but not enzyme specificity. Biochem J (1999) 1.01

Cloning, analysis, and overexpression of the gene encoding isobutylamine N-hydroxylase from the valanimycin producer, Streptomyces viridifaciens. J Bacteriol (1997) 0.99

Evidence that the RdeA protein is a component of a multistep phosphorelay modulating rate of development in Dictyostelium. EMBO J (1998) 0.98

Molecular analysis of an enhancin gene in the Lymantria dispar nuclear polyhedrosis virus. J Virol (1997) 0.96

Multiple domain protein diagnostic patterns. Protein Sci (1996) 0.95

Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome. Nucleic Acids Res (2008) 0.94

A homology identification method that combines protein sequence and structure information. Protein Sci (1998) 0.92

mBLAST: Keeping up with the sequencing explosion for (meta)genome analysis. J Data Mining Genomics Proteomics (2013) 0.92

PHOG-BLAST--a new generation tool for fast similarity search of protein families. BMC Evol Biol (2006) 0.91

iMOT: an interactive package for the selection of spatially interacting motifs. Nucleic Acids Res (2004) 0.88

A putative homolog of U2AF65 in S. cerevisiae. Nucleic Acids Res (1992) 0.88

Chimeric tRNAs as tools to induce proteome damage and identify components of stress responses. Nucleic Acids Res (2009) 0.88

Molecular characterization of the Erwinia chrysanthemi kdgK gene involved in pectin degradation. J Bacteriol (1994) 0.87

The three heavy-chain precursors for the inter-alpha-inhibitor family in mouse: new members of the multicopper oxidase protein group with differential transcription in liver and brain. Biochem J (1995) 0.86

Genes optimized by evolution for accurate and fast translation encode in Archaea and Bacteria a broad and characteristic spectrum of protein functions. BMC Genomics (2010) 0.86

Characterization of in vitro DNA binding sites of the EUO protein of Chlamydia psittaci. Infect Immun (2000) 0.86

PCAS--a precomputed proteome annotation database resource. BMC Genomics (2003) 0.86

Mutations in the Corynebacterium glutamicum proline biosynthetic pathway: a natural bypass of th proA step. J Bacteriol (1996) 0.85

Cloning and characterization of CSP37, a novel gene encoding a putative membrane protein of Candida albicans. J Bacteriol (1997) 0.84

Characterization of the PNT1 pentamidine resistance gene of Saccharomyces cerevisiae. Antimicrob Agents Chemother (1994) 0.84

A nuclear juvenile hormone-binding protein from larvae of Manduca sexta: a putative receptor for the metamorphic action of juvenile hormone. Proc Natl Acad Sci U S A (1994) 0.83

Multicopy suppression by asd gene and osmotic stress-dependent complementation by heterologous proA in proA mutants. J Bacteriol (1995) 0.83

Tools and resources for identifying protein families, domains and motifs. Genome Biol (2001) 0.82

A novel substitution matrix fitted to the compositional bias in Mollicutes improves the prediction of homologous relationships. BMC Bioinformatics (2011) 0.81

Evolution of spliceosomal introns following endosymbiotic gene transfer. BMC Evol Biol (2010) 0.81

Structural conservation of a short, functional, peptide-sequence motif. Front Biosci (Landmark Ed) (2009) 0.78

In silico identification of functional protein interfaces. Comp Funct Genomics (2003) 0.78

Feature amplified voting algorithm for functional analysis of protein superfamily. BMC Genomics (2010) 0.78

Bayesian profiling of molecular signatures to predict event times. Theor Biol Med Model (2007) 0.77

Purification of a Zn-binding phloem protein with sequence identity to chitin-binding proteins. Plant Physiol (1996) 0.76

TransCent: computational enzyme design by transferring active sites and considering constraints relevant for catalysis. BMC Bioinformatics (2009) 0.76

MotViz: a tool for sequence motif prediction in parallel to structural visualization and analyses. Genomics Proteomics Bioinformatics (2012) 0.75

Optimal neighborhood indexing for protein similarity search. BMC Bioinformatics (2008) 0.75

Prediction and Fourier-transform infrared-spectroscopy estimation of the secondary structure of a recombinant beta-glucosidase from Streptomyces sp. (ATCC 11238). Biochem J (1995) 0.75

QuaBingo: A Prediction System for Protein Quaternary Structure Attributes Using Block Composition. Biomed Res Int (2016) 0.75

Searching for factors that distinguish disease-prone and disease-resistant prions via sequence analysis. Bioinform Biol Insights (2008) 0.75

Predicting Amino Acid Substitution Probabilities Using Single Nucleotide Polymorphisms. Genetics (2017) 0.75

Articles cited by this

Profile analysis: detection of distantly related proteins. Proc Natl Acad Sci U S A (1987) 29.26

Rapid and sensitive sequence comparison with FASTP and FASTA. Methods Enzymol (1990) 17.64

A workbench for multiple alignment construction and analysis. Proteins (1991) 16.96

Identification of protein sequence homology by consensus template alignment. J Mol Biol (1986) 13.73

Automatic generation of primary sequence patterns from sets of related protein sequences. Proc Natl Acad Sci U S A (1990) 12.96

Detecting homology of distantly related proteins with consensus sequences. J Mol Biol (1987) 11.70

A flexible method to align large numbers of biological sequences. J Mol Evol (1989) 11.34

The SWISS-PROT protein sequence data bank. Nucleic Acids Res (1991) 9.45

PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Res (1991) 9.18

Finding sequence motifs in groups of functionally related proteins. Proc Natl Acad Sci U S A (1990) 7.65

Predictive motifs derived from cytosine methyltransferases. Nucleic Acids Res (1989) 7.63

Finding protein similarities with nucleotide sequence databases. Methods Enzymol (1990) 6.63

Protein database searches for multiple alignments. Proc Natl Acad Sci U S A (1990) 5.52

Multiple sequence alignment of protein families showing low sequence homology: a methodological approach using database pattern-matching discriminators for G-protein-linked receptors. Gene (1991) 3.50

A fast and sensitive multiple sequence alignment algorithm. Comput Appl Biosci (1989) 2.66

Searching for patterns in protein and nucleic acid sequences. Methods Enzymol (1990) 2.39

Phosphotransferase sequence homology. Nature (1987) 2.21

MacPattern: protein pattern searching on the Apple Macintosh. Comput Appl Biosci (1991) 2.20

Agmenellum quadruplicatum M.AquI, a novel modification methylase. J Bacteriol (1990) 2.01

Nucleotide sequence of a cDNA coding for the NADPH-protochlorophyllide oxidoreductase (PCR) of barley (Hordeum vulgare L.) and its expression in Escherichia coli. Mol Gen Genet (1989) 1.89

Understanding structural relationships in proteins of unsolved three-dimensional structure. Proteins (1990) 1.65

Cloning and sequencing of protochlorophyllide reductase. Biochem J (1990) 1.32

Isolation and expression of rat liver sepiapterin reductase cDNA. Proc Natl Acad Sci U S A (1990) 1.31

Pseudomonas cepacia 2,2-dialkylglycine decarboxylase. Sequence and expression in Escherichia coli of structural and repressor genes. J Biol Chem (1990) 1.26

PROMOT: a FORTRAN program to scan protein sequences against a library of known motifs. Comput Appl Biosci (1991) 1.18

Articles by these authors

Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A (1992) 61.33

Position-based sequence weights. J Mol Biol (1994) 24.41

Embedding strategies for effective use of information from multiple sequence alignments. Protein Sci (1997) 11.25

Automated construction and graphical presentation of protein blocks from unaligned sequences. Gene (1995) 6.11

Consensus-degenerate hybrid oligonucleotide primers for amplification of distantly related sequences. Nucleic Acids Res (1998) 5.70

Protein family classification based on searching a database of blocks. Genomics (1994) 5.24

Performance evaluation of amino acid substitution matrices. Proteins (1993) 4.46

The risk for and severity of bleeding complications in elderly patients treated with warfarin. The National Consortium of Anticoagulation Clinics. Ann Intern Med (1996) 3.79

Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations. Bioinformatics (1999) 3.70

PHAT: a transmembrane-specific substitution matrix. Predicted hydrophobic and transmembrane. Bioinformatics (2000) 3.66

Sequence analysis by electronic mail server. Trends Biochem Sci (1993) 2.94

Superior performance in protein homology detection with the Blocks Database servers. Nucleic Acids Res (1998) 1.77

The Blocks database--a system for protein classification. Nucleic Acids Res (1996) 1.38

Transcriptional activator components and poxvirus DNA-dependent ATPases comprise a single family. Trends Biochem Sci (1993) 1.23

Playing with blocks: some pitfalls of forcing multiple alignments. New Biol (1991) 1.22

A computerized intervention to improve timing of outpatient follow-up: a multicenter randomized trial in patients treated with warfarin. National Consortium of Anticoagulation Clinics. J Gen Intern Med (1994) 1.12

Amino acid substitution matrices. Adv Protein Chem (2000) 0.87

Exploring protein homology with the Blocks server. Trends Genet (1998) 0.78