Identifying protein-binding sites from unaligned DNA fragments.

PubWeight™: 14.45 | Rank: Top 0.1% | All-Time Top 10000

Article: PMC 286650

Published in Proc Natl Acad Sci U S A on February 01, 1989

Authors

G D Stormo (1), G W Hartzell

Author Affiliations

1: Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder 80309.

Articles citing this

(truncated to the top 100)

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks. Proc Natl Acad Sci U S A (1994) 18.46

RegulonDB (version 4.0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12. Nucleic Acids Res (2004) 5.73

A hidden Markov model that finds genes in E. coli DNA. Nucleic Acids Res (1994) 5.50

Additivity in protein-DNA interactions: how good an approximation is it? Nucleic Acids Res (2002) 5.04

Splicing signals in Drosophila: intron size, information content, and consensus sequences. Nucleic Acids Res (1992) 4.93

Integrating regulatory motif discovery and genome-wide expression analysis. Proc Natl Acad Sci U S A (2003) 4.74

Predicting gene regulatory elements in silico on a genomic scale. Genome Res (1998) 4.52

Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes. Nucleic Acids Res (2001) 4.29

RSAT: regulatory sequence analysis tools. Nucleic Acids Res (2008) 3.93

The complete genomes and proteomes of 27 Staphylococcus aureus bacteriophages. Proc Natl Acad Sci U S A (2005) 3.84

Discovering regulatory elements in non-coding sequences by analysis of spaced dyads. Nucleic Acids Res (2000) 3.72

Extensive low-affinity transcriptional interactions in the yeast genome. Genome Res (2006) 3.67

Searching for and predicting the activity of sites for DNA binding proteins: compilation and analysis of the binding sites for Escherichia coli integration host factor (IHF). Nucleic Acids Res (1990) 3.46

RNA pseudoknots that inhibit human immunodeficiency virus type 1 reverse transcriptase. Proc Natl Acad Sci U S A (1992) 3.46

CisModule: de novo discovery of cis-regulatory modules by hierarchical mixture modeling. Proc Natl Acad Sci U S A (2004) 3.37

PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny. PLoS Comput Biol (2005) 3.35

Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis. Proc Natl Acad Sci U S A (2000) 3.18

The p53MH algorithm and its application in detecting p53-responsive genes. Proc Natl Acad Sci U S A (2002) 3.11

Computer-assisted prediction, classification, and delimitation of protein binding sites in nucleic acids. Nucleic Acids Res (1993) 3.06

A biophysical approach to transcription factor binding site discovery. Genome Res (2003) 2.62

Comparative genomic reconstruction of transcriptional regulatory networks in bacteria. Chem Rev (2007) 2.40

SwissRegulon: a database of genome-wide annotations of regulatory sites. Nucleic Acids Res (2006) 2.30

Consensus DNA site for the Escherichia coli catabolite gene activator protein (CAP): CAP exhibits a 450-fold higher affinity for the consensus DNA site than for the E. coli lac DNA site. Nucleic Acids Res (1989) 2.24

The evolution of DNA regulatory regions for proteo-gamma bacteria by interspecies comparisons. Genome Res (2002) 2.22

Comparative genomic analysis of 18 Pseudomonas aeruginosa bacteriophages. J Bacteriol (2006) 2.18

Discovering common stem-loop motifs in unaligned RNA sequences. Nucleic Acids Res (2001) 2.12

Identification of the binding sites of regulatory proteins in bacterial genomes. Proc Natl Acad Sci U S A (2002) 2.07

Computational identification of transcriptional regulatory elements in DNA sequence. Nucleic Acids Res (2006) 1.88

Probabilistic clustering of sequences: inferring new bacterial regulons by comparative genomics. Proc Natl Acad Sci U S A (2002) 1.78

Identifying the conserved network of cis-regulatory sites of a eukaryotic genome. Proc Natl Acad Sci U S A (2005) 1.72

On the detection and refinement of transcription factor binding sites using ChIP-Seq data. Nucleic Acids Res (2010) 1.67

Identification of the REST regulon reveals extensive transposable element-mediated binding site duplication. Nucleic Acids Res (2006) 1.62

Improved models for transcription factor binding site identification using nonindependent interactions. Genetics (2012) 1.61

Identification of RNA-protein interaction networks using PAR-CLIP. Wiley Interdiscip Rev RNA (2011) 1.61

Local graph alignment and motif search in biological networks. Proc Natl Acad Sci U S A (2004) 1.56

Saccharomyces genome database provides new regulation data. Nucleic Acids Res (2013) 1.52

Modeling the specificity of protein-DNA interactions. Quant Biol (2013) 1.50

Computational approaches to identify promoters and cis-regulatory elements in plant genomes. Plant Physiol (2003) 1.50

Construction and analysis of a profile library characterizing groups of structurally known proteins. Protein Sci (1996) 1.47

The p53 tumor suppressor network is a key responder to microenvironmental components of chronic inflammatory stress. Cancer Res (2005) 1.42

An efficient algorithm for identifying matches with errors in multiple long molecular sequences. J Mol Biol (1991) 1.32

Differences in LexA regulon structure among Proteobacteria through in vivo assisted comparative genomics. Nucleic Acids Res (2004) 1.30

Non-canonical CRP sites control competence regulons in Escherichia coli and many other gamma-proteobacteria. Nucleic Acids Res (2006) 1.28

Dinucleotide weight matrices for predicting transcription factor binding sites: generalizing the position weight matrix. PLoS One (2010) 1.23

Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites. Nucleic Acids Res (2005) 1.22

From biophysics to evolutionary genetics: statistical aspects of gene regulation. BMC Bioinformatics (2007) 1.22

Identification of context-dependent motifs by contrasting ChIP binding data. Bioinformatics (2010) 1.19

Myc and Mad bHLHZ domains possess identical DNA-binding specificities but only partially overlapping functions in vivo. Proc Natl Acad Sci U S A (2002) 1.18

Multiplatform genome-wide identification and modeling of functional human estrogen receptor binding sites. Genome Biol (2006) 1.17

Ab initio identification of putative human transcription factor binding sites by comparative genomics. BMC Bioinformatics (2005) 1.12

A reexamination of information theory-based methods for DNA-binding site identification. BMC Bioinformatics (2009) 1.12

Analysis of consensus sequence patterns in Giardia cytoskeleton gene promoters. Nucleic Acids Res (1995) 1.09

Finding important sites in protein sequences. Proc Natl Acad Sci U S A (2002) 1.05

Extracting sequence features to predict protein-DNA interactions: a comparative study. Nucleic Acids Res (2008) 1.04

Rewiring of PDZ domain-ligand interaction network contributed to eukaryotic evolution. PLoS Genet (2012) 1.02

Reliable prediction of transcription factor binding sites by phylogenetic verification. Proc Natl Acad Sci U S A (2005) 1.02

Efficient motif finding algorithms for large-alphabet inputs. BMC Bioinformatics (2010) 1.02

Computational identification and functional validation of regulatory motifs in cartilage-expressed genes. Genome Res (2007) 1.01

Fast multiple alignment of ungapped DNA sequences using information theory and a relaxation method. Discrete Appl Math (1996) 0.98

Fur controls iron homeostasis and oxidative stress defense in the oligotrophic alpha-proteobacterium Caulobacter crescentus. Nucleic Acids Res (2009) 0.98

Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes. Nucleic Acids Res (2009) 0.98

Bipartite pattern discovery by entropy minimization-based multiple local alignment. Nucleic Acids Res (2004) 0.96

PhyloGibbs-MP: module prediction and discriminative motif-finding by Gibbs sampling. PLoS Comput Biol (2008) 0.95

Comparative analysis of regulatory motif discovery tools for transcription factor binding sites. Genomics Proteomics Bioinformatics (2007) 0.95

DNA binding specificity and sequence of Xanthomonas campestris catabolite gene activator protein-like protein. J Bacteriol (1992) 0.94

Fis regulates transcriptional induction of RpoS in Salmonella enterica. J Bacteriol (2005) 0.93

Melina II: a web tool for comparisons among several predictive algorithms to find potential motifs from promoter regions. Nucleic Acids Res (2007) 0.92

Tmod: toolbox of motif discovery. Bioinformatics (2009) 0.91

Systematic identification of conserved motif modules in the human genome. BMC Genomics (2010) 0.91

Evidence classification of high-throughput protocols and confidence integration in RegulonDB. Database (Oxford) (2013) 0.90

COTRASIF: conservation-aided transcription-factor-binding site finder. Nucleic Acids Res (2009) 0.89

RecMotif: a novel fast algorithm for weak motif discovery. BMC Bioinformatics (2010) 0.89

Characterization of the gcd gene from Escherichia coli K-12 W3110 and regulation of its expression. J Bacteriol (1993) 0.86

RNA ligands to human nerve growth factor. Nucleic Acids Res (1995) 0.86

Data Compression Concepts and Algorithms and their Applications to Bioinformatics. Entropy (Basel) (2010) 0.86

Sequence evolution of the intrinsically disordered and globular domains of a model viral oncoprotein. PLoS One (2012) 0.85

Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes. BMC Bioinformatics (2010) 0.85

Splicing enhancement in the yeast rp51b intron. RNA (2000) 0.85

Motif Discovery in Physiological Datasets: A Methodology for Inferring Predictive Elements. ACM Trans Knowl Discov Data (2010) 0.85

Better estimation of protein-DNA interaction parameters improve prediction of functional sites. BMC Biotechnol (2008) 0.84

Systematic prediction of cis-regulatory elements in the Chlamydomonas reinhardtii genome using comparative genomics. Plant Physiol (2012) 0.84

Statistical Issues in the Analysis of ChIP-Seq and RNA-Seq Data. Genes (Basel) (2010) 0.83

DNA Motif Detection Using Particle Swarm Optimization and Expectation-Maximization. Proc IEEE Swarm Intell Symp (2005) 0.83

PairMotif+: a fast and effective algorithm for de novo motif discovery in DNA sequences. Int J Biol Sci (2013) 0.83

Characterization of a new tissue-specific transcription factor binding to the simian virus 40 enhancer TC-II (NF-kappa B) element. Mol Cell Biol (1992) 0.82

GANN: genetic algorithm neural networks for the detection of conserved combinations of features in DNA. BMC Bioinformatics (2005) 0.82

Paired hormone response elements predict caveolin-1 as a glucocorticoid target gene. PLoS One (2010) 0.81

Bayesian multiple-instance motif discovery with BAMBI: inference of recombinase and transcription factor binding sites. Nucleic Acids Res (2011) 0.81

PDZ domain-containing 1 (PDZK1) protein regulates phospholipase C-β3 (PLC-β3)-specific activation of somatostatin by forming a ternary complex with PLC-β3 and somatostatin receptors. J Biol Chem (2012) 0.81

Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes. Nucleic Acids Res (2009) 0.80

Identification of a DNA structural motif that includes the binding sites for Sp1, p53 and GA-binding protein. Nucleic Acids Res (1993) 0.80

MotifClick: prediction of cis-regulatory binding sites via merging cliques. BMC Bioinformatics (2011) 0.80

Genome-wide identification of transcription factors and transcription-factor binding sites in oleaginous microalgae Nannochloropsis. Sci Rep (2014) 0.80

Identification of cis-regulatory modules in promoters of human genes exploiting mutual positioning of transcription factors. Nucleic Acids Res (2013) 0.80

Multi-alphabet consensus algorithm for identification of low specificity protein-DNA interactions. Nucleic Acids Res (1995) 0.80

A novel swarm intelligence algorithm for finding DNA motifs. Int J Comput Biol Drug Des (2009) 0.80

GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units. PLoS One (2012) 0.79

Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites. BMC Genomics (2012) 0.79

Quality of regulatory elements in Drosophila retrogenes. Genomics (2008) 0.79

Articles cited by this

Compilation and analysis of Escherichia coli promoter DNA sequences. Nucleic Acids Res (1983) 38.72

Information content of binding sites on nucleotide sequences. J Mol Biol (1986) 30.48

Profile analysis: detection of distantly related proteins. Proc Natl Acad Sci U S A (1987) 29.26

Computer methods to locate signals in nucleic acid sequences. Nucleic Acids Res (1984) 21.53

Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. J Mol Biol (1987) 17.53

Analysis of E. coli promoter sequences. Nucleic Acids Res (1987) 14.12

Cyclic AMP receptor protein: role in transcription activation. Science (1984) 8.37

Escherichia coli promoter sequences predict in vitro RNA polymerase selectivity. Nucleic Acids Res (1984) 8.00

Methods to define and locate patterns of motifs in sequences. Comput Appl Biosci (1988) 5.83

Rigorous pattern-recognition methods for DNA sequences. Analysis of promoter sequences from Escherichia coli. J Mol Biol (1985) 4.95

Multiple sequence alignment. J Mol Biol (1986) 4.27

A perfectly symmetric lac operator binds the lac repressor very tightly. Proc Natl Acad Sci U S A (1983) 4.24

Profile scanning for three-dimensional structural patterns in protein sequences. Comput Appl Biosci (1988) 3.66

Molecular cloning and expression of the biodegradative threonine dehydratase gene (tdc) of Escherichia coli K12. Mol Gen Genet (1985) 2.83

Computer methods for analyzing sequence recognition of nucleic acids. Annu Rev Biophys Biophys Chem (1988) 2.72

A DNA sequence containing the control regions of the malEFG and malK-lamB operons in Escherichia coli K12. Mol Gen Genet (1982) 1.88

The catabolite-sensitive promoter for the chloramphenicol acetyl transferase gene is preceded by two binding sites for the catabolite gene activator protein. J Bacteriol (1982) 1.87

Preliminary X-ray crystallographic analysis of canine parvovirus crystals. J Mol Biol (1988) 1.46

Genome projects ready to go. Science (1988) 1.39

Articles by these authors

The structure and function of the homeodomain. Biochim Biophys Acta (1989) 7.44

Identification of consensus patterns in unaligned DNA sequences known to be functionally related. Comput Appl Biosci (1990) 6.16