Modeling the specificity of protein-DNA interactions.

PubWeight™: 1.50 | Rank: Top 4%

PMID 25045190

Published in Quant Biol on June 01, 2013

Authors

Gary D Stormo

Articles citing this

JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles. Nucleic Acids Res (2013) 6.12

JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles. Nucleic Acids Res (2015) 2.22

Absence of a simple code: how transcription factors read the genome. Trends Biochem Sci (2014) 1.84

TFBSshape: a motif database for DNA shape features of transcription factor binding sites. Nucleic Acids Res (2013) 1.60

The next generation of transcription factor binding site prediction. PLoS Comput Biol (2013) 1.55

Quantitative modeling of transcription factor binding specificities using DNA shape. Proc Natl Acad Sci U S A (2015) 1.46

Specificity and nonspecificity in RNA-protein interactions. Nat Rev Mol Cell Biol (2015) 1.08

Rapid characterization of CRISPR-Cas9 protospacer adjacent motif sequence elements. Genome Biol (2015) 0.93

Protection from oxidative stress relies mainly on derepression of OxyR-dependent KatB and Dps in Shewanella oneidensis. J Bacteriol (2013) 0.90

DNA Shape Features Improve Transcription Factor Binding Site Predictions In Vivo. Cell Syst (2016) 0.88

Analysis of the RNA Binding Specificity Landscape of C5 Protein Reveals Structure and Sequence Preferences that Direct RNase P Specificity. Cell Chem Biol (2016) 0.87

Approaches for establishing the function of regulatory genetic variants involved in disease. Genome Med (2014) 0.87

High-resolution specificity from DNA sequencing highlights alternative modes of Lac repressor binding. Genetics (2014) 0.86

Identifying transcriptional cis-regulatory modules in animal genomes. Wiley Interdiscip Rev Dev Biol (2014) 0.82

Spec-seq: determining protein-DNA-binding specificity by sequencing. Brief Funct Genomics (2014) 0.80

Predicting transcription factor site occupancy using DNA sequence intrinsic and cell-type specific chromatin features. BMC Bioinformatics (2016) 0.78

CAGEd-oPOSSUM: motif enrichment analysis from CAGE-derived TSSs. Bioinformatics (2016) 0.78

Exploring comprehensive within-motif dependence of transcription factor binding in Escherichia coli. Sci Rep (2015) 0.78

Imputation for transcription factor binding predictions based on deep learning. PLoS Comput Biol (2017) 0.77

MORPHEUS, a Webtool for Transcription Factor Binding Analysis Using Position Weight Matrices with Dependency. PLoS One (2015) 0.77

Response Element Composition Governs Correlations between Binding Site Affinity and Transcription in Glucocorticoid Receptor Feed-forward Loops. J Biol Chem (2015) 0.77

Shapely DNA attracts the right partner. Proc Natl Acad Sci U S A (2015) 0.76

Quantitative modeling of gene expression using DNA shape features of binding sites. Nucleic Acids Res (2016) 0.75

Mapping specificity landscapes of RNA-protein interactions by high throughput sequencing. Methods (2017) 0.75

A quantitative understanding of lac repressor's binding specificity and flexibility. Quant Biol (2015) 0.75

Bioinformatic prediction of transcription factor binding sites at promoter regions of genes for photoperiod and vernalization responses in model and temperate cereal plants. BMC Genomics (2016) 0.75

Exposing the secrets of sex determination. Nat Struct Mol Biol (2015) 0.75

Quantifying the Impact of Non-coding Variants on Transcription Factor-DNA Binding. Res Comput Mol Biol (2017) 0.75

Inherent limitations of probabilistic models for protein-DNA binding specificity. PLoS Comput Biol (2017) 0.75

Articles cited by this

DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A (1977) 790.54

A new method for sequencing DNA. Proc Natl Acad Sci U S A (1977) 250.51

Profile hidden Markov models. Bioinformatics (1998) 56.04

Regulatory sequences involved in the promotion and termination of RNA transcription. Annu Rev Genet (1979) 54.83

A catalogue of splice junction sequences. Nucleic Acids Res (1982) 39.41

Compilation and analysis of Escherichia coli promoter DNA sequences. Nucleic Acids Res (1983) 38.72

Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol (1994) 37.96

Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science (1993) 36.84

Sequence logos: a new way to display consensus sequences. Nucleic Acids Res (1990) 36.74

Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. Science (1990) 35.96

Information content of binding sites on nucleotide sequences. J Mol Biol (1986) 30.48

Computer methods to locate signals in nucleic acid sequences. Nucleic Acids Res (1984) 21.53

MatInd and MatInspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence data. Nucleic Acids Res (1995) 21.22

E. coli RNA polymerase interacts homologously with two different promoters. Cell (1980) 19.71

Translational initiation in prokaryotes. Annu Rev Microbiol (1981) 18.46

Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. J Mol Biol (1987) 17.53

Systematic localization of common disease-associated variation in regulatory DNA. Science (2012) 14.47

Identifying protein-binding sites from unaligned DNA fragments. Proc Natl Acad Sci U S A (1989) 14.45

Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics (1999) 14.36

Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nat Biotechnol (1998) 13.99

An integrated software system for analyzing ChIP-chip and ChIP-seq data. Nat Biotechnol (2008) 13.96

Characterization of translational initiation sites in E. coli. Nucleic Acids Res (1982) 11.91

MATCH: A tool for searching transcription factor binding sites in DNA sequences. Nucleic Acids Res (2003) 11.82

An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments. Nat Biotechnol (2002) 10.23

Global mapping of protein-DNA interactions in vivo by digital genomic footprinting. Nat Methods (2009) 10.17

Use of the 'Perceptron' algorithm to distinguish translational initiation sites in E. coli. Nucleic Acids Res (1982) 9.99

Diversity and complexity in DNA recognition by transcription factors. Science (2009) 9.07

Regulatory element detection using correlation with expression. Nat Genet (2001) 8.92

Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities. Nat Biotechnol (2006) 8.38

Escherichia coli promoter sequences predict in vitro RNA polymerase selectivity. Nucleic Acids Res (1984) 8.00

An expansive human regulatory lexicon encoded in transcription factor footprints. Nature (2012) 7.27

DNA recognition by Cys2His2 zinc finger proteins. Annu Rev Biophys Biomol Struct (2000) 6.61

MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics (2011) 6.52

Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data. Nat Rev Genet (2011) 6.52

DNase I sensitivity QTLs are a major determinant of human expression variation. Nature (2012) 6.17

Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data. Genome Res (2010) 5.40

A systems approach to measuring the binding energy landscapes of transcription factors. Science (2007) 5.06

Additivity in protein-DNA interactions: how good an approximation is it? Nucleic Acids Res (2002) 5.04

Specificity, free energy and information content in protein-DNA interactions. Trends Biochem Sci (1998) 5.00

Rigorous pattern-recognition methods for DNA sequences. Analysis of promoter sequences from Escherichia coli. J Mol Biol (1985) 4.95

Nucleotides of transcription factor binding sites exert interdependent effects on the binding affinities of transcription factors. Nucleic Acids Res (2002) 4.83

An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences. Proteins (1990) 4.42

Finding the most significant common sequence and structure motifs in a set of RNA sequences. Nucleic Acids Res (1997) 4.04

Search algorithm for pattern match analysis of nucleic acid sequences. Nucleic Acids Res (1983) 3.81

The ribosome binding sites recognized by E. coli ribosomes have regions with signal character in both the leader and protein coding segments. Nucleic Acids Res (1980) 3.74

Extensive low-affinity transcriptional interactions in the yeast genome. Genome Res (2006) 3.67

Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE. Bioinformatics (2006) 3.67

Non-independence of Mnt repressor-operator interaction determined by a new quantitative multiple fluorescence relative affinity (QuMFRA) assay. Nucleic Acids Res (2001) 3.63

Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities. Genome Res (2010) 3.46

Cofactor binding evokes latent differences in DNA binding specificity between Hox proteins. Cell (2011) 3.40

On the specificity of DNA-protein interactions. Proc Natl Acad Sci U S A (1986) 3.11

Quantitative analysis of the relationship between nucleotide sequence and functional activity. Nucleic Acids Res (1986) 3.02

Analysis of the sequence-specific interactions between Cro repressor and operator DNA by systematic base substitution experiments. Proc Natl Acad Sci U S A (1989) 2.76

Inferring binding energies from selected binding sites. PLoS Comput Biol (2009) 2.66

A biophysical approach to transcription factor binding site discovery. Genome Res (2003) 2.62

Lambda repressor recognizes the approximately 2-fold symmetric half-operator sequences asymmetrically. Proc Natl Acad Sci U S A (1989) 2.61

Direct measurement of DNA affinity landscapes on a high-throughput sequencing instrument. Nat Biotechnol (2011) 2.60

Quantitative analysis demonstrates most transcription factors require only simple models of specificity. Nat Biotechnol (2011) 2.58

Circuitry and dynamics of human transcription factor regulatory networks. Cell (2012) 2.56

Dissecting the regulatory architecture of gene expression QTLs. Genome Biol (2012) 2.51

Inferring direct DNA binding from ChIP-seq. Nucleic Acids Res (2012) 2.41

Determining the specificity of protein-DNA interactions. Nat Rev Genet (2010) 2.38

Evaluation of methods for modeling transcription factor sequence specificity. Nat Biotechnol (2013) 2.34

A systematic characterization of factors that regulate Drosophila segmentation via a bacterial one-hybrid system. Nucleic Acids Res (2008) 2.30

Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragments. J Mol Biol (1992) 2.29

A bacterial one-hybrid system for determining the DNA-binding specificity of transcription factors. Nat Biotechnol (2005) 2.28

Probabilistic code for DNA recognition by proteins of the EGR family. J Mol Biol (2002) 2.23

UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein-DNA interactions. Nucleic Acids Res (2010) 2.16

A nucleosome-guided map of transcription factor binding sites in yeast. PLoS Comput Biol (2007) 2.15

De novo identification and biophysical characterization of transcription-factor binding sites with microfluidic affinity analysis. Nat Biotechnol (2010) 2.05

Pattern recognition in several sequences: consensus and alignment. Bull Math Biol (1984) 1.96

Quantitative specificity of the Mnt repressor. J Mol Biol (1997) 1.91

Bind-n-Seq: high-throughput analysis of in vitro protein-DNA interactions using massively parallel sequencing. Nucleic Acids Res (2009) 1.88

An optimized two-finger archive for ZFN-mediated gene targeting. Nat Methods (2012) 1.84

Zinc finger protein-dependent and -independent contributions to the in vivo off-target activity of zinc finger nucleases. Nucleic Acids Res (2010) 1.81

Ab initio prediction of transcription factor targets using structural knowledge. PLoS Comput Biol (2005) 1.64

The discovery of zinc fingers and their development for practical applications in gene regulation and genome manipulation. Q Rev Biophys (2010) 1.63

Improved models for transcription factor binding site identification using nonindependent interactions. Genetics (2012) 1.61

A statistical model for investigating binding probabilities of DNA nucleotide sequences using microarrays. Biometrics (2002) 1.54

High resolution models of transcription factor-DNA affinities improve in vitro and in vivo binding predictions. PLoS Comput Biol (2010) 1.52

Predictive modeling of genome-wide mRNA expression: from modules to molecules. Annu Rev Biophys Biomol Struct (2007) 1.51

Design of compact, universal DNA microarrays for protein binding microarray experiments. J Comput Biol (2008) 1.45

Non-DNA-binding cofactors enhance DNA-binding specificity of a transcriptional regulatory complex. Mol Syst Biol (2011) 1.34

Profiling the DNA-binding specificities of engineered Cys2His2 zinc finger domains using a rapid cell-based method. Nucleic Acids Res (2007) 1.17

Discovering structural cis-regulatory elements by modeling the behaviors of mRNAs. Mol Syst Biol (2009) 1.17

Maximally efficient modeling of DNA sequence motifs at all levels of complexity. Genetics (2011) 1.10

Putting numbers on the network connections. Bioessays (2007) 1.02

fREDUCE: detection of degenerate regulatory elements using correlation with expression. BMC Bioinformatics (2007) 1.01

A modified bacterial one-hybrid system yields improved quantitative models of transcription factor specificity. Nucleic Acids Res (2011) 0.97

Using defined finger-finger interfaces as units of assembly for constructing zinc-finger nucleases. Nucleic Acids Res (2013) 0.91

Neural networks for determining protein specificity and multiple alignment of binding sites. Proc Int Conf Intell Syst Mol Biol (1994) 0.87

Exploring the DNA-recognition potential of homeodomains. Genome Res (2012) 0.83

Articles by these authors

An improved map of conserved regulatory sites for Saccharomyces cerevisiae. BMC Bioinformatics (2006) 11.13

Comparative genomics identifies a flagellar and basal body proteome that includes the BBS5 human disease gene. Cell (2004) 6.10

Combining phylogenetic data with co-regulated genes to identify regulatory motifs. Bioinformatics (2003) 5.99

Additivity in protein-DNA interactions: how good an approximation is it? Nucleic Acids Res (2002) 5.04

enoLOGOS: a versatile web tool for energy normalized sequence logos. Nucleic Acids Res (2005) 4.03

Analysis of homeodomain specificities allows the family-wide prediction of preferred recognition sites. Cell (2008) 3.96

The AP-1 transcription factor Batf controls T(H)17 differentiation. Nature (2009) 3.75

Inferring binding energies from selected binding sites. PLoS Comput Biol (2009) 2.66

Quantitative analysis demonstrates most transcription factors require only simple models of specificity. Nat Biotechnol (2011) 2.58

Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%. Bioinformatics (2005) 2.43

The neuropeptide pigment-dispersing factor coordinates pacemaker interactions in the Drosophila circadian system. J Neurosci (2004) 2.42

Target selectivity of vertebrate notch proteins. Collaboration between discrete domains and CSL-binding site architecture determines activation probability. J Biol Chem (2005) 2.33

Influence of the period-dependent circadian clock on diurnal, circadian, and aperiodic gene expression in Drosophila melanogaster. Proc Natl Acad Sci U S A (2002) 2.33

Probabilistic code for DNA recognition by proteins of the EGR family. J Mol Biol (2002) 2.23

Is there a code for protein-DNA recognition? Probab(ilistical)ly... Bioessays (2002) 2.17

Direct, androgen receptor-mediated regulation of the FKBP5 gene via a distal enhancer element. Endocrinology (2005) 2.04

An iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots. Bioinformatics (2004) 1.89

Identification of a novel cis-regulatory element involved in the heat shock response in Caenorhabditis elegans using microarray gene expression and computational methods. Genome Res (2002) 1.87

An optimized two-finger archive for ZFN-mediated gene targeting. Nat Methods (2012) 1.84

A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles. Genome Res (2006) 1.78

Identifying the conserved network of cis-regulatory sites of a eukaryotic genome. Proc Natl Acad Sci U S A (2005) 1.72

Improved models for transcription factor binding site identification using nonindependent interactions. Genetics (2012) 1.61

Training the next generation of informaticians: the impact of "BISTI" and bioinformatics--a report from the American College of Medical Informatics. J Am Med Inform Assoc (2004) 1.53

Context-dependent DNA recognition code for C2H2 zinc-finger transcription factors. Bioinformatics (2008) 1.42

ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species. Nucleic Acids Res (2011) 1.37

A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences. Bioinformatics (2004) 1.35

RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment. Bioinformatics (2007) 1.32

Quantitative analysis of EGR proteins binding to DNA: assessing additivity in both the binding site and the protein. BMC Bioinformatics (2005) 1.31

Combining SELEX with quantitative assays to rapidly obtain accurate models of protein-DNA interactions. Nucleic Acids Res (2005) 1.27

Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites. Nucleic Acids Res (2005) 1.22

RNA interference of achaete-scute homolog 1 in mouse prostate neuroendocrine cells reveals its gene targets and DNA binding sites. Proc Natl Acad Sci U S A (2004) 1.22

Discovering structural cis-regulatory elements by modeling the behaviors of mRNAs. Mol Syst Biol (2009) 1.17

Making connections between novel transcription factors and their DNA motifs. Genome Res (2005) 1.16

PromoLign: a database for upstream region analysis and SNPs. Hum Mutat (2004) 1.15

Expression profiling using random genomic DNA microarrays identifies differentially expressed genes associated with three major developmental stages of the protozoan parasite Leishmania major. Mol Biochem Parasitol (2004) 1.14

Novel transcription regulatory elements in Caenorhabditis elegans muscle genes. Genome Res (2004) 1.13

Computational identification of the normal and perturbed genetic networks involved in myeloid differentiation and acute promyelocytic leukemia. Genome Biol (2008) 1.12

PolyMAPr: programs for polymorphism database mining, annotation, and functional analysis. Hum Mutat (2005) 1.12

PAP: a comprehensive workbench for mammalian transcriptional regulatory sequence analysis. Nucleic Acids Res (2007) 1.11

Recognition models to predict DNA-binding specificities of homeodomain proteins. Bioinformatics (2012) 1.09

Identification of muscle-specific regulatory modules in Caenorhabditis elegans. Genome Res (2007) 1.07

Global analysis of Drosophila Cys₂-His₂ zinc finger proteins reveals a multitude of novel recognition motifs and binding determinants. Genome Res (2013) 1.07

Molecular diagnostics in sepsis: from bedside to bench. J Am Coll Surg (2006) 1.02

An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins. Nucleic Acids Res (2014) 1.02

Computational identification and functional validation of regulatory motifs in cartilage-expressed genes. Genome Res (2007) 1.01

Analysis of Chlamydomonas reinhardtii genome structure using large-scale sequencing of regions on linkage groups I and III. J Eukaryot Microbiol (2003) 1.01

Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes. Bioinformatics (2010) 0.98

A modified bacterial one-hybrid system yields improved quantitative models of transcription factor specificity. Nucleic Acids Res (2011) 0.97

Novel modeling of combinatorial miRNA targeting identifies SNP with potential role in bone density. PLoS Comput Biol (2012) 0.97

The cis-regulatory map of Shewanella genomes. Nucleic Acids Res (2008) 0.96

ILM: a web server for predicting RNA secondary structures with pseudoknots. Nucleic Acids Res (2004) 0.96

Identification of cilia genes that affect cell-cycle progression using whole-genome transcriptome analysis in Chlamydomonas reinhardtti. G3 (Bethesda) (2013) 0.93

Sepsis gene expression profiling: murine splenic compared with hepatic responses determined by using complementary DNA microarrays. Crit Care Med (2002) 0.92

Quantitative modeling of DNA-protein interactions: effects of amino acid substitutions on binding specificity of the Mnt repressor. Nucleic Acids Res (2004) 0.92

Computational identification of the Spo0A-phosphate regulon that is essential for the cellular differentiation and development in Gram-positive spore-forming bacteria. Nucleic Acids Res (2003) 0.92

A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments. Bioinformatics (2007) 0.92

Editing efficiency of a Drosophila gene correlates with a distant splice site selection. RNA (2005) 0.92

Using defined finger-finger interfaces as units of assembly for constructing zinc-finger nucleases. Nucleic Acids Res (2013) 0.91

A nutrient-sensitive interaction between Sirt1 and HNF-1α regulates Crp expression. Aging Cell (2011) 0.91

Modeling the quantitative specificity of DNA-binding proteins from example binding sites. PLoS One (2009) 0.91

Transcriptional profiles of human epithelial cells in response to heat: computational evidence for novel heat shock proteins. Shock (2008) 0.90

Discovering cis-regulatory RNAs in Shewanella genomes by Support Vector Machines. PLoS Comput Biol (2009) 0.88

Specificity of Mnt 'master residue' obtained from in vivo and in vitro selections. Nucleic Acids Res (2002) 0.88

Improving gene-finding in Chlamydomonas reinhardtii: GreenGenie2. BMC Genomics (2009) 0.88

Procom: a web-based tool to compare multiple eukaryotic proteomes. Bioinformatics (2004) 0.87

Genome wide screens in yeast to identify potential binding sites and target genes of DNA-binding proteins. Nucleic Acids Res (2007) 0.87

Putting the Leishmania genome to work: functional genomics by transposon trapping and expression profiling. Philos Trans R Soc Lond B Biol Sci (2002) 0.87

Exploring the DNA-recognition potential of homeodomains. Genome Res (2012) 0.83

Discriminative motif optimization based on perceptron training. Bioinformatics (2013) 0.81

Evidence for active maintenance of inverted repeat structures identified by a comparative genomic approach. PLoS One (2007) 0.81

Conserved Motifs and Prediction of Regulatory Modules in Caenorhabditis elegans. G3 (Bethesda) (2012) 0.81

Spec-seq: determining protein-DNA-binding specificity by sequencing. Brief Funct Genomics (2014) 0.80

Detecting Coevolution of Functionally Related Proteins for Automated Protein Annotation. Proc IEEE Int Symp Bioinformatics Bioeng (2010) 0.79

Fast, sensitive discovery of conserved genome-wide motifs. J Comput Biol (2012) 0.78

Assessing the effects of symmetry on motif discovery and modeling. PLoS One (2011) 0.77

Evolution. Heirlooms in the attic. Science (2003) 0.75

Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans. Bioinformatics (2006) 0.75