PhyME: a probabilistic algorithm for finding motifs in sets of orthologous sequences.

PubWeight™: 3.63 | Rank: Top 1%

Article: PMC 534098

Published in BMC Bioinformatics on October 28, 2004

Authors

Saurabh Sinha (1), Mathieu Blanchette, Martin Tompa

Author Affiliations

1: Center for Studies in Physics and Biology, The Rockefeller University, New York, NY 10021, USA. saurabh@lonnrot.rockefeller.edu

Articles citing this

Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals. Nature (2005) 31.60

An improved map of conserved regulatory sites for Saccharomyces cerevisiae. BMC Bioinformatics (2006) 11.13

Limitations and potentials of current motif discovery algorithms. Nucleic Acids Res (2005) 4.28

The Forkhead transcription factor Hcm1 regulates chromosome segregation genes and fills the S-phase gap in the transcriptional circuitry of the cell cycle. Genes Dev (2006) 3.58

PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny. PLoS Comput Biol (2005) 3.35

A survey of DNA motif finding algorithms. BMC Bioinformatics (2007) 2.79

Practical strategies for discovering regulatory DNA sequence motifs. PLoS Comput Biol (2006) 2.53

Comparative genomic reconstruction of transcriptional regulatory networks in bacteria. Chem Rev (2007) 2.40

SwissRegulon: a database of genome-wide annotations of regulatory sites. Nucleic Acids Res (2006) 2.30

A survey of motif discovery methods in an integrated framework. Biol Direct (2006) 1.94

Computational identification of transcriptional regulatory elements in DNA sequence. Nucleic Acids Res (2006) 1.88

Genome-wide prediction of transcription factor binding sites using an integrated model. Genome Biol (2010) 1.84

Identifying the conserved network of cis-regulatory sites of a eukaryotic genome. Proc Natl Acad Sci U S A (2005) 1.72

iRegulon: from a gene list to a gene regulatory network using large motif and track collections. PLoS Comput Biol (2014) 1.69

A comparative analysis of genome-wide chromatin immunoprecipitation data for mammalian transcription factors. Nucleic Acids Res (2006) 1.67

The value of position-specific priors in motif discovery using MEME. BMC Bioinformatics (2010) 1.60

Automatic image analysis for gene expression patterns of fly embryos. BMC Cell Biol (2007) 1.47

Sampling motifs on phylogenetic trees. Proc Natl Acad Sci U S A (2005) 1.45

EVOPRINTER, a multigenomic comparative tool for rapid identification of functionally important DNA. Proc Natl Acad Sci U S A (2005) 1.38

Finding regulatory DNA motifs using alignment-free evolutionary conservation information. Nucleic Acids Res (2010) 1.33

Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data. BMC Bioinformatics (2006) 1.28

Mechanisms and evolution of control logic in prokaryotic transcriptional regulation. Microbiol Mol Biol Rev (2009) 1.25

Evaluation of phylogenetic footprint discovery for predicting bacterial cis-regulatory elements and revealing their evolution. BMC Bioinformatics (2008) 1.22

Finding regulatory elements and regulatory motifs: a general probabilistic framework. BMC Bioinformatics (2007) 1.22

Large-scale cis-element detection by analysis of correlated expression and sequence conservation between Arabidopsis and Brassica oleracea. Plant Physiol (2006) 1.19

MORPH: probabilistic alignment combined with hidden Markov models of cis-regulatory modules. PLoS Comput Biol (2007) 1.17

A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction. Bioinformatics (2007) 1.17

Universal patterns of purifying selection at noncoding positions in bacteria. Genome Res (2007) 1.11

Identification of novel regulatory modules in dicotyledonous plants using expression data and comparative genomics. Genome Biol (2006) 1.08

[Retracted] Performance evaluation of DNA motif discovery programs. Bioinformation (2008) 1.08

Motif discovery and transcription factor binding sites before and after the next-generation sequencing era. Brief Bioinform (2012) 1.05

Redundant ERF-VII Transcription Factors Bind to an Evolutionarily Conserved cis-Motif to Regulate Hypoxia-Responsive Gene Expression in Arabidopsis. Plant Cell (2015) 1.04

MicroFootPrinter: a tool for phylogenetic footprinting in prokaryotic genomes. Nucleic Acids Res (2006) 1.03

Cross-species de novo identification of cis-regulatory modules with GibbsModule: application to gene regulation in embryonic stem cells. Genome Res (2008) 1.02

Identifying regulatory elements in eukaryotic genomes. Brief Funct Genomic Proteomic (2009) 1.02

Reliable prediction of transcription factor binding sites by phylogenetic verification. Proc Natl Acad Sci U S A (2005) 1.02

WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences. BMC Bioinformatics (2007) 0.99

Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes. Nucleic Acids Res (2009) 0.98

STEME: efficient EM to find motifs in large data sets. Nucleic Acids Res (2011) 0.96

Unraveling networks of co-regulated genes on the sole basis of genome sequences. Nucleic Acids Res (2011) 0.96

Evolution and selection in yeast promoters: analyzing the combined effect of diverse transcription factor binding sites. PLoS Comput Biol (2008) 0.96

PhyloGibbs-MP: module prediction and discriminative motif-finding by Gibbs sampling. PLoS Comput Biol (2008) 0.95

Comparative analysis of regulatory motif discovery tools for transcription factor binding sites. Genomics Proteomics Bioinformatics (2007) 0.95

Variable structure motifs for transcription factor binding sites. BMC Genomics (2010) 0.95

EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences. BMC Bioinformatics (2006) 0.95

Modeling an evolutionary conserved circadian cis-element. PLoS Comput Biol (2008) 0.93

Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes. Nucleic Acids Res (2011) 0.93

Analysis of the SOS response of Vibrio and other bacteria with multiple chromosomes. BMC Genomics (2012) 0.92

On the value of intra-motif dependencies of human insulator protein CTCF. PLoS One (2014) 0.92

Identifications of conserved 7-mers in 3'-UTRs and microRNAs in Drosophila. BMC Bioinformatics (2007) 0.91

Systematic identification of conserved motif modules in the human genome. BMC Genomics (2010) 0.91

Combining comparative genomics with de novo motif discovery to identify human transcription factor DNA-binding motifs. BMC Bioinformatics (2006) 0.91

Integrating sequence, evolution and functional genomics in regulatory genomics. Genome Biol (2009) 0.90

GibbsST: a Gibbs sampling method for motif discovery with enhanced resistance to local optima. BMC Bioinformatics (2006) 0.90

Discovery of cis-elements between sorghum and rice using co-expression and evolutionary conservation. BMC Genomics (2009) 0.89

CSMET: comparative genomic motif detection via multi-resolution phylogenetic shadowing. PLoS Comput Biol (2008) 0.87

Sigma: multiple alignment of weakly-conserved non-coding DNA sequence. BMC Bioinformatics (2006) 0.86

Erasing errors due to alignment ambiguity when estimating positive selection. Mol Biol Evol (2014) 0.84

Systematic prediction of cis-regulatory elements in the Chlamydomonas reinhardtii genome using comparative genomics. Plant Physiol (2012) 0.84

BLISS: binding site level identification of shared signal-modules in DNA regulatory sequences. BMC Bioinformatics (2006) 0.83

CoMoDis: composite motif discovery in mammalian genomes. Nucleic Acids Res (2006) 0.83

A Monte Carlo-based framework enhances the discovery and interpretation of regulatory sequence motifs. BMC Bioinformatics (2012) 0.81

Strategies for reliable exploitation of evolutionary concepts in high throughput biology. Evol Bioinform Online (2008) 0.81

Phylogeny based discovery of regulatory elements. BMC Bioinformatics (2006) 0.81

DISCOVER: a feature-based discriminative method for motif search in complex genomes. Bioinformatics (2009) 0.81

iCR: a web tool to identify conserved targets of a regulatory protein across the multiple related prokaryotic species. Nucleic Acids Res (2006) 0.81

Integration of known transcription factor binding site information and gene expression data to advance from co-expression to co-regulation. Genomics Proteomics Bioinformatics (2007) 0.80

MotifClick: prediction of cis-regulatory binding sites via merging cliques. BMC Bioinformatics (2011) 0.80

New insights into the genetic regulation of Plasmodium falciparum obtained by Bayesian modeling. Gene Regul Syst Bio (2007) 0.80

Cell interactions and patterned intercalations shape and link epithelial tubes in C. elegans. PLoS Genet (2013) 0.79

BLISS 2.0: a web-based tool for predicting conserved regulatory modules in distantly-related orthologous sequences. Bioinformatics (2007) 0.79

The effect of orthology and coregulation on detecting regulatory motifs. PLoS One (2010) 0.78

MTAP: the motif tool assessment platform. BMC Bioinformatics (2008) 0.78

Web-based resources for comparative genomics. Hum Genomics (2005) 0.78

Inference of transcriptional regulation using gene expression data from the bovine and human genomes. BMC Genomics (2007) 0.78

Recent computational approaches to understand gene regulation: mining gene regulation in silico. Curr Genomics (2007) 0.76

GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge. Algorithms Mol Biol (2011) 0.76

TargetOrtho: a phylogenetic footprinting tool to identify transcription factor targets. Genetics (2014) 0.76

Occupancy classification of position weight matrix-inferred transcription factor binding sites. PLoS One (2011) 0.75

Systems analysis of cis-regulatory motifs in C4 photosynthesis genes using maize and rice leaf transcriptomic data during a process of de-etiolation. J Exp Bot (2016) 0.75

Combining phylogenetic footprinting with motif models incorporating intra-motif dependencies. BMC Bioinformatics (2017) 0.75

Articles cited by this

Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol (1981) 67.56

Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science (1993) 36.84

Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature (2003) 29.16

fastDNAmL: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihood. Comput Appl Biosci (1994) 24.63

Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res (2004) 24.52

LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res (2003) 23.03

Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nat Biotechnol (1998) 13.99

TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic Acids Res (1996) 13.50

Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J Mol Biol (1998) 8.34

Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence. Genome Biol (2002) 8.07

rVista for comparative sequence-based discovery of functional transcription factor binding sites. Genome Res (2002) 7.33

SCPD: a promoter database of the yeast Saccharomyces cerevisiae. Bioinformatics (1999) 6.79

Identification of consensus patterns in unaligned DNA sequences known to be functionally related. Comput Appl Biosci (1990) 6.16

Combining phylogenetic data with co-regulated genes to identify regulatory motifs. Bioinformatics (2003) 5.99

Regulatory sequence analysis tools. Nucleic Acids Res (2003) 5.72

Conservation of DNA regulatory motifs and discovery of new motifs in microbial genomes. Genome Res (2000) 4.34

Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome Res (2002) 4.25

Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. Genome Res (2001) 4.22

Computational detection of genomic cis-regulatory modules applied to body patterning in the early Drosophila embryo. BMC Bioinformatics (2002) 3.61

Gibbs Recursive Sampler: finding transcription factor binding sites. Nucleic Acids Res (2003) 3.53

Eukaryotic regulatory element conservation analysis and identification using comparative genomics. Genome Res (2004) 2.84

Prediction of transcription regulatory sites in Archaea by a comparative genomic approach. Nucleic Acids Res (2000) 2.80

Phylogenetic motif detection by expectation-maximization on evolutionary mixtures. Pac Symp Biocomput (2004) 2.17

Conservation of regulatory elements between two species of Drosophila. BMC Bioinformatics (2003) 2.06

Identification of a novel cis-regulatory element involved in the heat shock response in Caenorhabditis elegans using microarray gene expression and computational methods. Genome Res (2002) 1.87

Motif discovery in heterogeneous sequence data. Pac Symp Biocomput (2004) 1.60

Articles by these authors

Identification and characterization of multi-species conserved sequences. Genome Res (2003) 10.18

Rv3133c/dosR is a transcription factor that mediates the hypoxic response of Mycobacterium tuberculosis. Mol Microbiol (2003) 5.68

Global patterns of cis variation in human cells revealed by high-density allelic expression analysis. Nat Genet (2009) 4.72

Finding motifs using random projections. J Comput Biol (2002) 4.61

Systematic analysis of the protein interaction network for the human transcription machinery reveals the identity of the 7SK capping enzyme. Mol Cell (2007) 4.58

Variant histone H2A.Z is globally localized to the promoters of inactive yeast genes and regulates nucleosome positioning. PLoS Biol (2005) 4.48

Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome Res (2002) 4.25

YMF: A program for discovery of novel transcription factor binding sites by statistical overrepresentation. Nucleic Acids Res (2003) 3.71

Reconstructing contiguous regions of an ancestral genome. Genome Res (2006) 3.68

Identification of 22 candidate structured RNAs in bacteria using the CMfinder comparative genomics pipeline. Nucleic Acids Res (2007) 3.59

Discovery of novel transcription factor binding sites by statistical overrepresentation. Nucleic Acids Res (2002) 3.30

Genome-wide orchestration of cardiac functions by the orphan nuclear receptors ERRalpha and gamma. Cell Metab (2007) 3.00

FootPrinter: A program designed for phylogenetic footprinting. Nucleic Acids Res (2003) 2.82

Phylo: a citizen science approach for improving multiple sequence alignment. PLoS One (2012) 2.64

A probabilistic approach for SNP discovery in high-throughput human resequencing data. Genome Res (2009) 2.55

Into the heart of darkness: large-scale clustering of human non-coding DNA. Bioinformatics (2004) 2.50

Algorithms for phylogenetic footprinting. J Comput Biol (2002) 2.46

PReMod: a database of genome-wide mammalian cis-regulatory module predictions. Nucleic Acids Res (2006) 2.27

Evolutionarily conserved sequence elements that positively regulate IFN-gamma expression in T cells. Proc Natl Acad Sci U S A (2004) 2.21

The relationship between DNA methylation, genetic and expression inter-individual variation in untransformed human fibroblasts. Genome Biol (2014) 2.01

Three-dimensional modeling of chromatin structure from interaction frequency data using Markov chain Monte Carlo sampling. BMC Bioinformatics (2011) 1.78

The Capsella rubella genome and the genomic consequences of rapid mating system evolution. Nat Genet (2013) 1.69

An atlas of over 90,000 conserved noncoding sequences provides insight into crucifer regulatory regions. Nat Genet (2013) 1.68

Chromatin conformation signatures of cellular differentiation. Genome Biol (2009) 1.68

Discovery of regulatory elements in vertebrates through comparative genomics. Nat Biotechnol (2005) 1.65

A computational pipeline for high-throughput discovery of cis-regulatory noncoding RNA in prokaryotes. PLoS Comput Biol (2007) 1.53

Measuring the accuracy of genome-size multiple alignments. Genome Biol (2007) 1.50

BH3-ligand regulates access of MCL-1 to its E3 ligase. FEBS Lett (2005) 1.49

A newly uncovered group of distantly related lysine methyltransferases preferentially interact with molecular chaperones to regulate their activity. PLoS Genet (2013) 1.37

The three-dimensional architecture of Hox cluster silencing. Nucleic Acids Res (2010) 1.36

Exact and heuristic algorithms for the Indel Maximum Likelihood Problem. J Comput Biol (2007) 1.35

Analysis of computational approaches for motif discovery. Algorithms Mol Biol (2006) 1.34

High-resolution mapping of the protein interaction network for the human transcription machinery and affinity purification of RNA polymerase II-associated complexes. Methods (2009) 1.24

The protein interaction network of the human transcription machinery reveals a role for the conserved GTPase RPAP4/GPN1 and microtubule assembly in nuclear import and biogenesis of RNA polymerase II. Mol Cell Proteomics (2010) 1.23

On the inference of parsimonious indel evolutionary scenarios. J Bioinform Comput Biol (2006) 1.14

Seeder: discriminative seeding DNA motif discovery. Bioinformatics (2008) 1.10

Open-Phylo: a customizable crowd-computing platform for multiple sequence alignment. Genome Biol (2013) 1.06

Comparative assessment of methods for aligning multiple genome sequences. Nat Biotechnol (2010) 1.03

A flexible ancestral genome reconstruction method based on gapped adjacencies. BMC Bioinformatics (2012) 1.03

Modeling contaminants in AP-MS/MS experiments. J Proteome Res (2010) 1.03

MicroFootPrinter: a tool for phylogenetic footprinting in prokaryotic genomes. Nucleic Acids Res (2006) 1.03

Ancestors 1.0: a web server for ancestral sequence reconstruction. Bioinformatics (2009) 1.01

Computational analysis of whole-genome differential allelic expression data in human. PLoS Comput Biol (2010) 0.99

Meta-analysis of inter-species liver co-expression networks elucidates traits associated with common human diseases. PLoS Comput Biol (2009) 0.99

Identification of the Treponema pallidum subsp. pallidum TP0092 (RpoE) regulon and its implications for pathogen persistence in the host and syphilis pathogenesis. J Bacteriol (2012) 0.99

Detecting non-coding selective pressure in coding regions. BMC Evol Biol (2007) 0.98

Long-range regulation is a major driving force in maintaining genome integrity. BMC Evol Biol (2009) 0.94

Computational prediction of the localization of microRNAs within their pre-miRNA. Nucleic Acids Res (2013) 0.94

Statistics of local multiple alignments. Bioinformatics (2005) 0.93

Mis-translation of a computationally designed protein yields an exceptionally stable homodimer: implications for protein engineering and evolution. J Mol Biol (2006) 0.93

FootPrinter3: phylogenetic footprinting in partially alignable sequences. Nucleic Acids Res (2006) 0.92

How accurately is ncRNA aligned within whole-genome multiple alignments? BMC Bioinformatics (2007) 0.92

Computing chromosome conformation. Methods Mol Biol (2010) 0.90

Hox in motion: tracking HoxA cluster conformation during differentiation. Nucleic Acids Res (2013) 0.90

Combining computational prediction of cis-regulatory elements with a new enhancer assay to efficiently label neuronal structures in the medaka fish. PLoS One (2011) 0.90

Prediction of tissue-specific cis-regulatory modules using Bayesian networks and regression trees. BMC Bioinformatics (2007) 0.89

Predicting direct protein interactions from affinity purification mass spectrometry data. Algorithms Mol Biol (2010) 0.86

Quality control in manufacturing oligo arrays: a combinatorial design approach. J Comput Biol (2002) 0.86

Following the 'tracks': Tramtrack69 regulates epithelial tube expansion in the Drosophila ovary through Paxillin, Dynamin, and the homeobox protein Mirror. Dev Biol (2013) 0.85

Improving the prediction of mRNA extremities in the parasitic protozoan Leishmania. BMC Bioinformatics (2008) 0.85

Detection of locally over-represented GO terms in protein-protein interaction networks. J Comput Biol (2010) 0.85

Genome-wide mouse mutagenesis reveals CD45-mediated T cell function as critical in protective immunity to HSV-1. PLoS Pathog (2013) 0.84

Nuclear import of RNA polymerase II is coupled with nucleocytoplasmic shuttling of the RNA polymerase II-associated protein 2. Nucleic Acids Res (2013) 0.84

An N-ethyl-N-nitrosourea (ENU)-induced dominant negative mutation in the JAK3 kinase protects against cerebral malaria. PLoS One (2012) 0.83

Classifying leukemia types with chromatin conformation data. Genome Biol (2014) 0.83

TP0262 is a modulator of promoter activity of tpr Subfamily II genes of Treponema pallidum ssp. pallidum. Mol Microbiol (2009) 0.83

A whole genome study and identification of specific carcinogenic regions of the human papilloma viruses. J Comput Biol (2009) 0.82

Steps towards a repertoire of comprehensive maps of human protein interaction networks: the Human Proteotheque Initiative (HuPI). Biochem Cell Biol (2008) 0.81

Discovery of cell compartment specific protein-protein interactions using affinity purification combined with tandem mass spectrometry. J Proteome Res (2012) 0.80

Genetic map refinement using a comparative genomic approach. J Comput Biol (2009) 0.79

Assessing the discordance of multiple sequence alignments. IEEE/ACM Trans Comput Biol Bioinform (2009) 0.79

Mapping association between long-range cis-regulatory regions and their target genes using synteny. J Comput Biol (2011) 0.77

Predicting site-specific human selective pressure using evolutionary signatures. Bioinformatics (2011) 0.77

Algorithms for locating extremely conserved elements in multiple sequence alignments. BMC Bioinformatics (2009) 0.77

SPARCS: a web server to analyze (un)structured regions in coding RNA sequences. Nucleic Acids Res (2013) 0.76

Positional mapping and candidate gene analysis of the mouse Ccs3 locus that regulates differential susceptibility to carcinogen-induced colorectal cancer. PLoS One (2013) 0.76

A probabilistic model for sequence alignment with context-sensitive indels. J Comput Biol (2011) 0.76

StructMiner: a tool for alignment and detection of conserved secondary structure. Genome Inform (2004) 0.75

An approximation algorithm for the Noah's Ark problem with random feature loss. IEEE/ACM Trans Comput Biol Bioinform (2011) 0.75

Construction of optimal quality control for oligo arrays. Bioinformatics (2002) 0.75

Gene maps linearization using genomic rearrangement distances. J Comput Biol (2007) 0.75

A practical algorithm for estimation of the maximum likelihood ancestral reconstruction error. Pac Symp Biocomput (2010) 0.75