Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome.

PubWeight™: 5.49 | Rank: Top 1%

View Article (PMC 403796)

Published in Genome Res on December 01, 2003

Authors

Zhaolei Zhang (1), Paul M Harrison, Yin Liu, Mark Gerstein

Author Affiliations

1: Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520-8114, USA.

Articles citing this

(truncated to the top 100)

TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol (2013) 32.42

Fusion transcripts and transcribed retrotransposed loci discovered through comprehensive transcriptome analysis using Paired-End diTags (PETs). Genome Res (2007) 4.99

A Trim5-cyclophilin A fusion protein found in owl monkey kidney cells can restrict HIV-1. Proc Natl Acad Sci U S A (2004) 4.06

Pseudogenes in the ENCODE regions: consensus annotation, analysis of transcription, and evolution. Genome Res (2007) 3.82

Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation. Nucleic Acids Res (2006) 3.62

Active human retrotransposons: variation and disease. Curr Opin Genet Dev (2012) 3.60

A neutral model of transcriptome evolution. PLoS Biol (2004) 3.44

Comparative genomics and molecular dynamics of DNA repeats in eukaryotes. Microbiol Mol Biol Rev (2008) 3.35

Transcription-mediated gene fusion in the human genome. Genome Res (2005) 3.19

Gene losses during human origins. PLoS Biol (2006) 3.17

Emergence of young human genes after a burst of retroposition in primates. PLoS Biol (2005) 3.08

Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human. PLoS Comput Biol (2006) 3.07

RNA-based gene duplication: mechanistic and evolutionary insights. Nat Rev Genet (2009) 2.91

Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability. Nucleic Acids Res (2005) 2.89

An ORFeome-based analysis of human transcription factor genes and the construction of a microarray to interrogate their expression. Genome Res (2004) 2.77

LINE-1 elements in structural variation and disease. Annu Rev Genomics Hum Genet (2011) 2.42

A 3'-untranslated region (3'UTR) induces organ adhesion by regulating miR-199a* functions. PLoS One (2009) 2.24

Identification and analysis of unitary pseudogenes: historic and contemporary gene losses in humans and other primates. Genome Biol (2010) 2.12

A systematic analysis of LINE-1 endonuclease-dependent retrotranspositional events causing human genetic disease. Hum Genet (2005) 2.10

High rate of chimeric gene origination by retroposition in plant genomes. Plant Cell (2006) 2.08

Comparative genomics search for losses of long-established genes on the human lineage. PLoS Comput Biol (2007) 2.07

Pseudogenes: pseudo-functional or key regulators in health and disease? RNA (2011) 2.00

Retrocopy contributions to the evolution of the human genome. BMC Genomics (2008) 1.95

Comparative genomics reveals a constant rate of origination and convergent acquisition of functional retrogenes in Drosophila. Genome Biol (2007) 1.94

LINE-1 ORF1 protein localizes in stress granules with other RNA-binding proteins, including components of RNA interference RNA-induced silencing complex. Mol Cell Biol (2007) 1.79

A computational approach for identifying pseudogenes in the ENCODE regions. Genome Biol (2006) 1.74

Comparative analysis of processed ribosomal protein pseudogenes in four mammalian genomes. Genome Biol (2009) 1.71

Characterization of intron loss events in mammals. Genome Res (2006) 1.56

The evolutionary fate of MULE-mediated duplications of host gene fragments in rice. Genome Res (2005) 1.54

Selection of reference genes for quantitative real-time PCR in a rat asphyxial cardiac arrest model. BMC Mol Biol (2008) 1.39

HOPPSIGEN: a database of human and mouse processed pseudogenes. Nucleic Acids Res (2005) 1.38

Genome-wide survey for biologically functional pseudogenes. PLoS Comput Biol (2006) 1.38

Mapping the LINE1 ORF1 protein interactome reveals associated inhibitors of human retrotransposition. Nucleic Acids Res (2013) 1.31

The human ABC transporter pseudogene family: Evidence for transcription and gene-pseudogene interference. BMC Genomics (2008) 1.31

The functional role of pack-MULEs in rice inferred from purifying selection and expression profile. Plant Cell (2009) 1.27

Assessing the genomic evidence for conserved transcribed pseudogenes under selection. BMC Genomics (2009) 1.26

Role of positive selection in the retention of duplicate genes in mammalian genomes. Proc Natl Acad Sci U S A (2006) 1.24

A 3' Poly(A) Tract Is Required for LINE-1 Retrotransposition. Mol Cell (2015) 1.23

Genome-scale analysis of positionally relocated genes. Genome Res (2007) 1.21

SVA retrotransposons: Evolution and genetic instability. Semin Cancer Biol (2010) 1.20

Iterative gene prediction and pseudogene removal improves genome annotation. Genome Res (2006) 1.20

Retrotransposition of gene transcripts leads to structural variation in mammalian genomes. Genome Biol (2013) 1.20

Evolutionary and expression signatures of pseudogenes in Arabidopsis and rice. Plant Physiol (2009) 1.18

Analysis of the role of retrotransposition in gene evolution in vertebrates. BMC Bioinformatics (2007) 1.17

LongSAGE profiling of nine human embryonic stem cell lines. Genome Biol (2007) 1.15

Evolutionary dynamics of the interferon-induced transmembrane gene family in vertebrates. PLoS One (2012) 1.13

Discrete subcellular partitioning of human retrotransposon RNAs despite a common mechanism of genome insertion. Hum Mol Genet (2010) 1.13

A third broad lineage of major histocompatibility complex (MHC) class I in teleost fish; MHC class II linkage and processed genes. Immunogenetics (2007) 1.13

Analysis of variable retroduplications in human populations suggests coupling of retrotransposition to cell division. Genome Res (2013) 1.13

Computational identification of 69 retroposons in Arabidopsis. Plant Physiol (2005) 1.12

Genome-wide analyses of retrogenes derived from the human box H/ACA snoRNAs. Nucleic Acids Res (2006) 1.11

Comprehensive analysis of the pseudogenes of glycolytic enzymes in vertebrates: the anomalously high number of GAPDH pseudogenes highlights a recent burst of retrotranspositional activity. BMC Genomics (2009) 1.10

Retroposition of processed pseudogenes: the impact of RNA stability and translational control. Trends Genet (2005) 1.10

Small RNAs originated from pseudogenes: cis- or trans-acting? PLoS Comput Biol (2009) 1.05

CopywriteR: DNA copy number detection from off-target sequence data. Genome Biol (2015) 1.05

Genotyping of TRIM5 locus in northern pig-tailed macaques (Macaca leonina), a primate species susceptible to Human Immunodeficiency Virus type 1 infection. Retrovirology (2009) 1.03

HMGA1 pseudogenes as candidate proto-oncogenic competitive endogenous RNAs. Oncotarget (2014) 1.02

Structural divergence between the human and chimpanzee genomes. Hum Genet (2006) 1.01

Molecular characterization of the immune system: emergence of proteins, processes, and domains. Immunogenetics (2007) 1.01

Statistical signals in bioinformatics. Proc Natl Acad Sci U S A (2005) 1.01

Retrotransposition as a source of new promoters. Mol Biol Evol (2008) 1.00

"Orphan" retrogenes in the human genome. Mol Biol Evol (2012) 1.00

Genomic landscape of developing male germ cells. Birth Defects Res C Embryo Today (2009) 1.00

Retrosequence formation restructures the yeast genome. Genes Dev (2007) 1.00

Transposable element detection from whole genome sequence data. Mob DNA (2015) 0.99

Genomic fossils as a snapshot of the human transcriptome. Proc Natl Acad Sci U S A (2006) 0.99

Identification and characterization of pseudogenes in the rice gene complement. BMC Genomics (2009) 0.99

LinkHub: a Semantic Web system that facilitates cross-database queries and information retrieval in proteomics. BMC Bioinformatics (2007) 0.99

Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes. Chromosome Res (2011) 0.96

The Influence of LINE-1 and SINE Retrotransposons on Mammalian Genomes. Microbiol Spectr (2015) 0.96

Global survey of chromatin accessibility using DNA microarrays. Genome Res (2004) 0.95

Duplication and relocation of the functional DPY19L2 gene within low copy repeats. BMC Genomics (2006) 0.95

Chance favors a prepared genome. Proc Natl Acad Sci U S A (2008) 0.95

Burst of young retrogenes and independent retrogene formation in mammals. PLoS One (2009) 0.95

Sex chromosome-to-autosome transposition events counter Y-chromosome gene loss in mammals. Genome Biol (2015) 0.93

LINEs, SINEs and other retroelements: do birds of a feather flock together? Front Biosci (Landmark Ed) (2012) 0.93

Roles for retrotransposon insertions in human disease. Mob DNA (2016) 0.92

Evolution of a major drug metabolizing enzyme defect in the domestic cat and other felidae: phylogenetic timing and the role of hypercarnivory. PLoS One (2011) 0.92

Processed pseudogenes, processed genes, and spontaneous mutations in the Arabidopsis genome. J Mol Evol (2006) 0.91

Human-specific nonsense mutations identified by genome sequence comparisons. Hum Genet (2006) 0.91

Systematic identification of pseudogenes through whole genome expression evidence profiling. Nucleic Acids Res (2006) 0.90

Origin and evolution of processed pseudogenes that stabilize functional Makorin1 mRNAs in mice, primates and other mammals. Genetics (2006) 0.90

Identification and analysis of genes and pseudogenes within duplicated regions in the human and mouse genomes. PLoS Comput Biol (2006) 0.90

Disruption of a spermatogenic cell-specific mouse enolase 4 (eno4) gene causes sperm structural defects and male infertility. Biol Reprod (2013) 0.89

The mitochondrial genome of the lycophyte Huperzia squarrosa: the most archaic form in vascular plants. PLoS One (2012) 0.88

A LINE-1 component to human aging: do LINE elements exact a longevity cost for evolutionary advantage? Mech Ageing Dev (2010) 0.88

piRNAs derived from ancient viral processed pseudogenes as transgenerational sequence-specific immune memory in mammals. RNA (2015) 0.88

Characterization of human pseudogene-derived non-coding RNAs for functional potential. PLoS One (2014) 0.87

PseudoGeneQuest - service for identification of different pseudogene types in the human genome. BMC Bioinformatics (2008) 0.86

Detecting transcription of ribosomal protein pseudogenes in diverse human tissues from RNA-seq data. BMC Genomics (2012) 0.86

NANOGP8: evolution of a human-specific retro-oncogene. G3 (Bethesda) (2012) 0.86

GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference owing to RNA-seq reads misalignment. Bioinformatics (2013) 0.85

Expression of type II chorionic gonadotropin genes supports a role in the male reproductive system. Mol Cell Biol (2010) 0.85

The life history of retrocopies illuminates the evolution of new mammalian genes. Genome Res (2016) 0.84

Non-LTR retrotransposons and microsatellites: Partners in genomic variation. Mob Genet Elements (2013) 0.84

Birth, decay, and reconstruction of an ancient TRIMCyp gene fusion in primate genomes. Proc Natl Acad Sci U S A (2013) 0.84

The roles and evolutionary patterns of intronless genes in deuterostomes. Comp Funct Genomics (2011) 0.84

Frequent and recent retrotransposition of orthologous genes plays a role in the evolution of sperm glycolytic enzymes. BMC Genomics (2010) 0.83

Identification and analysis of ancestral hominoid transcriptome inferred from cross-species transcript and processed pseudogene comparisons. Genome Res (2008) 0.83

Frame disruptions in human mRNA transcripts, and their relationship with splicing and protein structures. BMC Genomics (2007) 0.83

Articles cited by this

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res (1994) 392.47

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet (2000) 336.52

Initial sequencing and analysis of the human genome. Nature (2001) 212.86

Identification of common molecular subsequences. J Mol Biol (1981) 130.53

A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol (1980) 113.75

The sequence of the human genome. Science (2001) 101.55

Initial sequencing and comparative analysis of the mouse genome. Nature (2002) 96.15

Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature (2000) 70.33

The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res (2000) 67.44

PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci (1997) 45.07

A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature (2001) 42.18

The Ensembl genome database project. Nucleic Acids Res (2002) 40.87

Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol (1986) 36.06

Identification of a candidate tumour suppressor gene, MMAC1, at chromosome 10q23.3 that is mutated in multiple advanced cancers. Nat Genet (1997) 15.38

Genome sequence of Yersinia pestis, the causative agent of plague. Nature (2001) 14.36

Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol Biol Evol (2000) 13.01

The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature (1998) 12.53

Massive gene decay in the leprosy bacillus. Nature (2001) 11.98

Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition. Cell (1996) 8.21

Evidence for higher rates of nucleotide substitution in rodents than in man. Proc Natl Acad Sci U S A (1985) 8.18

Comparison of DNA sequences with protein sequences. Genomics (1997) 7.76

Computational inference of homologous gene structures in the human genome. Genome Res (2001) 6.96

Human LINE retrotransposons generate processed pseudogenes. Nat Genet (2000) 6.64

Mining the draft human genome. Nature (2001) 5.81

High intrinsic rate of DNA loss in Drosophila. Nature (1996) 5.51

Mechanisms of evolution in Rickettsia conorii and R. prowazekii. Science (2001) 5.32

The complete human olfactory subgenome. Genome Res (2001) 5.10

Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons. Proc Natl Acad Sci U S A (1997) 4.80

Post-transcriptional regulation of glyceraldehyde-3-phosphate-dehydrogenase gene expression in rat tissues. Nucleic Acids Res (1984) 4.79

The Ka/Ks ratio: diagnosing the form of sequence evolution. Trends Genet (2002) 4.46

Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila. Science (1993) 4.43

Processed pseudogenes: characteristics and evolution. Annu Rev Genet (1985) 4.40

Isochores and the evolutionary genomics of vertebrates. Gene (2000) 4.13

Evolutionary analyses of the human genome. Nature (2001) 4.13

Vertebrate pseudogenes. FEBS Lett (2000) 3.93

Intron-exon structures of eukaryotic model organisms. Nucleic Acids Res (1999) 3.84

Structure and evolution of mammalian ribosomal proteins. Biochem Cell Biol (1996) 3.56

The distribution of genes in the human genome. Gene (1991) 2.99

Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22. Genome Res (2002) 2.89

Identification and analysis of over 2000 ribosomal protein pseudogenes in the human genome. Genome Res (2002) 2.88

Patterns of nucleotide substitution, insertion and deletion in the human genome inferred from pseudogenes. Nucleic Acids Res (2003) 2.85

The large srh family of chemoreceptor genes in Caenorhabditis nematodes reveals processes of genome evolution involving large duplications and deletions and intron gains and losses. Genome Res (2000) 2.51

Mitochondrial DNA: molecular fossils in the nucleus. Curr Biol (1996) 2.34

Genome-scale compositional comparisons in eukaryotes. Genome Res (2001) 2.32

Nature and structure of human genes that generate retropseudogenes. Genome Res (2000) 2.21

mRNA retroposition in human cells: processed pseudogene formation. EMBO J (1995) 2.15

A map of 75 human ribosomal protein genes. Genome Res (1998) 2.15

Studying genomes through the aeons: protein families, pseudogenes and proteome evolution. J Mol Biol (2002) 2.07

A question of size: the eukaryotic proteome and the problems in defining it. Nucleic Acids Res (2002) 2.07

The human ribosomal protein genes: sequencing and comparative analysis of 73 genes. Genome Res (2002) 2.03

Cell type-specific expression of hnRNP proteins. Exp Cell Res (1995) 1.95

Pattern of organization of human mitochondrial pseudogenes in the nuclear genome. Genome Res (2002) 1.87

Digging for dead genes: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome. Nucleic Acids Res (2001) 1.85

A complete map of the human ribosomal protein genes: assignment of 80 genes to the cytogenetic map and implications for human disorders. Genomics (2001) 1.83

An approach to the organization of eukaryotic genomes at a macromolecular level. J Mol Biol (1976) 1.83

Deletions in processed pseudogenes accumulate faster in rodents than in humans. J Mol Evol (1989) 1.73

Transcriptional analysis of the PTEN/MMAC1 pseudogene, psiPTEN. Oncogene (1999) 1.72

Evidence suggesting that a fifth of annotated Caenorhabditis elegans genes may be pseudogenes. Genome Res (2002) 1.67

A small reservoir of disabled ORFs in the yeast genome and its implications for the dynamics of proteome evolution. J Mol Biol (2002) 1.62

The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties. Genome Biol (2002) 1.59

Genomic gigantism: DNA loss is slow in mountain grasshoppers. Mol Biol Evol (2001) 1.56

Misunderstandings about isochores. Part 1. Gene (2001) 1.48

Identification of pseudogenes in the Drosophila melanogaster genome. Nucleic Acids Res (2003) 1.48

Identification of antisense RNA transcripts from a human DNA topoisomerase I pseudogene. Cancer Res (1992) 1.43

Similar integration but different stability of Alus and LINEs in the human genome. Gene (2001) 1.42

The human L-threonine 3-dehydrogenase gene is an expressed pseudogene. BMC Genet (2002) 1.37

Low specificity of cytokeratin 19 reverse transcriptase-polymerase chain reaction analyses for detection of hematogenous lung cancer dissemination. J Clin Oncol (1995) 1.36

Pseudogene evolution and natural selection for a compact genome. J Hered (2000) 1.33

Identification of a novel cytokeratin 19 pseudogene that may interfere with reverse transcriptase-polymerase chain reaction assays used to detect micrometastatic tumor cells. Int J Cancer (1999) 1.30

Human glyceraldehyde 3-phosphate dehydrogenase-2 gene is expressed specifically in spermatogenic cells. J Androl (2000) 1.28

Abundant adrenal-specific transcription of the human P450c21A "pseudogene". J Biol Chem (1993) 1.28

Structure and chromosomal distribution of human mitochondrial pseudogenes. Genomics (2002) 1.27

Frequent assimilation of mitochondrial DNA by grasshopper nuclear genomes. Mol Biol Evol (2000) 1.24

Antisense transcription of a murine FGFR-3 pseudogene during fetal development. Gene (1997) 1.16

Pseudogenes. CRC Crit Rev Biochem (1986) 1.14

The human genome has 49 cytochrome c pseudogenes, including a relic of a primordial gene that still functions in mouse. Gene (2003) 1.12

mRNA-specific reverse transcription-polymerase chain reaction from human tissue extracts. Anal Biochem (2002) 1.10

A systematic investigation identifies a significant number of probable pseudogenes in the Escherichia coli genome. Gene (2002) 1.04

Identification and characterization of over 100 mitochondrial ribosomal protein pseudogenes in the human genome. Genomics (2003) 1.00

The human somatic cytochrome c gene: two classes of processed pseudogenes demarcate a period of rapid molecular evolution. Proc Natl Acad Sci U S A (1988) 0.96

Differential expression of the murine histone genes H3.3A and H3.3B. Differentiation (1997) 0.92

Concerted evolution in the GAPDH family of retrotransposed pseudogenes. Mamm Genome (1993) 0.92

Sensitivity of immunohistochemistry and polymerase chain reaction in detecting prostate cancer cells in bone marrow. J Histochem Cytochem (1994) 0.91

The human ortholog of rhesus mannose-binding protein-A gene is an expressed pseudogene that localizes to chromosome 10. Mamm Genome (1998) 0.91

Cloning, expression and nuclear localization of human NPM3, a member of the nucleophosmin/nucleoplasmin family of nuclear chaperones. BMC Genomics (2001) 0.88

Expression of prohibitin in rat seminiferous epithelium. Biol Reprod (1993) 0.86

Human myosin XVBP is a transcribed pseudogene. J Muscle Res Cell Motil (2001) 0.85

Cyclophilin A is present in rat germ cells and is associated with spermatocyte apoptosis. Reproductive Toxicology Group. Biol Reprod (1997) 0.82

Members of the human glyceraldehyde-3-phosphate dehydrogenase-related gene family map to dispersed chromosomal locations. Genomics (1989) 0.82

Cyclophilin A, the major intracellular receptor for the immunosuppressant cyclosporin A, maps to chromosome 7p11.2-p13: four pseudogenes map to chromosomes 3, 10, 14, and 18. Genomics (1995) 0.82

Articles by these authors

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature (2007) 75.09

RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet (2009) 58.77

The transcriptional landscape of the yeast genome defined by RNA sequencing. Science (2008) 48.99

Functional profiling of the Saccharomyces cerevisiae genome. Nature (2002) 36.10

Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature (2006) 24.29

Landscape of transcription in human cells. Nature (2012) 20.18

GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res (2012) 19.19

Global identification of human transcribed sequences with genome tiling arrays. Science (2004) 17.85

A map of the interactome network of the metazoan C. elegans. Science (2004) 15.60

Personal omics profiling reveals dynamic molecular and medical phenotypes. Cell (2012) 12.32

A Bayesian networks approach for predicting protein-protein interactions from genomic data. Science (2003) 12.07

Genomic analysis of regulatory network dynamics reveals large topological changes. Nature (2004) 9.32

ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res (2012) 9.13

Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell (2012) 8.41

The minimum information required for reporting a molecular interaction experiment (MIMIx). Nat Biotechnol (2007) 8.24

Subcellular localization of the yeast proteome. Genes Dev (2002) 7.93

Global analysis of protein phosphorylation in yeast. Nature (2005) 7.46

Divergence of transcription factor binding sites across related yeast species. Science (2007) 7.10

Comparing protein abundance and mRNA expression levels on a genomic scale. Genome Biol (2003) 6.98

CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res (2011) 6.97

The transcriptional activity of human Chromosome 22. Genes Dev (2003) 6.82

Biochemical and genetic analysis of the yeast proteome with a movable ORF collection. Genes Dev (2005) 6.14

Relating whole-genome expression data with protein-protein interactions. Genome Res (2002) 5.78

The importance of bottlenecks in protein networks: correlation with gene essentiality and expression dynamics. PLoS Comput Biol (2007) 5.63

Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. Genome Res (2004) 5.26

A cis-regulatory map of the Drosophila genome. Nature (2011) 4.80

AlleleSeq: analysis of allele-specific expression and binding in a network framework. Mol Syst Biol (2011) 4.71

New insights into Acinetobacter baumannii pathogenesis revealed by high-density pyrosequencing and transposon mutagenesis. Genes Dev (2007) 4.62

Structure and evolution of transcriptional regulatory networks. Curr Opin Struct Biol (2004) 4.48

Systematic evaluation of variability in ChIP-chip experiments using predefined DNA targets. Genome Res (2008) 4.08

Distribution of NF-kappaB-binding sites across human chromosome 22. Proc Natl Acad Sci U S A (2003) 3.89

An integrated approach for finding overlooked genes in yeast. Nat Biotechnol (2002) 3.88

Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation. Nucleic Acids Res (2006) 3.62

Classification of human genomic regions based on experimentally determined binding sites of more than 100 transcription-related factors. Genome Biol (2012) 3.61

Genomic analysis of essentiality within protein networks. Trends Genet (2004) 3.52

Genomic analysis of the hierarchical structure of regulatory networks. Proc Natl Acad Sci U S A (2006) 3.51

Modeling ChIP sequencing in silico with applications. PLoS Comput Biol (2008) 3.48

GATA-1 binding sites mapped in the beta-globin locus by using mammalian chIp-chip analysis. Proc Natl Acad Sci U S A (2002) 3.41

Structured digital abstract makes text mining easy. Nature (2007) 3.38

Defining functional DNA elements in the human genome. Proc Natl Acad Sci U S A (2014) 3.35

Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome. Genome Biol (2008) 3.34

Dynamic transcriptomes during neural differentiation of human embryonic stem cells revealed by short, long, and paired-end sequencing. Proc Natl Acad Sci U S A (2010) 3.28

Complex transcriptional circuitry at the G1/S transition in Saccharomyces cerevisiae. Genes Dev (2002) 3.27

Multi-species microarrays reveal the effect of sequence divergence on gene expression profiles. Genome Res (2005) 3.23

Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing. BMC Genomics (2009) 3.22

The Centers for Mendelian Genomics: a new large-scale initiative to identify the genes underlying rare Mendelian conditions. Am J Med Genet A (2012) 3.12

Somatic copy number mosaicism in human skin revealed by induced pluripotent stem cells. Nature (2012) 3.09

Getting connected: analysis and principles of biological networks. Genes Dev (2007) 3.03

Comprehensive Molecular Characterization of Papillary Renal-Cell Carcinoma. N Engl J Med (2015) 3.00

Diverse cellular functions of the Hsp90 molecular chaperone uncovered using systems approaches. Cell (2007) 2.95

Analyzing protein function on a genomic scale: the importance of gold-standard positives and negatives for network prediction. Curr Opin Microbiol (2004) 2.92

Close association of RNA polymerase II and many transcription factors with Pol III genes. Proc Natl Acad Sci U S A (2010) 2.90

Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability. Nucleic Acids Res (2005) 2.89

PubNet: a flexible system for visualizing literature derived networks. Genome Biol (2005) 2.89

Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22. Genome Res (2002) 2.89

Identification and analysis of over 2000 ribosomal protein pseudogenes in the human genome. Genome Res (2002) 2.88

PseudoPipe: an automated pseudogene identification pipeline. Bioinformatics (2006) 2.85

Patterns of nucleotide substitution, insertion and deletion in the human genome inferred from pseudogenes. Nucleic Acids Res (2003) 2.85

Mapping accessible chromatin regions using Sono-Seq. Proc Natl Acad Sci U S A (2009) 2.83

The temporal patterning microRNA let-7 regulates several transcription factors at the larval to adult transition in C. elegans. Dev Cell (2005) 2.81

Modeling gene expression using chromatin features in various cellular contexts. Genome Biol (2012) 2.76

Genomics. Defining genes in the genomics era. Science (2003) 2.73

Normal mode analysis of macromolecular motions in a database framework: developing mode concentration as a useful classifying statistic. Proteins (2002) 2.72

MAPK target networks in Arabidopsis thaliana revealed using functional protein microarrays. Genes Dev (2008) 2.66

DNA replication-timing analysis of human chromosome 22 at high resolution and different developmental states. Proc Natl Acad Sci U S A (2004) 2.66

Understanding transcriptional regulation by integrative analysis of transcription factor binding data. Genome Res (2012) 2.66

Major molecular differences between mammalian sexes are involved in drug metabolism and renal function. Dev Cell (2004) 2.65

Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet (2002) 2.65

HLA class II polymorphisms associated with the physiologic characteristics defined by Traditional Chinese Medicine: linking modern genetics with an ancient medicine. J Altern Complement Med (2007) 2.61

Robotic cloning and Protein Production Platform of the Northeast Structural Genomics Consortium. Methods Enzymol (2005) 2.54

Comparative analysis of processed pseudogenes in the mouse and human genomes. Trends Genet (2004) 2.51

Tilescope: online analysis pipeline for high-density tiling microarray data. Genome Biol (2007) 2.48

Genome-wide identification of binding sites defines distinct functions for Caenorhabditis elegans PHA-4/FOXA in development and environmental response. PLoS Genet (2010) 2.46

TOS9 regulates white-opaque switching in Candida albicans. Eukaryot Cell (2006) 2.42

The ribosomal protein genes and Minute loci of Drosophila melanogaster. Genome Biol (2007) 2.40

Target hub proteins serve as master regulators of development in yeast. Genes Dev (2006) 2.39

Large-scale analysis of pseudogenes in the human genome. Curr Opin Genet Dev (2004) 2.38

Genomic analysis of gene expression relationships in transcriptional regulatory networks. Trends Genet (2003) 2.38

Personal genome sequencing: current approaches and challenges. Genes Dev (2010) 2.38

Conformational changes associated with protein-protein interactions. Curr Opin Struct Biol (2004) 2.37

Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping. Trends Genet (2005) 2.36

Assessing the limits of genomic data integration for predicting protein networks. Genome Res (2005) 2.33

Integrated pseudogene annotation for human chromosome 22: evidence for transcription. J Mol Biol (2005) 2.32

The protein target list of the Northeast Structural Genomics Consortium. Proteins (2004) 2.31

Diverse transcription factor binding features revealed by genome-wide ChIP-seq in C. elegans. Genome Res (2010) 2.30

CREB binds to multiple loci on human chromosome 22. Mol Cell Biol (2004) 2.28

RSEQtools: a modular framework to analyze RNA-Seq data using compact, anonymized data summaries. Bioinformatics (2010) 2.26

Spectral biclustering of microarray data: coclustering genes and conditions. Genome Res (2003) 2.25

3V: cavity, channel and cleft volume calculator and extractor. Nucleic Acids Res (2010) 2.20

Regulation of gene expression by a metabolic enzyme. Science (2004) 2.15

Differential binding of calmodulin-related proteins to their targets revealed through high-density Arabidopsis protein microarrays. Proc Natl Acad Sci U S A (2007) 2.15

Open access: taking full advantage of the content. PLoS Comput Biol (2008) 2.14

Information assessment on predicting protein-protein interactions. BMC Bioinformatics (2004) 2.14

Identification and analysis of unitary pseudogenes: historic and contemporary gene losses in humans and other primates. Genome Biol (2010) 2.12

PSORS2 is due to mutations in CARD14. Am J Hum Genet (2012) 2.10

Defining the TRiC/CCT interactome links chaperonin function to stabilization of newly made proteins with complex topologies. Nat Struct Mol Biol (2008) 2.07

Studying genomes through the aeons: protein families, pseudogenes and proteome evolution. J Mol Biol (2002) 2.07