Building the sequence map of the human pan-genome.

PubWeight™: 5.53‹?› | Rank: Top 1%

🔗 View Article (PMID 19997067)

Published in Nat Biotechnol on December 07, 2009

Authors

Ruiqiang Li1, Yingrui Li, Hancheng Zheng, Ruibang Luo, Hongmei Zhu, Qibin Li, Wubin Qian, Yuanyuan Ren, Geng Tian, Jinxiang Li, Guangyu Zhou, Xuan Zhu, Honglong Wu, Junjie Qin, Xin Jin, Dongfang Li, Hongzhi Cao, Xueda Hu, Hélène Blanche, Howard Cann, Xiuqing Zhang, Songgang Li, Lars Bolund, Karsten Kristiansen, Huanming Yang, Jun Wang, Jian Wang

Author Affiliations

1: BGI-Shenzhen, Shenzhen 518083, China.

Articles citing this

De novo assembly of human genomes with massively parallel short read sequencing. Genome Res (2009) 45.91

A survey of sequence alignment algorithms for next-generation sequencing. Brief Bioinform (2010) 18.05

Personal omics profiling reveals dynamic molecular and medical phenotypes. Cell (2012) 12.32

Limitations of next-generation genome sequence assembly. Nat Methods (2010) 9.04

Assembly algorithms for next-generation sequencing data. Genomics (2010) 8.56

Characterization of missing human genome sequences and copy-number polymorphic insertions. Nat Methods (2010) 5.44

Haplotype-resolved genome sequencing of a Gujarati Indian individual. Nat Biotechnol (2010) 5.32

Modernizing reference genome assemblies. PLoS Biol (2011) 5.23

Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nat Genet (2010) 5.20

Fast identification and removal of sequence contamination from genomic and metagenomic datasets. PLoS One (2011) 3.74

PathSeq: software to identify or discover microbes by deep sequencing of human tissue. Nat Biotechnol (2011) 3.57

iPath2.0: interactive pathway explorer. Nucleic Acids Res (2011) 2.50

Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly. Nat Biotechnol (2011) 2.45

Annotating non-coding regions of the genome. Nat Rev Genet (2010) 2.38

Detection and characterization of novel sequence insertions using paired-end next-generation sequencing. Bioinformatics (2010) 2.23

Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing. Nat Genet (2010) 2.08

Detecting false-positive signals in exome sequencing. Hum Mutat (2012) 1.87

Between a chicken and a grape: estimating the number of human genes. Genome Biol (2010) 1.72

Maize (Zea mays L.) genome diversity as revealed by RNA-sequencing. PLoS One (2012) 1.57

State of the art de novo assembly of human genomes from massively parallel sequencing data. Hum Genomics (2010) 1.51

Population-genetic properties of differentiated human copy-number polymorphisms. Am J Hum Genet (2011) 1.50

Single haplotype assembly of the human genome from a hydatidiform mole. Genome Res (2014) 1.44

Using population admixture to help complete maps of the human genome. Nat Genet (2013) 1.35

Genetic variation and the de novo assembly of human genomes. Nat Rev Genet (2015) 1.24

De novo transcriptome characterization of Vitis vinifera cv. Corvina unveils varietal diversity. BMC Genomics (2013) 1.13

Targeted assembly of short sequence reads. PLoS One (2011) 1.12

Low frequency variants, collapsed based on biological knowledge, uncover complexity of population stratification in 1000 genomes project data. PLoS Genet (2013) 1.12

Next-generation sequencing of experimental mouse strains. Mamm Genome (2012) 1.10

Novel variation and de novo mutation rates in population-wide de novo assembled Danish trios. Nat Commun (2015) 1.09

Deconstructing Mus gemischus: advances in understanding ancestry, structure, and variation in the genome of the laboratory mouse. Mamm Genome (2012) 1.08

The high polyphenol content of grapevine cultivar tannat berries is conferred primarily by genes that are not shared with the reference genome. Plant Cell (2013) 1.04

Limitations of the human reference genome for personalized genomics. PLoS One (2012) 1.04

cnvHiTSeq: integrative models for high-resolution copy number variation detection and genotyping using population sequencing data. Genome Biol (2012) 0.98

De novo assembly of a haplotype-resolved human genome. Nat Biotechnol (2015) 0.98

Chinese institute makes bold sequencing play. Nat Biotechnol (2010) 0.98

Human genetics and genomics a decade after the release of the draft sequence of the human genome. Hum Genomics (2011) 0.95

Long-read sequencing and de novo assembly of a Chinese genome. Nat Commun (2016) 0.93

Cost-effective prediction of gender-labeling errors and estimation of gender-labeling error rates in candidate-gene association studies. Front Genet (2011) 0.92

Exploring the rice dispensable genome using a metagenome-like assembly strategy. Genome Biol (2015) 0.91

Design and Implementation of the International Genetics and Translational Research in Transplantation Network. Transplantation (2015) 0.90

Chinese bioscience: The sequence factory. Nature (2010) 0.87

Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res (2017) 0.86

Assembly of non-unique insertion content using next-generation sequencing. BMC Bioinformatics (2011) 0.86

The human transcriptome: an unfinished story. Genes (Basel) (2012) 0.86

Discovery of common sequences absent in the human reference genome using pooled samples from next generation sequencing. BMC Genomics (2014) 0.83

Revealing the missing expressed genes beyond the human reference genome by RNA-Seq. BMC Genomics (2011) 0.83

Incorporating the human gene annotations in different databases significantly improved transcriptomic and genetic analyses. RNA (2013) 0.82

VCGDB: a dynamic genome database of the Chinese population. BMC Genomics (2014) 0.82

Paired-end sequencing of long-range DNA fragments for de novo assembly of large, complex Mammalian genomes by direct intra-molecule ligation. PLoS One (2012) 0.82

The genome of a Mongolian individual reveals the genetic imprints of Mongolians on modern human populations. Genome Biol Evol (2014) 0.81

Comprehensive transcriptome analysis of developing xylem responding to artificial bending and gravitational stimuli in Betula platyphylla. PLoS One (2014) 0.80

Comprehensively identifying and characterizing the missing gene sequences in human reference genome with integrated analytic approaches. Hum Genet (2013) 0.80

Ontology-based Vaccine and Drug Adverse Event Representation and Theory-guided Systematic Causal Network Analysis toward Integrative Pharmacovigilance Research. Curr Pharmacol Rep (2016) 0.79

Genome Reduction Uncovers a Large Dispensable Genome and Adaptive Role for Copy Number Variation in Asexually Propagated Solanum tuberosum. Plant Cell (2016) 0.79

Harnessing the power of genomics and immunoinformatics to produce improved vaccines. Expert Opin Drug Discov (2010) 0.79

The composition of the global and feature specific cyanobacterial core-genomes. Front Microbiol (2015) 0.78

DIAMUND: direct comparison of genomes to detect mutations. Hum Mutat (2014) 0.78

Involvement of a citrus meiotic recombination TTC-repeat motif in the formation of gross deletions generated by ionizing radiation and MULE activation. BMC Genomics (2015) 0.78

Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale. Gigascience (2015) 0.78

Comprehensive assembly of novel transcripts from unmapped human RNA-Seq data and their association with cancer. Mol Syst Biol (2015) 0.77

Genome-scale transcriptome analysis in response to nitric oxide in birch cells: implications of the triterpene biosynthetic pathway. PLoS One (2014) 0.77

Admixture mapping: from paradigms of race and ethnicity to population history. Hugo J (2010) 0.76

Anchored pseudo-de novo assembly of human genomes identifies extensive sequence variation from unmapped sequence reads. Hum Genet (2016) 0.76

Analysis of Plant Pan-Genomes and Transcriptomes with GET_HOMOLOGUES-EST, a Clustering Solution for Sequences of the Same Species. Front Plant Sci (2017) 0.76

Alignment of Short Reads: A Crucial Step for Application of Next-Generation Sequencing Data in Precision Medicine. Pharmaceutics (2015) 0.76

The perspective from EASAC and FEAM on direct-to-consumer genetic testing for health-related purposes. Eur J Hum Genet (2012) 0.76

RPAN: rice pan-genome browser for ∼3000 rice genomes. Nucleic Acids Res (2016) 0.75

Individual Genome of the Russian Male: SNP Calling and a de novo Assembly of Unmapped Reads. Acta Naturae (2010) 0.75

Discrepancies between human DNA, mRNA and protein reference sequences and their relation to single nucleotide variants in the human population. Database (Oxford) (2016) 0.75

An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes. Nat Commun (2016) 0.75

Reproductive and developmental genomics retreat at Cornell University, 2012. Mol Reprod Dev (2012) 0.75

Improving the Power of Structural Variation Detection by Augmenting the Reference. PLoS One (2015) 0.75

Genome reassembly with high-throughput sequencing data. BMC Genomics (2013) 0.75

NSIT: novel sequence identification tool. PLoS One (2014) 0.75

Graphtyper enables population-scale genotyping using pangenome graphs. Nat Genet (2017) 0.75

Articles cited by this

Basic local alignment search tool. J Mol Biol (1990) 659.07

BLAT--the BLAST-like alignment tool. Genome Res (2002) 126.78

A haplotype map of the human genome. Nature (2005) 105.70

Accurate whole human genome sequencing using reversible terminator chemistry. Nature (2008) 90.20

Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics (2003) 53.11

The complete genome of an individual by massively parallel DNA sequencing. Nature (2008) 52.81

Detection of large-scale variation in the human genome. Nat Genet (2004) 49.18

The diploid genome sequence of an Asian individual. Nature (2008) 46.29

De novo assembly of human genomes with massively parallel short read sequencing. Genome Res (2009) 45.91

The diploid genome sequence of an individual human. PLoS Biol (2007) 44.80

A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature (2001) 42.18

Finishing the euchromatic sequence of the human genome. Nature (2004) 41.40

Large-scale copy number polymorphism in the human genome. Science (2004) 34.64

Genome-wide association studies for common diseases and complex traits. Nat Rev Genet (2005) 33.96

Genetic structure of human populations. Science (2002) 30.91

Mapping and sequencing of structural variation from eight human genomes. Nature (2008) 30.28

Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat Genet (1999) 24.24

Worldwide human relationships inferred from genome-wide patterns of variation. Science (2008) 22.44

Whole-genome patterns of common DNA variation in three human populations. Science (2005) 21.22

A human genome diversity cell line panel. Science (2002) 14.11

Genotype, haplotype and copy-number variation in worldwide human populations. Nature (2008) 12.40

Human genetic variation and its contribution to complex traits. Nat Rev Genet (2009) 12.11

The genetic structure and history of Africans and African Americans. Science (2009) 10.65

Inference of population structure using multilocus genotype data: dominant markers and null alleles. Mol Ecol Notes (2007) 10.11

The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group. Genome Res (2009) 7.87

Genome assembly comparison identifies structural variants in the human genome. Nat Genet (2006) 6.93

Genetic variation and population structure in native Americans. PLoS Genet (2007) 4.87

Closing gaps in the human genome with fosmid resources generated from multiple individuals. Nat Genet (2007) 4.30

Structural classification of zinc fingers: survey and summary. Nucleic Acids Res (2003) 3.94

Use of y chromosome and mitochondrial DNA population structure in tracing human migrations. Annu Rev Genet (2007) 3.15

Classification and nomenclature of all human homeobox genes. BMC Biol (2007) 2.61

The Human Genome Diversity Project: past, present and future. Nat Rev Genet (2005) 2.38

The MUC family: an obituary. Trends Biochem Sci (2002) 1.80

Extensive copy-number variation of the human olfactory receptor gene family. Am J Hum Genet (2008) 1.71

A population threshold for functional polymorphisms. Genome Res (2003) 1.52

Active genes in junk DNA? Characterization of DUX genes embedded within 3.3 kb repeated elements. Gene (2001) 1.10

Faster human genome sequencing. Nat Biotechnol (2009) 0.99

Articles by these authors

A second generation human haplotype map of over 3.1 million SNPs. Nature (2007) 85.39

SOAP: short oligonucleotide alignment program. Bioinformatics (2008) 68.13

The diploid genome sequence of an Asian individual. Nature (2008) 46.29

De novo assembly of human genomes with massively parallel short read sequencing. Genome Res (2009) 45.91

A human gut microbial gene catalogue established by metagenomic sequencing. Nature (2010) 43.63

A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science (2002) 42.78

SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics (2009) 39.47

Enterotypes of the human gut microbiome. Nature (2011) 24.36

SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience (2012) 20.89

Characterization of microRNAs in serum: a novel class of biomarkers for diagnosis of cancer and other diseases. Cell Res (2008) 20.59

Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc Natl Acad Sci U S A (2002) 20.48

The genome sequence of the malaria mosquito Anopheles gambiae. Science (2002) 20.36

International network of cancer genome projects. Nature (2010) 20.35

Evolution of genes and genomes on the Drosophila phylogeny. Nature (2007) 18.01

Genome-wide detection and characterization of positive selection in human populations. Nature (2007) 17.27

SNP detection for massively parallel whole-genome resequencing. Genome Res (2009) 15.96

The sequence and de novo assembly of the giant panda genome. Nature (2009) 15.76

WEGO: a web tool for plotting GO annotations. Nucleic Acids Res (2006) 13.06

Mapping copy number variation by population-scale genome sequencing. Nature (2011) 12.55

A systematic survey of loss-of-function variants in human protein-coding genes. Science (2012) 12.25

A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature (2012) 11.68