De novo assembly of human genomes with massively parallel short read sequencing.

PubWeight™: 45.91 | Rank: Top 0.01% | All-Time Top 1000

Article: PMC 2813482

Published in Genome Res on December 17, 2009

Authors

Ruiqiang Li (1), Hongmei Zhu, Jue Ruan, Wubin Qian, Xiaodong Fang, Zhongbin Shi, Yingrui Li, Shengting Li, Gao Shan, Karsten Kristiansen, Songgang Li, Huanming Yang, Jian Wang, Jun Wang

Author Affiliations

1: Beijing Genomics Institute at Shenzhen, Shenzhen 518083, China.

Articles citing this

(truncated to the top 100)

SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol (2012) 62.36

A human gut microbial gene catalogue established by metagenomic sequencing. Nature (2010) 43.63

High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci U S A (2010) 22.97

SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience (2012) 20.89

A survey of sequence alignment algorithms for next-generation sequencing. Brief Bioinform (2010) 18.05

The sequence and de novo assembly of the giant panda genome. Nature (2009) 15.76

FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics (2011) 13.71

QUAST: quality assessment tool for genome assemblies. Bioinformatics (2013) 13.07

Quake: quality-aware detection and correction of sequencing errors. Genome Biol (2010) 12.52

GAGE: A critical evaluation of genome assemblies and assembly algorithms. Genome Res (2012) 11.33

Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol (2012) 10.31

Limitations of next-generation genome sequence assembly. Nat Methods (2010) 9.04

An integrated pipeline for de novo assembly of microbial genomes. PLoS One (2012) 8.97

Dindel: accurate indel calls from short-read data. Genome Res (2010) 8.62

Assembly algorithms for next-generation sequencing data. Genomics (2010) 8.56

Assemblathon 1: a competitive assessment of de novo short read assembly methods. Genome Res (2011) 8.38

The genome of the mesopolyploid crop species Brassica rapa. Nat Genet (2011) 8.23

Genome sequence and analysis of the tuber crop potato. Nature (2011) 7.77

Genome structural variation discovery and genotyping. Nat Rev Genet (2011) 7.34

Assembling single-cell genomes and mini-metagenomes from chimeric MDA products. J Comput Biol (2013) 6.90

Efficient de novo assembly of large genomes using compressed data structures. Genome Res (2011) 6.05

Assembly of large genomes using second-generation sequencing. Genome Res (2010) 5.94

Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps. Genome Biol (2010) 5.79

De novo assembly and genotyping of variants using colored de Bruijn graphs. Nat Genet (2012) 5.61

Efficient de novo assembly of single-cell bacterial genomes from short-read data sets. Nat Biotechnol (2011) 5.60

Building the sequence map of the human pan-genome. Nat Biotechnol (2009) 5.53

High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes. Genome Res (2010) 5.38

The oyster genome reveals stress adaptation and complexity of shell formation. Nature (2012) 5.30

De novo characterization of a whitefly transcriptome and analysis of its gene expression during development. BMC Genomics (2010) 5.17

Multiple reference genomes and transcriptomes for Arabidopsis thaliana. Nature (2011) 5.13

A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries. Genome Biol (2011) 5.11

The MaSuRCA genome assembler. Bioinformatics (2013) 5.07

Using the Velvet de novo assembler for short-read sequencing technologies. Curr Protoc Bioinformatics (2010) 4.88

Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nat Biotechnol (2011) 4.84

CISA: contig integrator for sequence assembly of bacterial genomes. PLoS One (2013) 4.62

The draft genomes of soft-shell turtle and green sea turtle yield insights into the development and evolution of the turtle-specific body plan. Nat Genet (2013) 4.40

Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads. Genome Res (2014) 4.29

Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS One (2012) 4.18

Genome-wide patterns of genetic variation among elite maize inbred lines. Nat Genet (2010) 4.06

Low concordance of multiple variant-calling pipelines: practical implications for exome and genome sequencing. Genome Med (2013) 3.90

MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads. Nucleic Acids Res (2012) 3.52

Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers. Nat Biotechnol (2011) 3.51

The monarch butterfly genome yields insights into long-distance migration. Cell (2011) 3.51

Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly. Nat Biotechnol (2012) 3.43

Genomic analyses identify distinct patterns of selection in domesticated pigs and Tibetan wild boars. Nat Genet (2013) 3.41

Aegilops tauschii draft genome sequence reveals a gene repertoire for wheat adaptation. Nature (2013) 3.40

WebMGA: a customizable web server for fast metagenomic sequence analysis. BMC Genomics (2011) 3.39

Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics (2010) 3.38

How to apply de Bruijn graphs to genome assembly. Nat Biotechnol (2011) 3.36

De novo assembly and characterization of root transcriptome using Illumina paired-end sequencing and development of cSSR markers in sweet potato (Ipomoea batatas). BMC Genomics (2010) 3.32

A hybrid approach for the automated finishing of bacterial genomes. Nat Biotechnol (2012) 3.29

The draft genome of a diploid cotton Gossypium raimondii. Nat Genet (2012) 3.18

Alterations of the human gut microbiome in liver cirrhosis. Nature (2014) 3.02

Whole-genome sequence of Schistosoma haematobium. Nat Genet (2012) 2.91

Exploiting sparseness in de novo genome assembly. BMC Bioinformatics (2012) 2.88

The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line. Nat Biotechnol (2011) 2.87

Draft genome of the wheat A-genome progenitor Triticum urartu. Nature (2013) 2.86

PRICE: software for the targeted assembly of components of (Meta) genomic sequence data. G3 (Bethesda) (2013) 2.85

Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds. BMC Genomics (2011) 2.84

Genome sequence of foxtail millet (Setaria italica) provides insights into grass evolution and biofuel potential. Nat Biotechnol (2012) 2.82

The draft genome of sweet orange (Citrus sinensis). Nat Genet (2012) 2.81

Identification of PRRT2 as the causative gene of paroxysmal kinesigenic dyskinesias. Brain (2011) 2.80

ConDeTri--a content dependent read trimmer for Illumina data. PLoS One (2011) 2.78

GapFiller: a de novo assembly approach to fill the gap within paired reads. BMC Bioinformatics (2012) 2.72

Direct comparisons of Illumina vs. Roche 454 sequencing technologies on the same microbial community DNA sample. PLoS One (2012) 2.68

A beginner's guide to eukaryotic genome annotation. Nat Rev Genet (2012) 2.67

Tackling soil diversity with the assembly of large, complex metagenomes. Proc Natl Acad Sci U S A (2014) 2.65

A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies. PLoS One (2011) 2.55

Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genomics (2011) 2.53

Sequencing technologies and genome sequencing. J Appl Genet (2011) 2.52

Phylotranscriptomic analysis of the origin and early diversification of land plants. Proc Natl Acad Sci U S A (2014) 2.48

Meta-IDBA: a de Novo assembler for metagenomic data. Bioinformatics (2011) 2.46

Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly. Bioinformatics (2012) 2.46

Sequencing and automated whole-genome optical mapping of the genome of a domestic goat (Capra hircus). Nat Biotechnol (2012) 2.45

Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly. Nat Biotechnol (2011) 2.45

The impact of next-generation sequencing on genomics. J Genet Genomics (2011) 2.41

Challenges of sequencing human genomes. Brief Bioinform (2010) 2.39

Comparing de novo genome assembly: the long and short of it. PLoS One (2011) 2.37

Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques. Nat Biotechnol (2011) 2.37

The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions. Nat Genet (2012) 2.33

Space-efficient and exact de Bruijn graph representation based on a Bloom filter. Algorithms Mol Biol (2013) 2.25

Bambus 2: scaffolding metagenomes. Bioinformatics (2011) 2.24

GAGE-B: an evaluation of genome assemblers for bacterial organisms. Bioinformatics (2013) 2.24

A roadmap for natural product discovery based on large-scale genomics and metabolomics. Nat Chem Biol (2014) 2.23

Detection and characterization of novel sequence insertions using paired-end next-generation sequencing. Bioinformatics (2010) 2.23

Historical variations in mutation rate in an epidemic pathogen, Yersinia pestis. Proc Natl Acad Sci U S A (2012) 2.22

Diverse CRISPRs evolving in human microbiomes. PLoS Genet (2012) 2.20

De novo analysis of transcriptome dynamics in the migratory locust during the development of phase traits. PLoS One (2010) 2.19

Complete viral RNA genome sequencing of ultra-low copy samples by sequence-independent amplification. Nucleic Acids Res (2012) 2.19

De novo assembly and characterization of bark transcriptome using Illumina sequencing and development of EST-SSR markers in rubber tree (Hevea brasiliensis Muell. Arg.). BMC Genomics (2012) 2.11

Sequence assembly demystified. Nat Rev Genet (2013) 2.09

Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing. Nat Genet (2010) 2.08

Transcriptomic analysis of Chinese bayberry (Myrica rubra) fruit development and ripening using RNA-Seq. BMC Genomics (2012) 2.07

Comparative studies of de novo assembly tools for next-generation sequencing technologies. Bioinformatics (2011) 2.05

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat Biotechnol (2013) 2.05

The draft genome and transcriptome of Cannabis sativa. Genome Biol (2011) 2.03

Evidence of cellulose metabolism by the giant panda gut microbiome. Proc Natl Acad Sci U S A (2011) 2.00

Polar and brown bear genomes reveal ancient admixture and demographic footprints of past climate change. Proc Natl Acad Sci U S A (2012) 1.98

Deep sequencing of the oral microbiome reveals signatures of periodontal disease. PLoS One (2012) 1.96

From Peer-Reviewed to Peer-Reproduced in Scholarly Publishing: The Complementary Roles of Data Models and Workflows in Bioinformatics. PLoS One (2015) 1.95

Articles cited by this

Initial sequencing and analysis of the human genome. Nature (2001) 212.86

Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res (2008) 151.16

The sequence of the human genome. Science (2001) 101.55

Initial sequencing and comparative analysis of the mouse genome. Nature (2002) 96.15

Accurate whole human genome sequencing using reversible terminator chemistry. Nature (2008) 90.20

A second generation human haplotype map of over 3.1 million SNPs. Nature (2007) 85.39

SOAP: short oligonucleotide alignment program. Bioinformatics (2008) 68.13

The diploid genome sequence of an Asian individual. Nature (2008) 46.29

ABySS: a parallel assembler for short read sequence data. Genome Res (2009) 43.20

SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics (2009) 39.47

A whole-genome assembly of Drosophila. Science (2000) 38.48

Whole-genome re-sequencing. Curr Opin Genet Dev (2006) 35.24

An Eulerian path approach to DNA fragment assembly. Proc Natl Acad Sci U S A (2001) 31.51

Mapping and sequencing of structural variation from eight human genomes. Nature (2008) 30.28

ARACHNE: a whole-genome shotgun assembler. Genome Res (2002) 22.72

ALLPATHS: de novo assembly of whole-genome shotgun microreads. Genome Res (2008) 20.61

Assembling millions of short DNA sequences using SSAKE. Bioinformatics (2006) 18.71

SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing. Genome Res (2007) 16.20

SNP detection for massively parallel whole-genome resequencing. Genome Res (2009) 15.96

The sequence and de novo assembly of the giant panda genome. Nature (2009) 15.76

Short read fragment assembly of bacterial genomes. Genome Res (2007) 15.40

The phusion assembler. Genome Res (2003) 15.25

De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer. Genome Res (2008) 14.90

Extending assembly of short DNA sequences to handle error. Bioinformatics (2007) 14.46

PCAP: a whole-genome assembly program. Genome Res (2003) 12.36

Single-molecule DNA sequencing of a viral genome. Science (2008) 11.66

The Atlas genome assembly system. Genome Res (2004) 9.78

Advanced sequencing technologies: methods and goals. Nat Rev Genet (2004) 9.31

The genome of the cucumber, Cucumis sativus L. Nat Genet (2009) 8.19

Building the sequence map of the human pan-genome. Nat Biotechnol (2009) 5.53

RePS: a sequence assembler that masks exact repeats identified from the shotgun data. Genome Res (2002) 4.35

Articles by these authors

The Sequence Alignment/Map format and SAMtools. Bioinformatics (2009) 232.39

Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res (2008) 157.44

A second generation human haplotype map of over 3.1 million SNPs. Nature (2007) 85.39

SOAP: short oligonucleotide alignment program. Bioinformatics (2008) 68.13

The diploid genome sequence of an Asian individual. Nature (2008) 46.29

A human gut microbial gene catalogue established by metagenomic sequencing. Nature (2010) 43.63

A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science (2002) 42.78

SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics (2009) 39.47

Enterotypes of the human gut microbiome. Nature (2011) 24.36

SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience (2012) 20.89

Characterization of microRNAs in serum: a novel class of biomarkers for diagnosis of cancer and other diseases. Cell Res (2008) 20.59

Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc Natl Acad Sci U S A (2002) 20.48

The genome sequence of the malaria mosquito Anopheles gambiae. Science (2002) 20.36

International network of cancer genome projects. Nature (2010) 20.35

Evolution of genes and genomes on the Drosophila phylogeny. Nature (2007) 18.01

Genome-wide detection and characterization of positive selection in human populations. Nature (2007) 17.27

SNP detection for massively parallel whole-genome resequencing. Genome Res (2009) 15.96

The sequence and de novo assembly of the giant panda genome. Nature (2009) 15.76

WEGO: a web tool for plotting GO annotations. Nucleic Acids Res (2006) 13.06

Mapping copy number variation by population-scale genome sequencing. Nature (2011) 12.55

A systematic survey of loss-of-function variants in human protein-coding genes. Science (2012) 12.25

A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature (2012) 11.68

Sequencing of 50 human exomes reveals adaptation to high altitude. Science (2010) 11.27

A comparison of whole-genome shotgun-derived mouse chromosome 16 and the human genome. Science (2002) 9.59

TreeFam: a curated database of phylogenetic trees of animal gene families. Nucleic Acids Res (2006) 8.83

Assemblathon 1: a competitive assessment of de novo short read assembly methods. Genome Res (2011) 8.38

The genome of the mesopolyploid crop species Brassica rapa. Nat Genet (2011) 8.23

The genome of the cucumber, Cucumis sativus L. Nat Genet (2009) 8.19