FastUniq: a fast de novo duplicates removal tool for paired short reads.

PubWeight™: 1.54 | Rank: Top 4%

PMC 3527383

Published in PLoS One on December 20, 2012

Authors

Haibin Xu (1), Xiang Luo, Jun Qian, Xiaohui Pang, Jingyuan Song, Guangrui Qian, Jinhui Chen, Shilin Chen

Author Affiliations

1: National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People's Republic of China.

Articles citing this

Molecular Epidemiology of Colonizing and Infecting Isolates of Klebsiella pneumoniae. mSphere (2016) 1.01

Global RNA recognition patterns of post-transcriptional regulators Hfq and CsrA revealed by UV crosslinking in vivo. EMBO J (2016) 0.98

Developmental Acquisition of Regulomes Underlies Innate Lymphoid Cell Functionality. Cell (2016) 0.93

Metagenome-assembled genomes uncover a global brackish microbiome. Genome Biol (2015) 0.90

Modular approach to customise sample preparation procedures for viral metagenomics: a reproducible protocol for virome analysis. Sci Rep (2015) 0.88

HUGO: Hierarchical mUlti-reference Genome cOmpression for aligned reads. J Am Med Inform Assoc (2013) 0.87

Hot spots of DNA double-strand breaks in human rDNA units are produced in vivo. Sci Rep (2016) 0.87

Evaluating the necessity of PCR duplicate removal from next-generation sequencing data and a comparison of approaches. BMC Bioinformatics (2016) 0.84

Environmental Viral Genomes Shed New Light on Virus-Host Interactions in the Ocean. mSphere (2017) 0.84

Microbial Gene Abundance and Expression Patterns across a River to Ocean Salinity Gradient. PLoS One (2015) 0.82

Transcriptomic and epigenomic characterization of the developing bat wing. Nat Genet (2016) 0.81

TRAPLINE: a standardized and automated pipeline for RNA sequencing data analysis, evaluation and annotation. BMC Bioinformatics (2016) 0.79

G-CNV: A GPU-Based Tool for Preparing Data to Detect CNVs with Read-Depth Methods. Front Bioeng Biotechnol (2015) 0.78

The Genome and Methylome of a Subsocial Small Carpenter Bee, Ceratina calcarata. Genome Biol Evol (2016) 0.78

An ultra-high density genetic linkage map of perennial ryegrass (Lolium perenne) using genotyping by sequencing (GBS) based on a reference shotgun genome assembly. Ann Bot (2016) 0.77

Combined de novo and genome guided assembly and annotation of the Pinus patula juvenile shoot transcriptome. BMC Genomics (2015) 0.77

A genomic selection component analysis characterizes migration-selection balance. Evolution (2015) 0.77

Transposable elements in a clade of three tetraploids and a diploid relative, focusing on Gypsy amplification. Mob DNA (2015) 0.77

Transcriptome profiling of drought responsive noncoding RNAs and their target genes in rice. BMC Genomics (2016) 0.76

Genomewide ancestry and divergence patterns from low-coverage sequencing data reveal a complex history of admixture in wild baboons. Mol Ecol (2016) 0.76

Validation of picogram- and femtogram-input DNA libraries for microscale metagenomics. PeerJ (2016) 0.76

A robust transcriptional program in newts undergoing multiple events of lens regeneration throughout their lifespan. Elife (2015) 0.76

An integrative method to normalize RNA-Seq data. BMC Bioinformatics (2014) 0.76

Thrombotic Microangiopathy in Inverted Formin 2-Mediated Renal Disease. J Am Soc Nephrol (2016) 0.75

Tropical ancient DNA reveals relationships of the extinct Bahamian giant tortoise Chelonoidis alburyorum. Proc Biol Sci (2017) 0.75

Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize. Plant Cell (2016) 0.75

Draft Genome Sequence of Idiomarina sp. Strain 5.13, a Highly Stress-Resistant Bacterium Isolated from the Southwest Indian Ridge. Genome Announc (2017) 0.75

Chemosensory adaptations of the mountain fly Drosophila nigrosparsa (Insecta: Diptera) through genomics' and structural biology's lenses. Sci Rep (2017) 0.75

Genome sequences of two closely related strains of Escherichia coli K-12 GM4792. Stand Genomic Sci (2015) 0.75

Genome Sequence of Rhodococcus sp. Strain PML026, a Trehalolipid Biosurfactant Producer and Biodegrader of Oil and Alkanes. Genome Announc (2015) 0.75

Removing duplicate reads using graphics processing units. BMC Bioinformatics (2016) 0.75

Identification of Heterozygous Single- and Multi-exon Deletions in IL7R by Whole Exome Sequencing. J Clin Immunol (2016) 0.75

A Survey of Computational Tools to Analyze and Interpret Whole Exome Sequencing Data. Int J Genomics (2016) 0.75

Museomics resolve the systematics of an endangered grass lineage endemic to north-western Madagascar. Ann Bot (2016) 0.75

The A, C, G, and T of Genome Assembly. Biomed Res Int (2016) 0.75

Genome Sequence of the Edible Cultivated Mushroom Lentinula edodes (Shiitake) Reveals Insights into Lignocellulose Degradation. PLoS One (2016) 0.75

Effect of method of deduplication on estimation of differential gene expression using RNA-seq. PeerJ (2017) 0.75

Isolation and characterization of centromeric repetitive DNA sequences in Saccharum spontaneum. Sci Rep (2017) 0.75

The Nephila clavipes genome highlights the diversity of spider silk genes and their complex expression. Nat Genet (2017) 0.75

Genetic heterogeneity of motor neuropathies. Neurology (2017) 0.75

The Evolutionary Dynamics of the Odorant Receptor Gene Family in Corbiculate Bees. Genome Biol Evol (2017) 0.75

The draft genome sequence of a desert tree Populus pruinosa. Gigascience (2017) 0.75

Comparative genomic analysis of Mycobacterium neoaurum MN2 and MN4 substrate and product tolerance. 3 Biotech (2017) 0.75

Genome sequencing of Metrosideros polymorpha (Myrtaceae), a dominant species in various habitats in the Hawaiian Islands with remarkable phenotypic variations. J Plant Res (2016) 0.75

Draft Genome Sequence of Deinococcus indicus DR1, a Novel Strain Isolated from a Freshwater Wetland. Genome Announc (2017) 0.75

Articles cited by this

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol (2009) 235.12

The Sequence Alignment/Map format and SAMtools. Bioinformatics (2009) 232.39

Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics (2009) 190.94

Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res (2008) 151.16

De novo assembly of human genomes with massively parallel short read sequencing. Genome Res (2009) 45.91

Paired-end mapping reveals extensive structural variation in the human genome. Science (2007) 30.46

Scaffolding pre-assembled contigs using SSPACE. Bioinformatics (2010) 23.27

High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci U S A (2010) 22.97

The sequence and de novo assembly of the giant panda genome. Nature (2009) 15.76

High-throughput oncogene mutation profiling in human cancer. Nat Genet (2007) 12.68

An initial map of insertion and deletion (INDEL) variation in the human genome. Genome Res (2006) 11.60

Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+C)-biased genomes. Nat Methods (2009) 10.41

Searching for SNPs with cloud computing. Genome Biol (2009) 10.12

Comparison of DNA sequences with protein sequences. Genomics (1997) 7.76

Performance comparison of exome DNA sequencing technologies. Nat Biotechnol (2011) 7.11

Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet (2010) 7.05

Discovery and genotyping of genome structural polymorphism by sequencing on a population scale. Nat Genet (2011) 6.67

Discovery of common Asian copy number variants using integrated high-resolution array CGH and massively parallel DNA sequencing. Nat Genet (2010) 3.49

Using the Acropora digitifera genome to understand coral responses to environmental change. Nature (2011) 3.26

Function annotation of the rice transcriptome at single-nucleotide resolution by RNA-seq. Genome Res (2010) 2.73

Quality control procedures for genome-wide association studies. Curr Protoc Hum Genet (2011) 2.48

A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data. Genome Res (2011) 2.00

Comparative analysis of Alu repeats in primate genomes. Genome Res (2009) 1.85

Mu transposon insertion sites and meiotic recombination events co-localize with epigenetic marks for open chromatin across the maize genome. PLoS Genet (2009) 1.70

A dominant mutation in RPE65 identified by whole-exome sequencing causes retinitis pigmentosa with choroidal involvement. Eur J Hum Genet (2011) 1.66

SEAL: a distributed short read mapping and duplicate removal tool. Bioinformatics (2011) 1.58

Efficient mapping and cloning of mutations in zebrafish by low-coverage whole-genome sequencing. Genetics (2011) 1.23

Illumina mate-paired DNA sequencing-library preparation using Cre-Lox recombination. Nucleic Acids Res (2011) 1.22

Fulcrum: condensing redundant reads from high-throughput sequencing studies. Bioinformatics (2012) 1.03

Articles by these authors

Comment on " 'Stemness': transcriptional profiling of embryonic and adult stem cells" and "a stem cell molecular signature". Science (2003) 8.79

Neuronal subtype-specific genes that control corticospinal motor neuron development in vivo. Neuron (2005) 7.28

Heart repair by reprogramming non-myocytes with cardiac transcription factors. Nature (2012) 6.65

Effect on left ventricular function of intracoronary transplantation of autologous bone marrow mesenchymal stem cell in patients with acute myocardial infarction. Am J Cardiol (2004) 5.82

Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species. PLoS One (2010) 5.39

Gating of CFTR by the STAS domain of SLC26 transporters. Nat Cell Biol (2004) 3.43

Effectiveness of strengthened stimulation during acupuncture for the treatment of Bell palsy: a randomized controlled trial. CMAJ (2013) 3.15

Reprogramming of human fibroblasts toward a cardiac fate. Proc Natl Acad Sci U S A (2013) 3.14

Use of ITS2 region as the universal DNA barcode for plants and animals. PLoS One (2010) 2.92

miR-29 is a major regulator of genes associated with pulmonary fibrosis. Am J Respir Cell Mol Biol (2010) 2.87

De novo sequencing and analysis of the American ginseng root transcriptome using a GS FLX Titanium platform to discover putative genes involved in ginsenoside biosynthesis. BMC Genomics (2010) 2.41

Notch signaling controls the balance of ciliated and secretory cell fates in developing airways. Development (2009) 2.33

A molecular mechanism for aberrant CFTR-dependent HCO(3)(-) transport in cystic fibrosis. EMBO J (2002) 2.11

Histone deacetylase inhibition blunts ischemia/reperfusion injury by inducing cardiomyocyte autophagy. Circulation (2014) 2.07

MicroRNA-214 protects the mouse heart from ischemic injury by controlling Ca²⁺ overload and cell death. J Clin Invest (2012) 1.90

Airway PI3K pathway activation is an early and reversible event in lung cancer development. Sci Transl Med (2010) 1.84

Protease-activated receptor 2 exerts local protection and mediates some systemic complications in acute pancreatitis. Gastroenterology (2004) 1.72

Identification of medicinal plants in the family Fabaceae using a potential DNA barcode ITS2. J Ethnopharmacol (2010) 1.71

Inhibition of Tgf beta signaling by endogenous retinoic acid is essential for primary lung bud induction. Development (2007) 1.70

Distal left main coronary bifurcation lesions predict worse outcome in patients undergoing percutaneous implantation of drug-eluting stents: results from the Drug-Eluting Stent for the Treatment of Left Main Disease (DISTAL) Study. Cardiology (2009) 1.68

A susceptibility locus at chromosome 3p21 linked to familial nasopharyngeal carcinoma. Cancer Res (2004) 1.65

Metabolic stress-induced activation of FoxO1 triggers diabetic cardiomyopathy in mice. J Clin Invest (2012) 1.63

Authentication of the family Polygonaceae in Chinese pharmacopoeia by DNA barcoding technique. J Ethnopharmacol (2009) 1.62

Mitochondrial aberrations in mucolipidosis Type IV. J Biol Chem (2006) 1.61

Plant DNA barcoding: from gene to genome. Biol Rev Camb Philos Soc (2014) 1.56

miR-129 regulates cell proliferation by downregulating Cdk6 expression. Cell Cycle (2010) 1.51

Critical role of PIP5KIγ87 in InsP3-mediated Ca(2+) signaling. J Cell Biol (2004) 1.51

Liver metastases in rats: chemoembolization combined with interstitial laser ablation for treatment. Radiology (2005) 1.48

SLC1A5 mediates glutamine transport required for lung cancer cell growth and survival. Clin Cancer Res (2012) 1.46

Recurrent DNMT3A R882 mutations in Chinese patients with acute myeloid leukemia and myelodysplastic syndrome. PLoS One (2011) 1.46

Comparison of right ventricular apex and right ventricular outflow tract septum pacing in the elderly with normal left ventricular ejection fraction: long-term follow-up. Kardiol Pol (2012) 1.46

Quantum rod bioconjugates as targeted probes for confocal and two-photon fluorescence imaging of cancer cells. Nano Lett (2007) 1.45

Gene expression profiling of nasopharyngeal carcinoma reveals the abnormally regulated Wnt signaling pathway. Hum Pathol (2006) 1.44

Physical and functional interaction between calcineurin and the cardiac L-type Ca2+ channel. Circ Res (2009) 1.42

MicroRNA miR-98 inhibits tumor angiogenesis and invasion by targeting activin receptor-like kinase-4 and matrix metalloproteinase-11. Oncotarget (2012) 1.40

The associations of Janus kinase-2 (JAK2) A830G polymorphism and the treatment outcomes in patients with acute myeloid leukemia. Leuk Lymphoma (2010) 1.40

The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza. PLoS One (2013) 1.38

Unilateral, multilevel, interlaminar fenestration in the removal of a multisegment cervical intramedullary ependymoma. Spine J (2013) 1.38

A new strategy for septal ablation with transendocardial ethanol injection using a multifunctional intracardiac echocardiography catheter: a feasibility study in canines. Catheter Cardiovasc Interv (2011) 1.37

Safety and pharmacokinetics of novel selective vascular endothelial growth factor receptor-2 inhibitor YN968D1 in patients with advanced malignancies. BMC Cancer (2010) 1.35

Ptr-miR397a is a negative regulator of laccase genes affecting lignin content in Populus trichocarpa. Proc Natl Acad Sci U S A (2013) 1.35

Reconstruction of an astigmatic hard X-ray beam and alignment of K-B mirrors from ptychographic coherent diffraction data. Opt Express (2010) 1.31

Extensive pyrosequencing reveals frequent intra-genomic variations of internal transcribed spacer regions of nuclear ribosomal DNA. PLoS One (2012) 1.31

Neuroepithelial body microenvironment is a niche for a distinct subset of Clara-like precursors in the developing airways. Proc Natl Acad Sci U S A (2012) 1.31

Genome sequence of the model medicinal mushroom Ganoderma lucidum. Nat Commun (2012) 1.30

Systemic bisperoxovanadium activates Akt/mTOR, reduces autophagy, and enhances recovery following cervical spinal cord injury. PLoS One (2012) 1.30

Improvement of cardiac function after transplantation of autologous bone marrow mesenchymal stem cells in patients with acute myocardial infarction. Chin Med J (Engl) (2004) 1.28

Transcriptome analysis reveals ginsenosides biosynthetic genes, microRNAs and simple sequence repeats in Panax ginseng C. A. Meyer. BMC Genomics (2013) 1.28

Genome-wide identification and characterization of novel genes involved in terpenoid biosynthesis in Salvia miltiorrhiza. J Exp Bot (2012) 1.26

Mapping the potential distribution of high artemisinin-yielding Artemisia annua L. (Qinghao) in China with a geographic information system. Chin Med (2010) 1.26

Differential expression of components of the microRNA machinery during mouse organogenesis. Biochem Biophys Res Commun (2005) 1.25

MicroRNA miR-24 enhances tumor invasion and metastasis by targeting PTPN9 and PTPRF to promote EGF signaling. J Cell Sci (2013) 1.25

Distinct roles for retinoic acid receptors alpha and beta in early lung morphogenesis. Dev Biol (2006) 1.24

Moderate traumatic brain injury causes acute dendritic and synaptic degeneration in the hippocampal dentate gyrus. PLoS One (2011) 1.23

Selective death of newborn neurons in hippocampal dentate gyrus following moderate experimental traumatic brain injury. J Neurosci Res (2008) 1.23

Evaluating the feasibility of using candidate DNA barcodes in discriminating species of the large Asteraceae family. BMC Evol Biol (2010) 1.23

Epidemiology and control of human granulocytic anaplasmosis: a systematic review. Vector Borne Zoonotic Dis (2012) 1.21

Homer 2 tunes G protein-coupled receptors stimulus intensity by regulating RGS proteins and PLCbeta GAP activities. J Cell Biol (2003) 1.20

Construction of VNP20009: a novel, genetically stable antibiotic-sensitive strain of tumor-targeting Salmonella for parenteral administration in humans. Methods Mol Med (2004) 1.20

A retinoic acid-dependent network in the foregut controls formation of the mouse lung primordium. J Clin Invest (2010) 1.18

Moderate traumatic brain injury promotes proliferation of quiescent neural progenitors in the adult hippocampus. Exp Neurol (2009) 1.16

Using 915 nm laser excited Tm³⁺/Er³⁺/Ho³⁺-doped NaYbF4 upconversion nanoparticles for in vitro and deeper in vivo bioimaging without overheating irradiation. ACS Nano (2011) 1.16

Comparison of 454-ESTs from Huperzia serrata and Phlegmariurus carinatus reveals putative genes involved in lycopodium alkaloid biosynthesis and developmental regulation. BMC Plant Biol (2010) 1.16

Identification of FGF10 targets in the embryonic lung epithelium during bud morphogenesis. J Biol Chem (2004) 1.16

Neuroprotection against traumatic brain injury by a peptide derived from the collapsin response mediator protein 2 (CRMP2). J Biol Chem (2011) 1.15

The short ITS2 sequence serves as an efficient taxonomic sequence tag in comparison with the full-length ITS. Biomed Res Int (2013) 1.14

IDH1 and IDH2 mutation analysis in Chinese patients with acute myeloid leukemia and myelodysplastic syndrome. Ann Hematol (2011) 1.14

DNA barcode goes two-dimensions: DNA QR code web server. PLoS One (2012) 1.12

Ebola Virus Outbreak Investigation, Sierra Leone, September 28-November 11, 2014. Emerg Infect Dis (2015) 1.10

AE4 is a DIDS-sensitive Cl(-)/HCO(-)(3) exchanger in the basolateral membrane of the renal CCD and the SMG duct. Am J Physiol Cell Physiol (2002) 1.10

Analysis of the transcriptome of Panax notoginseng root uncovers putative triterpene saponin-biosynthetic genes and genetic markers. BMC Genomics (2011) 1.09

Using DNA barcoding to identify species within Euphorbiaceae. Planta Med (2010) 1.09

Integrative genomics analysis identifies candidate drivers at 3q26-29 amplicon in squamous cell carcinoma of the lung. Clin Cancer Res (2013) 1.08

Inhibition of the tumor necrosis factor-alpha pathway is radioprotective for the lung. Clin Cancer Res (2008) 1.07

Imaging pancreatic cancer using surface-functionalized quantum dots. J Phys Chem B (2007) 1.06

Analysis of expressed sequence tags from the Huperzia serrata leaf for gene discovery in the areas of secondary metabolite biosynthesis and development regulation. Physiol Plant (2009) 1.06

Review of the botanical characteristics, phytochemistry, and pharmacology of Astragalus membranaceus (Huangqi). Phytother Res (2014) 1.05

Utility of the trnH-psbA intergenic spacer region and its combinations as plant DNA barcodes: a meta-analysis. PLoS One (2012) 1.05

Polysaccharides from the root of Angelica sinensis promotes hematopoiesis and thrombopoiesis through the PI3K/AKT pathway. BMC Complement Altern Med (2010) 1.05