Base-calling of automated sequencer traces using phred. I. Accuracy assessment.

PubWeight™: 96.63‹?› | Rank: Top 0.01% | All-Time Top 1000

🔗 View Article (PMID 9521921)

Published in Genome Res on March 01, 1998

Authors

B Ewing1, L Hillier, M C Wendl, P Green

Author Affiliations

1: Department of Molecular Biotechnology, University of Washington, Seattle, Washington 98195-7730, USA.

Articles citing this

(truncated to the top 100)

Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res (2008) 157.44

Genome sequencing in microfabricated high-density picolitre reactors. Nature (2005) 150.21

SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol (2012) 62.36

CAP3: A DNA sequence assembly program. Genome Res (1999) 50.04

The diploid genome sequence of an individual human. PLoS Biol (2007) 44.80

Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC Bioinformatics (2008) 39.08

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods (2013) 31.15

ARACHNE: a whole-genome shotgun assembler. Genome Res (2002) 22.72

Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet (2003) 21.52

The pervasive effects of an antibiotic on the human gut microbiota, as revealed by deep 16S rRNA sequencing. PLoS Biol (2008) 16.57

The ribosomal database project (RDP-II): introducing myRDP space and quality controlled public data. Nucleic Acids Res (2006) 16.26

SNP detection for massively parallel whole-genome resequencing. Genome Res (2009) 15.96

The phusion assembler. Genome Res (2003) 15.25

The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics. PLoS Biol (2003) 13.32

Ironing out the wrinkles in the rare biosphere through improved OTU clustering. Environ Microbiol (2010) 13.19

The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res (2009) 12.09

Complete MHC haplotype sequencing for common disease gene mapping. Genome Res (2004) 12.09

Comprehensive transposon mutant library of Pseudomonas aeruginosa. Proc Natl Acad Sci U S A (2003) 10.63

Quality scores and SNP detection in sequencing-by-synthesis systems. Genome Res (2008) 10.49

SAGEmap: a public gene expression resource. Genome Res (2000) 10.41

The Atlas genome assembly system. Genome Res (2004) 9.78

Complete genome sequence of an M1 strain of Streptococcus pyogenes. Proc Natl Acad Sci U S A (2001) 9.40

The BDGP gene disruption project: single transposon insertions associated with 40% of Drosophila genes. Genetics (2004) 8.40

Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution. Genome Res (2005) 8.38

Assemblathon 1: a competitive assessment of de novo short read assembly methods. Genome Res (2011) 8.38

Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence. Genome Biol (2002) 8.07

Identification of a novel polyomavirus from patients with acute respiratory tract infections. PLoS Pathog (2007) 8.01

Automated finishing with autofinish. Genome Res (2001) 7.97

Genome of the bacterium Streptococcus pneumoniae strain R6. J Bacteriol (2001) 7.97

A novel CpG island set identifies tissue-specific methylation at developmental gene loci. PLoS Biol (2008) 7.95

A high-throughput Arabidopsis reverse genetics system. Plant Cell (2002) 7.58

novoSNP, a novel computational tool for sequence variation discovery. Genome Res (2005) 7.42

Interactome networks and human disease. Cell (2011) 7.16

Critical factors for assembling a high volume of DNA barcodes. Philos Trans R Soc Lond B Biol Sci (2005) 7.13

Hd1, a major photoperiod sensitivity quantitative trait locus in rice, is closely related to the Arabidopsis flowering time gene CONSTANS. Plant Cell (2000) 6.93

The ClinSeq Project: piloting large-scale genome sequencing for research in genomic medicine. Genome Res (2009) 6.83

Visualizing genomes: techniques and challenges. Nat Methods (2010) 6.66

The Drosophila gene collection: identification of putative full-length cDNAs for 70% of D. melanogaster genes. Genome Res (2002) 6.61

Genome sequence and comparative analysis of the solvent-producing bacterium Clostridium acetobutylicum. J Bacteriol (2001) 6.46

BioJava: an open-source framework for bioinformatics. Bioinformatics (2008) 6.17

454 sequencing put to the test using the complex genome of barley. BMC Genomics (2006) 6.12

Genome sequence of Halobacterium species NRC-1. Proc Natl Acad Sci U S A (2000) 5.87

Pathways of carbon assimilation and ammonia oxidation suggested by environmental genomic analyses of marine Crenarchaeota. PLoS Biol (2006) 5.77

Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Res (2011) 5.58

The promise of a DNA taxonomy. Philos Trans R Soc Lond B Biol Sci (2004) 5.51

High-throughput variation detection and genotyping using microarrays. Genome Res (2001) 5.24

A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms. Nature (2004) 5.24

Many novel mammalian microRNA candidates identified by extensive cloning and RAKE analysis. Genome Res (2006) 5.23

Major structural differences and novel potential virulence mechanisms from the genomes of multiple campylobacter species. PLoS Biol (2005) 5.22

Microbes on the human vaginal epithelium. Proc Natl Acad Sci U S A (2005) 5.15

Whole genome comparisons of serotype 4b and 1/2a strains of the food-borne pathogen Listeria monocytogenes reveal new insights into the core genome components of this species. Nucleic Acids Res (2004) 4.97

High-density microarray of small-subunit ribosomal DNA probes. Appl Environ Microbiol (2002) 4.95

RNA viral community in human feces: prevalence of plant pathogenic viruses. PLoS Biol (2006) 4.93

Genome sequence of Shigella flexneri 2a: insights into pathogenicity through comparison with genomes of Escherichia coli K12 and O157. Nucleic Acids Res (2002) 4.92

Comparing sequenced segments of the tomato and Arabidopsis genomes: large-scale duplication followed by selective gene loss creates a network of synteny. Proc Natl Acad Sci U S A (2000) 4.90

Statistical evaluation of alternative models of human evolution. Proc Natl Acad Sci U S A (2007) 4.88

BayGenomics: a resource of insertional mutations in mouse embryonic stem cells. Nucleic Acids Res (2003) 4.83

Inhibitors of factor VIII in black patients with hemophilia. N Engl J Med (2009) 4.82

High-density universal 16S rRNA microarray analysis reveals broader diversity than typical clone library when sampling the environment. Microb Ecol (2007) 4.81

Genomic regions exhibiting positive selection identified from dense genotype data. Genome Res (2005) 4.81

Who ate whom? Adaptive Helicobacter genomic changes that accompanied a host jump from early humans to large felines. PLoS Genet (2006) 4.65

Urban aerosols harbor diverse and dynamic bacterial populations. Proc Natl Acad Sci U S A (2006) 4.62

Simultaneous assessment of soil microbial community structure and function through analysis of the meta-transcriptome. PLoS One (2008) 4.54

Genome assembly reborn: recent computational challenges. Brief Bioinform (2009) 4.53

Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features. Proc Natl Acad Sci U S A (2006) 4.52

Deep resequencing reveals excess rare recent variants consistent with explosive population growth. Nat Commun (2010) 4.39

An intermediate grade of finished genomic sequence suitable for comparative analyses. Genome Res (2004) 4.38

Coding potential of laboratory and clinical strains of human cytomegalovirus. Proc Natl Acad Sci U S A (2003) 4.37

RePS: a sequence assembler that masks exact repeats identified from the shotgun data. Genome Res (2002) 4.35

Close split of sorghum and maize genome progenitors. Genome Res (2004) 4.34

A bioinformatician's guide to metagenomics. Microbiol Mol Biol Rev (2008) 4.33

Natural genetic variation caused by transposable elements in humans. Genetics (2004) 4.33

Mining SNPs from EST databases. Genome Res (1999) 4.31

Metagenomic analysis of human diarrhea: viral detection and discovery. PLoS Pathog (2008) 4.29

The Wolbachia genome of Brugia malayi: endosymbiont evolution within a human pathogenic nematode. PLoS Biol (2005) 4.28

Unique features revealed by the genome sequence of Acinetobacter sp. ADP1, a versatile and naturally transformation competent bacterium. Nucleic Acids Res (2004) 4.19

The Schistosoma japonicum genome reveals features of host-parasite interplay. Nature (2009) 4.19

Protein interaction mapping: a Drosophila case study. Genome Res (2005) 4.15

Widespread genome duplications throughout the history of flowering plants. Genome Res (2006) 4.07

Analysis of a clonal lineage of HIV-1 envelope V2/V3 conformational epitope-specific broadly neutralizing antibodies and their inferred unmutated common ancestors. J Virol (2011) 4.06

Novel insights into the genomic basis of citrus canker based on the genome sequences of two strains of Xanthomonas fuscans subsp. aurantifolii. BMC Genomics (2010) 4.03

Gene discovery using computational and microarray analysis of transcription in the Drosophila melanogaster testis. Genome Res (2000) 3.98

SAFA: semi-automated footprinting analysis software for high-throughput quantification of nucleic acid footprinting experiments. RNA (2005) 3.95

Virus discovery by deep sequencing and assembly of virus-derived small silencing RNAs. Proc Natl Acad Sci U S A (2010) 3.92

Long-range heterogeneity at the 3' ends of human mRNAs. Genome Res (2002) 3.91

The genome sequence of the probiotic intestinal bacterium Lactobacillus johnsonii NCC 533. Proc Natl Acad Sci U S A (2004) 3.88

Finished bacterial genomes from shotgun sequence data. Genome Res (2012) 3.86

Distinct patterns of mutations occurring in de novo AML versus AML arising in the setting of severe congenital neutropenia. Blood (2007) 3.73

Complete genome sequence of the ammonia-oxidizing bacterium and obligate chemolithoautotroph Nitrosomonas europaea. J Bacteriol (2003) 3.72

The complete genome of hyperthermophile Methanopyrus kandleri AV19 and monophyly of archaeal methanogens. Proc Natl Acad Sci U S A (2002) 3.72

CREBBP mutations in relapsed acute lymphoblastic leukaemia. Nature (2011) 3.72

Widespread genomic signatures of natural selection in hominid evolution. PLoS Genet (2009) 3.68

A Drosophila full-length cDNA resource. Genome Biol (2002) 3.67

Assembling large genomes with single-molecule sequencing and locality-sensitive hashing. Nat Biotechnol (2015) 3.66

Comparative genomic analyses of seventeen Streptococcus pneumoniae strains: insights into the pneumococcal supragenome. J Bacteriol (2007) 3.62

APOBEC3 proteins mediate the clearance of foreign DNA from human cells. Nat Struct Mol Biol (2010) 3.61

A mutation in the myostatin gene increases muscle mass and enhances racing performance in heterozygote dogs. PLoS Genet (2007) 3.61

The dawn of human matrilineal diversity. Am J Hum Genet (2008) 3.60

Genome dynamics and diversity of Shigella species, the etiologic agents of bacillary dysentery. Nucleic Acids Res (2005) 3.56

A rare penetrant mutation in CFH confers high risk of age-related macular degeneration. Nat Genet (2011) 3.55

Articles by these authors

Initial sequencing and analysis of the human genome. Nature (2001) 212.86

Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res (1998) 106.16

Consed: a graphical tool for sequence finishing. Genome Res (1998) 59.36

MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. Genomics (1987) 54.39

A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature (2001) 42.18

Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci U S A (1987) 35.83

Global water resources: vulnerability from climate change and population growth. Science (2000) 16.61

A general approach to single-nucleotide polymorphism discovery. Nat Genet (1999) 13.39

A genetic linkage map of the human genome. Cell (1987) 13.37

Comparative analyses of multi-species sequences from targeted genomic regions. Nature (2003) 13.31

A physical map of the human genome. Nature (2001) 12.39

Global threats to human water security and river biodiversity. Nature (2010) 11.84

Automated finishing with autofinish. Genome Res (2001) 7.97

Genes galore: a summary of methods for accessing results from large-scale partial sequencing of anonymous Arabidopsis cDNA clones. Plant Physiol (1994) 7.42

Analysis of expressed sequence tags indicates 35,000 human genes. Nat Genet (2000) 6.11

The C. elegans genome sequencing project: a beginning. Nature (1992) 5.36

Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana. Nature (1999) 5.03

OSP: a computer program for choosing PCR and DNA sequencing primers. PCR Methods Appl (1991) 4.72

A survey of expressed genes in Caenorhabditis elegans. Nat Genet (1992) 4.63

Vero cell toxins in Escherichia coli and related bacteria: transfer by phage and conjugation and toxic action in laboratory animals, chickens and pigs. J Gen Microbiol (1983) 4.52

Identification of p53 gene mutations in bladder cancers and urine samples. Science (1991) 3.68

Mapping the gene for hereditary cutaneous malignant melanoma-dysplastic nevus to chromosome 1p. N Engl J Med (1989) 3.03

Ancient conserved regions in new gene sequences and the protein databases. Science (1993) 3.00

Changes in gene expression associated with developmental arrest and longevity in Caenorhabditis elegans. Genome Res (2001) 2.96

Genetic linkage map of human chromosome 7 with 63 DNA markers. Proc Natl Acad Sci U S A (1987) 2.91

An encyclopedia of mouse genes. Nat Genet (1999) 2.48

zA map for sequence analysis of the Arabidopsis thaliana genome. Nat Genet (1999) 2.39

Overlapping genomic sequences: a treasure trove of single-nucleotide polymorphisms. Genome Res (1998) 2.24

Detecting patterns in protein sequences. J Mol Biol (1994) 2.16

Gene discovery by EST sequencing in Toxoplasma gondii reveals sequences restricted to the Apicomplexa. Genome Res (1998) 2.14

World Association for the Advancement of Veterinary Parasitology (W.A.A.V.P.) guidelines for evaluating the efficacy of parasiticides for the treatment, prevention and control of flea and tick infestation on dogs and cats. Vet Parasitol (2006) 2.09

Multipoint linkage analysis in neurofibromatosis type I: an international collaboration. Am J Hum Genet (1989) 2.04

Interaction of cAMP receptor protein with the ompA gene, a gene for a major outer membrane protein of Escherichia coli. FEBS Lett (1981) 1.94

Oligonucleotide probe for detection and identification of Campylobacter pylori. J Clin Microbiol (1989) 1.90

The nucleotide sequence of Saccharomyces cerevisiae chromosome XII. Nature (1997) 1.89

Evidence that exogenous and endogenous fractalkine can induce spinal nociceptive facilitation in rats. Eur J Neurosci (2004) 1.89

Idiopathic mouth ulcers in sheep, cattle and horses. Vet Rec (2001) 1.80

Representation of cloned genomic sequences in two sequencing vectors: correlation of DNA sequence and subclone distribution. Nucleic Acids Res (1997) 1.78

The novel endocannabinoid receptor GPR55 is activated by atypical cannabinoids but does not mediate their vasodilator effects. Br J Pharmacol (2007) 1.74

Sequence-tagged site (STS) content mapping of human chromosomes: theoretical considerations and early experiences. PCR Methods Appl (1991) 1.70

Identification of candidate coding region single nucleotide polymorphisms in 165 human genes using assembled expressed sequence tags. Genome Res (1999) 1.70

Scan of human genome reveals no new Loci under ancient balancing selection. Genetics (2006) 1.68

Thermosensitive H1 plasmids determining citrate utilization. J Gen Microbiol (1978) 1.62

Control of early neurogenesis of the Drosophila brain by the head gap genes tll, otd, ems, and btd. Dev Biol (1997) 1.57

Expressed sequence tag analysis of the bradyzoite stage of Toxoplasma gondii: identification of developmentally regulated genes. Infect Immun (1998) 1.55

Automated sequence preprocessing in a large-scale sequencing environment. Genome Res (1998) 1.53

A trace display and editing program for data from fluorescence based sequencing machines. Nucleic Acids Res (1991) 1.50

AIDS-related cholangiopancreatographic changes. Abdom Imaging (1994) 1.46

Amnioinfusion in very early preterm prelabor rupture of membranes (AMIPROM): pregnancy, neonatal and maternal outcomes in a randomized controlled pilot study. Ultrasound Obstet Gynecol (2014) 1.45

Thermosensitive antibiotic resistance plasmids in enterobacteria. J Gen Microbiol (1978) 1.40

Intestinal obstruction with hemp bedding. Vet Rec (1996) 1.39

Expressed sequence tags--ESTablishing bridges between genomes. Trends Genet (1998) 1.38

An initial investigation of spinal mechanisms underlying pain enhancement induced by fractalkine, a neuronally released chemokine. Eur J Neurosci (2005) 1.36

Possible benefit of intravenous immunoglobulin therapy in a lung transplant recipient with West Nile virus encephalitis. Transpl Infect Dis (2002) 1.34

A polycistronic mRNA specified by the coronavirus infectious bronchitis virus. Virology (1991) 1.32

Regional and physical mapping studies characterizing the Greig polysyndactyly 3;7 chromosome translocation, t(3;7)(p21.1;p13). Genomics (1989) 1.27

Expression of bacteriophage T7 RNA polymerase in avian and mammalian cells by a recombinant fowlpox virus. J Gen Virol (1996) 1.27

Sequence assembly with CAFTOOLS. Genome Res (1998) 1.26

Molecular confirmation of bacillus Calmette-Guérin as the cause of pulmonary infection following urinary tract instillation. Clin Infect Dis (1993) 1.26

Regional localization of the autosomal dominant polycystic kidney disease locus. Genomics (1988) 1.26

Early neurogenesis of the Drosophila brain. J Comp Neurol (1996) 1.24

Two inhibitors of pro-inflammatory cytokine release, interleukin-10 and interleukin-4, have contrasting effects on release of soluble p75 tumor necrosis factor receptor by cultured monocytes. Eur J Immunol (1994) 1.21

Symptom expectation after minor head injury. A comparative study between Canada and Lithuania. Clin Neurol Neurosurg (2001) 1.21

The 131-amino-acid repeat region of the essential 39-kilodalton core protein of fowlpox virus FP9, equivalent to vaccinia virus A4L protein, is nonessential and highly immunogenic. J Virol (1998) 1.14

Investigation of the factor VIII intron 22 repeated region (int22h) and the associated inversion junctions. Hum Mol Genet (1995) 1.13

The homozygous complete hydatidiform mole: a unique resource for genome studies. Genomics (1997) 1.13

The role of fractalkine in the recruitment of monocytes to the endothelium. Eur J Pharmacol (2000) 1.11

Maternal depression and medication exposure during pregnancy: comparison of maternal retrospective recall to prospective documentation. BJOG (2008) 1.10

Selecting breast cancer patients for chemotherapy: the opening of the UK OPTIMA trial. Clin Oncol (R Coll Radiol) (2012) 1.10

Highly polymorphic RFLP probes as diagnostic tools. Cold Spring Harb Symp Quant Biol (1986) 1.09

Forensic DNA tests and hardy-weinberg equilibrium. Science (1991) 1.09

The construction and analysis of M13 libraries prepared from YAC DNA. Nucleic Acids Res (1995) 1.07

Identification and characterization of 23 RFLP loci by screening random cosmid genomic clones. Am J Hum Genet (1989) 1.07

Genomic DNA sequencing methods. Methods Cell Biol (1995) 1.06

'That's the problem with living in a small town': privacy and sexual health issues for young rural people. Aust J Rural Health (1997) 1.06

Criterion for the completeness of large-scale physical maps of DNA. Cold Spring Harb Symp Quant Biol (1993) 1.05

Linkage studies with chromosome 17 DNA markers in 45 neurofibromatosis 1 families. Genomics (1987) 1.05

Effort testing in patients with fibromyalgia and disability incentives. J Rheumatol (2001) 1.05

Differences in the ability of human T-cell lymphotropic virus type 1 (HTLV-1) and HTLV-2 tax to inhibit p53 function. J Virol (2000) 1.04

Body image satisfaction, dieting beliefs, and weight loss behaviors in adolescent girls and boys. J Youth Adolesc (1991) 1.04

Reinforcement of vocal correlates of auditory hallucinations by auditory feedback: a case study. Br J Psychiatry (1981) 1.03

A 12 megabase restriction map at the cystic fibrosis locus. Nucleic Acids Res (1989) 1.01

Production of cytokines, vascular endothelial growth factor, matrix metalloproteinases, and tissue inhibitor of metalloproteinases 1 by tenosynovium demonstrates its potential for tendon destruction in rheumatoid arthritis. Arthritis Rheum (2001) 1.01

Obstetric skills drills: evaluation of teaching methods. Nurse Educ Today (2007) 0.99

Hemiparasitic plant impacts animal and plant communities across four trophic levels. Ecology (2015) 0.99

Complex regulatory region mediating tailless expression in early embryonic patterning and brain development. Development (1997) 0.98

S100A8 triggers oxidation-sensitive repulsion of neutrophils. J Dent Res (2006) 0.98

Reactive carbonyl formation by oxidative and non-oxidative pathways. Front Biosci (2001) 0.98

Single nucleotide polymorphism hunting in cyberspace. Hum Mutat (1998) 0.98

Vagus nerve stimulation for epilepsy: randomized comparison of three stimulation paradigms. Neurology (2005) 0.98

Learning for a multicultural society. Br J Gen Pract (1996) 0.97

Chromosomal localization of ZFX--a human gene that escapes X inactivation--and its murine homologs. Genomics (1990) 0.97

Genetic linkage map of 46 DNA markers on human chromosome 16. Proc Natl Acad Sci U S A (1990) 0.97

Identifying the impact of diabetes research. Diabet Med (2006) 0.97

Sequence of human glucose-6-phosphate dehydrogenase cloned in plasmids and a yeast artificial chromosome. Genomics (1991) 0.97

Lane tracking software for four-color fluorescence-based electrophoretic gel images. Genome Res (1996) 0.96