Basic local alignment search tool.

PubWeight™: 659.07 | Rank: Top 0.01% | All-Time Top 10

PMID: 2231712

Published in J Mol Biol on October 05, 1990


S F Altschul1, W Gish, W Miller, E W Myers, D J Lipman

Author Affiliations

1: National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894.

Articles citing this

(truncated to the top 100)

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res (1997) 142.55

BLAT--the BLAST-like alignment tool. Genome Res (2002) 126.78

RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res (2007) 85.81

Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A (1992) 61.33

Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol (2006) 57.72

Fiji: an open-source platform for biological-image analysis. Nat Methods (2012) 56.92

Human non-synonymous SNPs: server and survey. Nucleic Acids Res (2002) 50.45

CAP3: A DNA sequence assembly program. Genome Res (1999) 50.04

Versatile and open software for comparing large genomes. Genome Biol (2004) 49.45

SSAHA: a fast search method for large DNA databases. Genome Res (2001) 48.64

NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res (2006) 48.10

RefSeq and LocusLink: NCBI gene-centered resources. Nucleic Acids Res (2001) 45.29

Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics (2010) 42.80

The Pfam protein families database. Nucleic Acids Res (2000) 42.28

Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol (2010) 39.63

Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC Bioinformatics (2008) 39.08

NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res (2005) 37.39

BLAST+: architecture and applications. BMC Bioinformatics (2009) 36.53

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2000) 34.79

Human MicroRNA targets. PLoS Biol (2004) 34.51

ZOOM! Zillions of oligos mapped. Bioinformatics (2008) 29.78

KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res (2007) 29.46

The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics (2002) 29.41

Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics (2012) 26.90

NCBI Reference Sequences: current status, policy and new initiatives. Nucleic Acids Res (2008) 26.04

SWISS-MODEL: An automated protein homology-modeling server. Nucleic Acids Res (2003) 25.86

Large-scale analysis of the human and mouse transcriptomes. Proc Natl Acad Sci U S A (2002) 25.47

MEGAN analysis of metagenomic data. Genome Res (2007) 25.29

bold: The Barcode of Life Data System. Mol Ecol Notes (2007) 25.13

Hypoxia-inducible factor 1 is a basic-helix-loop-helix-PAS heterodimer regulated by cellular O2 tension. Proc Natl Acad Sci U S A (1995) 24.77

Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics (2005) 24.54

GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res (2001) 24.24

Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res (1998) 23.87

NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes. Nucleic Acids Res (2006) 23.67

The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol (2007) 23.58

LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res (2003) 23.03

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2005) 22.98

A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res (1998) 22.69

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2007) 22.53

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res (2001) 22.33

The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res (2005) 21.68

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2008) 21.36

PyCogent: a toolkit for making sense from sequence. Genome Biol (2007) 20.64

CloudBurst: highly sensitive read mapping with MapReduce. Bioinformatics (2009) 20.45

SGD: Saccharomyces Genome Database. Nucleic Acids Res (1998) 20.26

Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc Natl Acad Sci U S A (2010) 19.48

Database resources of the National Center for Biotechnology Information: 2002 update. Nucleic Acids Res (2002) 19.40

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2001) 19.13

Phylogeny.fr: robust phylogenetic analysis for the non-specialist. Nucleic Acids Res (2008) 19.09

Development of the human infant intestinal microbiota. PLoS Biol (2007) 18.91

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2006) 18.85

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2006) 18.84

Toward an online repository of Standard Operating Procedures (SOPs) for (meta)genomic annotation. OMICS (2008) 18.69

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks. Proc Natl Acad Sci U S A (1994) 18.46

Database resources of the National Center for Biotechnology. Nucleic Acids Res (2003) 18.26

The Sequence Ontology: a tool for the unification of genome annotations. Genome Biol (2005) 18.20

A survey of sequence alignment algorithms for next-generation sequencing. Brief Bioinform (2010) 18.05

NCBI GEO: mining millions of expression profiles--database and tools. Nucleic Acids Res (2005) 17.61

PipMaker--a web server for aligning two genomic DNA sequences. Genome Res (2000) 17.46

Fast algorithms for large-scale genome alignment and comparison. Nucleic Acids Res (2002) 17.31

SeqAn an efficient, generic C++ library for sequence analysis. BMC Bioinformatics (2008) 17.31

Type III protein secretion systems in bacterial pathogens of animals and plants. Microbiol Mol Biol Rev (1998) 17.27

PyNAST: a flexible tool for aligning sequences to a template alignment. Bioinformatics (2009) 15.93

The EMBL nucleotide sequence database. Nucleic Acids Res (2000) 15.57

BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res (2004) 15.43

The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res (2012) 15.41

The phusion assembler. Genome Res (2003) 15.25

Accelerated Profile HMM Searches. PLoS Comput Biol (2011) 15.22

NCBI BLAST: a better web interface. Nucleic Acids Res (2008) 15.14

CARPEL FACTORY, a Dicer homolog, and HEN1, a novel protein, act in microRNA metabolism in Arabidopsis thaliana. Curr Biol (2002) 15.13

Development of human protein reference database as an initial platform for approaching systems biology in humans. Genome Res (2003) 14.79

RNA sequence analysis using covariance models. Nucleic Acids Res (1994) 14.60

NetAffx: Affymetrix probesets and annotations. Nucleic Acids Res (2003) 14.37

SCOP: a structural classification of proteins database. Nucleic Acids Res (2000) 14.14

Slider--maximum use of probability information for alignment of short sequence reads and SNP detection. Bioinformatics (2008) 14.04

The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families. PLoS Biol (2007) 13.99

Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. Proc Natl Acad Sci U S A (2002) 13.83

High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res (2008) 13.51

ProbeMatch: rapid alignment of oligonucleotides to genome allowing both gaps and mismatches. Bioinformatics (2009) 13.34

The Jpred 3 secondary structure prediction server. Nucleic Acids Res (2008) 13.32

BASys: a web server for automated bacterial genome annotation. Nucleic Acids Res (2005) 13.30

Ironing out the wrinkles in the rare biosphere through improved OTU clustering. Environ Microbiol (2010) 13.19

HMMER web server: interactive sequence similarity searching. Nucleic Acids Res (2011) 13.00

Nucleotide sequence of the Kaposi sarcoma-associated herpesvirus (HHV8). Proc Natl Acad Sci U S A (1996) 12.89

DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis. BMC Bioinformatics (2007) 12.76

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2009) 12.51

The Ensembl automatic gene annotation system. Genome Res (2004) 12.24

Complete genome sequence of Methanobacterium thermoautotrophicum deltaH: functional analysis and comparative genomics. J Bacteriol (1997) 12.21

Applications and statistics for multiple high-scoring segments in molecular sequences. Proc Natl Acad Sci U S A (1993) 12.10

The EMBL Nucleotide Sequence Database. Nucleic Acids Res (2002) 12.05

Genome annotation assessment in Drosophila melanogaster. Genome Res (2000) 11.77

HGBASE: a database of SNPs and other variations in and around human genes. Nucleic Acids Res (2000) 11.75

Personalized copy number and segmental duplication maps using next-generation sequencing. Nat Genet (2009) 11.73

Widespread RNA and DNA sequence differences in the human transcriptome. Science (2011) 11.45

Embedding strategies for effective use of information from multiple sequence alignments. Protein Sci (1997) 11.25

Novel division level bacterial diversity in a Yellowstone hot spring. J Bacteriol (1998) 11.22

The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation. Nucleic Acids Res (2006) 11.15

Annotating large genomes with exact word matches. Genome Res (2003) 11.07

PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res (2009) 11.05

Articles by these authors

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A (1988) 193.60

The sequence of the human genome. Science (2001) 101.55

Rapid and sensitive protein similarity searches. Science (1985) 76.83

The genome sequence of Drosophila melanogaster. Science (2000) 74.32

Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci U S A (1983) 53.12

A genomic perspective on protein families. Science (1997) 50.51

A greedy algorithm for aligning DNA sequences. J Comput Biol (2000) 47.89

Optimal alignments in linear space. Comput Appl Biosci (1988) 38.10

GenBank. Nucleic Acids Res (2000) 36.75

Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res (1998) 23.87

Amino acid substitution matrices from an information theoretic perspective. J Mol Biol (1991) 23.38

The UCSC Genome Browser Database: 2008 update. Nucleic Acids Res (2007) 23.13

A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res (1998) 22.69

GenBank. Nucleic Acids Res (1999) 21.47

Aligning two sequences within a specified diagonal band. Comput Appl Biosci (1992) 19.31

Issues in searching molecular sequence databases. Nat Genet (1994) 19.28

On the statistical significance of nucleic acid similarities. Nucleic Acids Res (1984) 18.21

Local alignment statistics. Methods Enzymol (1996) 17.76

PipMaker--a web server for aligning two genomic DNA sequences. Genome Res (2000) 17.46

A workbench for multiple alignment construction and analysis. Proteins (1991) 16.96

Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature (2001) 16.89

Comparative analyses of multi-species sequences from targeted genomic regions. Nature (2003) 13.31

Weights for data related by a tree. J Mol Biol (1989) 12.63

GenBank. Nucleic Acids Res (1997) 11.73

A protein alignment scoring system sensitive at all evolutionary distances. J Mol Evol (1993) 10.53

Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science (2000) 10.14

GenBank. Nucleic Acids Res (1998) 9.36

GenBank. Nucleic Acids Res (1993) 9.06

Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. Genome Res (1997) 8.49

Scoring pairwise genomic sequence alignments. Pac Symp Biocomput (2002) 8.42

Generation and analysis of 280,000 human expressed sequence tags. Genome Res (1996) 8.27

Comparison of DNA sequences with protein sequences. Genomics (1997) 7.76

GenBank. Nucleic Acids Res (1996) 7.06

Mapping sequenced E.coli genes by computer: software, strategies and examples. Nucleic Acids Res (1991) 6.29

Combined use of sequence similarity and codon bias for coding region identification. J Comput Biol (1994) 6.25

GenBank. Nucleic Acids Res (1994) 5.63

Human whole-genome shotgun sequencing. Genome Res (1997) 5.59

Protein database searches for multiple alignments. Proc Natl Acad Sci U S A (1990) 5.52

Large-scale comparative sequence analysis of the human and murine Bruton's tyrosine kinase loci reveals conserved regulatory domains. Genome Res (1997) 4.82

Alignment of Escherichia coli K12 DNA sequences to a genomic restriction map. Nucleic Acids Res (1990) 4.07

A controlled trial of a formalin-inactivated hepatitis A vaccine in healthy children. N Engl J Med (1992) 4.05

Comparative sequence analysis of a gene-rich cluster at human chromosome 12p13 and its syntenic region in mouse chromosome 6. Genome Res (1998) 3.80

Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. Gene (1997) 3.77

The genome reconstruction manager: a software environment for supporting high-throughput DNA sequencing. Genomics (1994) 3.69

MaskerAid: a performance enhancement to RepeatMasker. Bioinformatics (2000) 3.66

The natural history of recurrent herpes simplex labialis: implications for antiviral therapy. N Engl J Med (1977) 3.61

A local alignment tool for very long DNA sequences. Comput Appl Biosci (1995) 3.58

Use of arsenic trioxide (As2O3) in the treatment of acute promyelocytic leukemia (APL): I. As2O3 exerts dose-dependent dual effects on APL cells. Blood (1997) 3.56

Design of a compartmentalized shotgun assembler for the human genome. Bioinformatics (2001) 3.55

Lineage-specific loss and divergence of functionally linked genes in eukaryotes. Proc Natl Acad Sci U S A (2000) 3.27

MPBLAST: improved BLAST performance with multiplexed queries. Bioinformatics (2000) 3.18

Extracting protein alignment models from the sequence database. Nucleic Acids Res (1997) 3.17

Post-processing long pairwise alignments. Bioinformatics (1999) 3.00

A space-efficient algorithm for local similarities. Comput Appl Biosci (1990) 2.98

Modulation of the two promoters of the galactose operon of Escherichia coli. Nature (1979) 2.91

Comparative analysis of the gene-dense ACHE/TFR2 region on human chromosome 7q22 with the orthologous region on mouse chromosome 5. Nucleic Acids Res (2001) 2.89

A comparison of filtered leukocyte-reduced and cytomegalovirus (CMV) seronegative blood products for the prevention of transfusion-associated CMV infection after marrow transplant. Blood (1995) 2.83

Polycystic ovaries treated by laparoscopic laser vaporization. Fertil Steril (1989) 2.77

Analysis of the quality and utility of random shotgun sequencing at low redundancies. Genome Res (1998) 2.74

Correlation of the highest-energy cosmic rays with nearby extragalactic objects. Science (2007) 2.57

Metabolism of (14C) cefaclor, a cephalosporin antibiotic, in three species of laboratory animals. Antimicrob Agents Chemother (1976) 2.52

Contextual constraints on synonymous codon choice. J Mol Biol (1983) 2.47

Comparative genome analysis delimits a chromosomal domain and identifies key regulatory elements in the alpha globin cluster. Hum Mol Genet (2001) 2.46

Treatment of moderate/severe acute graft-versus-host disease after allogeneic bone marrow transplantation: an analysis of clinical risk features and outcome. Blood (1990) 2.38

Globin gene server: a prototype E-mail database server featuring extensive multiple alignments and data compilation for electronic genetic analysis. Genomics (1994) 2.36

ReAligner: a program for refining DNA sequence multi-alignments. J Comput Biol (1997) 2.25

Comparison of five methods for finding conserved sequences in multiple alignments of gene regulatory regions. Nucleic Acids Res (1999) 2.19

Equal animals. Nature (1990) 2.09

Comparison of the Escherichia coli K-12 genome with sampled genomes of a Klebsiella pneumoniae and three salmonella enterica serovars, Typhimurium, Typhi and Paratyphi. Nucleic Acids Res (2000) 2.07

Recommendations from an international expert panel on the use of neoadjuvant (primary) systemic treatment of operable breast cancer: new perspectives 2006. Ann Oncol (2007) 2.05

Characterization of the human and mouse unconventional myosin XV genes responsible for hereditary deafness DFNB3 and shaker 2. Genomics (1999) 1.97

A tool for aligning very similar DNA sequences. Comput Appl Biosci (1997) 1.97

Alignments without low-scoring regions. J Comput Biol (1998) 1.96

Comparative sequence of human and mouse BAC clones from the mnd2 region of chromosome 2p13. Genome Res (1999) 1.94

Diabetes mellitus: an underestimated public health program. J Chronic Dis (1975) 1.87

Sequence and comparative analysis of the mouse 1-megabase region orthologous to the human 11p15 imprinted domain. Genome Res (2000) 1.85

Chaining multiple-alignment blocks. J Comput Biol (1994) 1.75

Genome sequence comparisons: hurdles in the fast lane to functional genomics. Brief Bioinform (2000) 1.71

Accuracy of bedside chest hard-copy screen-film versus hard- and soft-copy computed radiographs in a medical intensive care unit: receiver operating characteristic analysis. Radiology (1997) 1.69

Software tools for analyzing pairwise alignments of long sequences. Nucleic Acids Res (1991) 1.68

Approximate matching of regular expressions. Bull Math Biol (1989) 1.68

Positive and negative regulatory elements of the rabbit embryonic epsilon-globin gene revealed by an improved multiple alignment program and functional analysis. DNA Seq (1993) 1.66

The Paced Auditory Serial Addition Task (PASAT): norms for age, education, and ethnicity. Assessment (1998) 1.65

Dynamic programming algorithms for biological sequence comparison. Methods Enzymol (1992) 1.65

Racism, society, and disease: an exploration of the social and biological mechanisms of differential mortality. Int J Health Serv (1981) 1.62

Conserved E boxes function as part of the enhancer in hypersensitive site 2 of the beta-globin locus control region. Role of basic helix-loop-helix proteins. J Biol Chem (1997) 1.58

Improved mortality among U.S. Blacks, 1968-1978: the role of antiracist struggle. Int J Health Serv (1981) 1.53

Recent developments in linear-space alignment methods: a survey. J Comput Biol (1994) 1.49

Measurement of the depth of maximum of extensive air showers above 10^18 eV. Phys Rev Lett (2010) 1.48

A randomized trial comparing prednisone with antithymocyte globulin/prednisone as an initial systemic therapy for moderately severe acute graft-versus-host disease. Biol Blood Marrow Transplant (2000) 1.48

Maternal serum screening for alpha-fetoprotein, unconjugated estriol, and human chorionic gonadotropin between 11 and 15 weeks of pregnancy to detect fetal chromosome abnormalities. Am J Obstet Gynecol (1993) 1.44

Cytomegalovirus infection after bone marrow transplantation: an association with acute graft-v-host disease. Blood (1986) 1.40

UTR reconstruction and analysis using genomically aligned EST sequences. Proc Int Conf Intell Syst Mol Biol (2000) 1.33

Hox cluster genomics in the horn shark, Heterodontus francisci. Proc Natl Acad Sci U S A (2000) 1.32