Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

PubWeight™: 665.31‹?› | Rank: Top 0.01% | All-Time Top 10

🔗 View Article (PMC 146917)

Published in Nucleic Acids Res on September 01, 1997


S F Altschul1, T L Madden, A A Schäffer, J Zhang, Z Zhang, W Miller, D J Lipman

Author Affiliations

1: National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.

Associated clinical trials:

Treatment of Nonalcoholic Fatty Liver Disease With Probiotics and Prebiotics | NCT00870012

Articles citing this

(truncated to the top 100)

MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res (2004) 168.89

Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res (2008) 157.44

BLAT--the BLAST-like alignment tool. Genome Res (2002) 126.78

The COG database: an updated version includes eukaryotes. BMC Bioinformatics (2003) 60.98

The Bioperl toolkit: Perl modules for the life sciences. Genome Res (2002) 58.63

ARB: a software environment for sequence data. Nucleic Acids Res (2004) 58.27

Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics (2010) 52.01

The Pfam protein families database. Nucleic Acids Res (2002) 51.34

MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics (2004) 50.89

Versatile and open software for comparing large genomes. Genome Biol (2004) 49.45

The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res (2000) 49.22

SSAHA: a fast search method for large DNA databases. Genome Res (2001) 48.64

NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res (2006) 48.10

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res (2002) 47.62

A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci U S A (2004) 44.81

A human gut microbial gene catalogue established by metagenomic sequencing. Nature (2010) 43.63

The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res (2001) 43.17

Microbial diversity in the deep sea and the underexplored "rare biosphere". Proc Natl Acad Sci U S A (2006) 42.38

The Ensembl genome database project. Nucleic Acids Res (2002) 40.87

Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc (2009) 38.62

The Pfam protein families database. Nucleic Acids Res (2009) 37.98

NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res (2005) 37.39

SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci U S A (1998) 36.83

GenBank. Nucleic Acids Res (2000) 36.75

BLAST+: architecture and applications. BMC Bioinformatics (2009) 36.53

Human-mouse alignments with BLASTZ. Genome Res (2003) 35.49

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2000) 34.79

Obesity alters gut microbial ecology. Proc Natl Acad Sci U S A (2005) 33.66

Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc (2009) 32.64

MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res (2005) 31.64

The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res (2013) 29.49

KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res (2007) 29.46

UCHIME improves sensitivity and speed of chimera detection. Bioinformatics (2011) 29.22

The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics (2008) 29.20

Predicting deleterious amino acid substitutions. Genome Res (2001) 28.95

The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res (2008) 27.83

NCBI Reference Sequences: current status, policy and new initiatives. Nucleic Acids Res (2008) 26.04

SWISS-MODEL: An automated protein homology-modeling server. Nucleic Acids Res (2003) 25.86

An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res (2002) 25.81

GenBank. Nucleic Acids Res (2007) 25.54

Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res (2005) 25.49

GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res (1998) 25.21

The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis. Nucleic Acids Res (2005) 24.85

Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics (2005) 24.54

SeqMap: mapping massive amount of oligonucleotides to the genome. Bioinformatics (2008) 24.32

Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res (1998) 23.87

LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res (2003) 23.03

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2005) 22.98

Rfam: an RNA family database. Nucleic Acids Res (2003) 22.93

Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc Natl Acad Sci U S A (1999) 22.80

I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc (2010) 22.66

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2007) 22.53

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res (2001) 22.33

The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res (2005) 21.68

PANTHER: a library of protein families and subfamilies indexed by function. Genome Res (2003) 21.64

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2008) 21.36

TIGRFAMs: a protein family resource for the functional identification of proteins. Nucleic Acids Res (2001) 20.84

PyCogent: a toolkit for making sense from sequence. Genome Biol (2007) 20.64

Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc Natl Acad Sci U S A (2002) 20.48

PIRSF: family classification system at the Protein Information Resource. Nucleic Acids Res (2004) 19.62

Database resources of the National Center for Biotechnology Information: 2002 update. Nucleic Acids Res (2002) 19.40

SMART 4.0: towards genomic data integration. Nucleic Acids Res (2004) 19.37

GenBank. Nucleic Acids Res (2005) 19.25

Ab initio gene finding in Drosophila genomic DNA. Genome Res (2000) 19.23

GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res (2012) 19.19

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2001) 19.13

PHAST: a fast phage search tool. Nucleic Acids Res (2011) 19.09

CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res (2010) 19.07

GenDB--an open source genome annotation system for prokaryote genomes. Nucleic Acids Res (2003) 18.88

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2006) 18.85

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2006) 18.84

InterProScan: protein domains identifier. Nucleic Acids Res (2005) 18.82

CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res (2002) 18.54

I-TASSER server for protein 3D structure prediction. BMC Bioinformatics (2008) 18.28

ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons. Nucleic Acids Res (2000) 18.27

Database resources of the National Center for Biotechnology. Nucleic Acids Res (2003) 18.26

A survey of sequence alignment algorithms for next-generation sequencing. Brief Bioinform (2010) 18.05

Genome sequence of the palaeopolyploid soybean. Nature (2010) 17.82

SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res (2000) 17.77

The yeast proteome database (YPD) and Caenorhabditis elegans proteome database (WormPD): comprehensive resources for the organization and comparison of model organism protein information. Nucleic Acids Res (2000) 17.68

PipMaker--a web server for aligning two genomic DNA sequences. Genome Res (2000) 17.46

ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res (2003) 17.39

GenBank. Nucleic Acids Res (2002) 17.24

Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A (2009) 17.13

GenBank. Nucleic Acids Res (2007) 16.92

CD-Search: protein domain annotations on the fly. Nucleic Acids Res (2004) 16.76

16S ribosomal DNA sequence analysis of a large collection of environmental and clinical unidentifiable bacterial isolates. J Clin Microbiol (2000) 16.27

YPD, PombePD and WormPD: model organism volumes of the BioKnowledge library, an integrated resource for protein information. Nucleic Acids Res (2001) 16.22

BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res (2004) 15.43

At least 1 in 20 16S rRNA sequence records currently held in public repositories is estimated to contain substantial anomalies. Appl Environ Microbiol (2005) 15.41

SNAP predicts effect of mutations on protein function. Bioinformatics (2008) 15.39

Accurate determination of microbial diversity from 454 pyrosequencing data. Nat Methods (2009) 15.25

Accelerated Profile HMM Searches. PLoS Comput Biol (2011) 15.22

DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res (2006) 15.19

A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One (2011) 15.19

NCBI BLAST: a better web interface. Nucleic Acids Res (2008) 15.14

Development of human protein reference database as an initial platform for approaching systems biology in humans. Genome Res (2003) 14.79

The intronerator: exploring introns and alternative splicing in Caenorhabditis elegans. Nucleic Acids Res (2000) 14.77

Cloning of a human parvovirus by molecular screening of respiratory tract samples. Proc Natl Acad Sci U S A (2005) 14.71

Comparative analysis of human gut microbiota by barcoded pyrosequencing. PLoS One (2008) 14.63

Articles cited by this

Basic local alignment search tool. J Mol Biol (1990) 659.07

Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A (1988) 193.60

A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol (1970) 155.96

Identification of common molecular subsequences. J Mol Biol (1981) 130.53

Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A (1992) 61.33

Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci U S A (1983) 53.12

Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science (1996) 41.35

Optimal alignments in linear space. Comput Appl Biosci (1988) 38.10

Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science (1993) 36.84

A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1. Science (1994) 36.53

Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins (1991) 32.50

Improved sensitivity of profile searches through the use of sequence weights and gap excision. Comput Appl Biosci (1994) 31.96

Information content of binding sites on nucleotide sequences. J Mol Biol (1986) 30.48

Profile analysis: detection of distantly related proteins. Proc Natl Acad Sci U S A (1987) 29.26

Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. DNA Res (1996) 28.37

Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci U S A (1990) 24.42

Position-based sequence weights. J Mol Biol (1994) 24.41

Amino acid substitution matrices from an information theoretic perspective. J Mol Biol (1991) 23.38

An improved algorithm for matching biological sequences. J Mol Biol (1982) 21.95

Computer methods to locate signals in nucleic acid sequences. Nucleic Acids Res (1984) 21.53

2.2 Mb of contiguous nucleotide sequence from chromosome III of C. elegans. Nature (1994) 21.30

Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput Appl Biosci (1996) 19.74

Aligning two sequences within a specified diagonal band. Comput Appl Biosci (1992) 19.31

Issues in searching molecular sequence databases. Nat Genet (1994) 19.28

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks. Proc Natl Acad Sci U S A (1994) 18.46

Local alignment statistics. Methods Enzymol (1996) 17.76

Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. J Mol Biol (1987) 17.53

Maximum discrimination hidden Markov models of sequence consensus. J Comput Biol (1995) 15.75

The FHIT gene, spanning the chromosome 3p14.2 fragile site and renal carcinoma-associated t(3;8) breakpoint, is abnormal in digestive tract cancers. Cell (1996) 15.19

A superfamily of conserved domains in DNA damage-responsive cell cycle checkpoint proteins. FASEB J (1997) 15.10

Optimal sequence alignments. Proc Natl Acad Sci U S A (1983) 14.64

Identifying protein-binding sites from unaligned DNA fragments. Proc Natl Acad Sci U S A (1989) 14.45

Complete structure of the hemagglutinin gene from the human influenza A/Victoria/3/75 (H3N2) strain as determined from cloned DNA. Cell (1980) 14.28

Identification of protein sequence homology by consensus template alignment. J Mol Biol (1986) 13.73

A weighting system and algorithm for aligning many phylogenetically related sequences. Comput Appl Biosci (1995) 13.48

A flexible motif search technique based on generalized profiles. Comput Chem (1996) 13.22

Identification of a RING protein that can interact in vivo with the BRCA1 gene product. Nat Genet (1996) 12.96

Weights for data related by a tree. J Mol Biol (1989) 12.63

From BRCA1 to RAP1: a widespread BRCT module closely associated with DNA repair. FEBS Lett (1997) 12.43

The SWISS-PROT protein sequence data bank and its supplement TrEMBL. Nucleic Acids Res (1997) 12.30

Optimal sequence alignment using affine gap costs. Bull Math Biol (1986) 12.15

Applications and statistics for multiple high-scoring segments in molecular sequences. Proc Natl Acad Sci U S A (1993) 12.10

Volume changes in protein evolution. J Mol Biol (1994) 12.07

A workbench for large-scale sequence homology analysis. Comput Appl Biosci (1994) 12.00

The statistical distribution of nucleic acid similarities. Nucleic Acids Res (1985) 11.99

A new algorithm for best subsequence alignments with application to tRNA-rRNA comparisons. J Mol Biol (1987) 11.83

GenBank. Nucleic Acids Res (1997) 11.73

Detecting homology of distantly related proteins with consensus sequences. J Mol Biol (1987) 11.70

Systematic method for the detection of potential lambda Cro-like DNA-binding regions in proteins. J Mol Biol (1987) 11.67

BRCA1 protein products ... Functional motifs... Nat Genet (1996) 11.50

Weighting aligned protein or nucleic acid sequences to correct for unequal representation. J Mol Biol (1990) 11.50

Using substitution probabilities to improve position-specific scoring matrices. Comput Appl Biosci (1996) 11.32

The significance of protein sequence similarities. Comput Appl Biosci (1988) 11.26

Embedding strategies for effective use of information from multiple sequence alignments. Protein Sci (1997) 11.25

Analysis of gene duplication repeats in the myosin rod. J Mol Biol (1983) 11.12

Using Dirichlet mixture priors to derive hidden Markov models for protein families. Proc Int Conf Intell Syst Mol Biol (1993) 10.73

Distribution of glutamine and asparagine residues and their near neighbors in peptides and proteins. Proc Natl Acad Sci U S A (1991) 10.60

A protein alignment scoring system sensitive at all evolutionary distances. J Mol Evol (1993) 10.53

Insertional mutagenesis in zebrafish identifies two novel genes, pescadillo and dead eye, essential for embryonic development. Genes Dev (1996) 10.27

Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain. DNA Res (1996) 10.06

Recognition of related proteins by iterative template refinement (ITR). Protein Sci (1994) 9.67

Isolation, characterization, and inactivation of the APA1 gene encoding yeast diadenosine 5',5'''-P1,P4-tetraphosphate phosphorylase. J Bacteriol (1989) 9.34

Sequence analysis in the E1 region of adenovirus type 4 DNA. Virology (1986) 9.23

The gal locus from Haemophilus influenzae: cloning, sequencing and the use of gal mutants to study lipopolysaccharide. Mol Microbiol (1992) 9.22

New structure--novel fold? Structure (1997) 9.19

Locally optimal subalignments using nonlinear similarity functions. Bull Math Biol (1986) 9.10

The amino acid sequence of leghaemoglobin I from root nodules of broad bean (Vicia faba L.). FEBS Lett (1975) 9.08

[Hemoglobins, XXXIII. Note on the Sequence of the hemoglobins of the horse (author's transl)]. Hoppe Seylers Z Physiol Chem (1980) 9.05

Rat galactose-1-phosphate uridyltransferase coding sequence, transcription start site and genomic organization. DNA Seq (1993) 9.01

Articles by these authors

Basic local alignment search tool. J Mol Biol (1990) 659.07

Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A (1988) 193.60

The sequence of the human genome. Science (2001) 101.55

Rapid and sensitive protein similarity searches. Science (1985) 76.83

Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci U S A (1983) 53.12

A genomic perspective on protein families. Science (1997) 50.51

A greedy algorithm for aligning DNA sequences. J Comput Biol (2000) 47.89

Optimal alignments in linear space. Comput Appl Biosci (1988) 38.10

GenBank. Nucleic Acids Res (2000) 36.75

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2000) 34.79

Comparative genomics of the eukaryotes. Science (2000) 26.62

BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol Lett (1999) 25.40

Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res (1998) 23.87

The UCSC Genome Browser Database: 2008 update. Nucleic Acids Res (2007) 23.13

A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res (1998) 22.69

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res (2001) 22.33

Effects of losartan on renal and cardiovascular outcomes in patients with type 2 diabetes and nephropathy. N Engl J Med (2001) 22.32

GenBank. Nucleic Acids Res (1999) 21.47

Aligning two sequences within a specified diagonal band. Comput Appl Biosci (1992) 19.31

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2001) 19.13

Faster sequential genetic linkage computations. Am J Hum Genet (1993) 18.83

On the statistical significance of nucleic acid similarities. Nucleic Acids Res (1984) 18.21

PipMaker--a web server for aligning two genomic DNA sequences. Genome Res (2000) 17.46

A workbench for multiple alignment construction and analysis. Proteins (1991) 16.96

Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature (2001) 16.89

Comparative analyses of multi-species sequences from targeted genomic regions. Nature (2003) 13.31

Weights for data related by a tree. J Mol Biol (1989) 12.63

Requirement of interleukin 17 receptor signaling for lung CXC chemokine and granulocyte colony-stimulating factor expression, neutrophil recruitment, and host defense. J Exp Med (2001) 12.38

GenBank. Nucleic Acids Res (1997) 11.73

Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science (2000) 10.14

GenBank. Nucleic Acids Res (1998) 9.36

GenBank. Nucleic Acids Res (1993) 9.06

Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. Genome Res (1997) 8.49

Scoring pairwise genomic sequence alignments. Pac Symp Biocomput (2002) 8.42

The vasorelaxant effect of H(2)S as a novel endogenous gaseous K(ATP) channel opener. EMBO J (2001) 7.97

Sox9 is required for cartilage formation. Nat Genet (1999) 7.89

Comparison of DNA sequences with protein sequences. Genomics (1997) 7.76

The pyroptosome: a supramolecular assembly of ASC dimers mediating inflammatory cell death via caspase-1 activation. Cell Death Differ (2007) 7.74

Engineering the provitamin A (beta-carotene) biosynthetic pathway into (carotenoid-free) rice endosperm. Science (2000) 7.53

GenBank. Nucleic Acids Res (1996) 7.06

Therapeutic benefit of intravenous administration of bone marrow stromal cells after cerebral ischemia in rats. Stroke (2001) 6.61

Validity and reliability of a quantitative computed tomography score in predicting outcome of hyperacute stroke before thrombolytic therapy. ASPECTS Study Group. Alberta Stroke Programme Early CT Score. Lancet (2000) 6.53

Embryonic stem cell-derived microvesicles reprogram hematopoietic progenitors: evidence for horizontal transfer of mRNA and protein delivery. Leukemia (2006) 6.31

Mapping sequenced E.coli genes by computer: software, strategies and examples. Nucleic Acids Res (1991) 6.29

A potential vulnerability locus for schizophrenia on chromosome 6p24-22: evidence for genetic heterogeneity. Nat Genet (1995) 5.87

Sequence diversity in CYP3A promoters and characterization of the genetic basis of polymorphic CYP3A5 expression. Nat Genet (2001) 5.80

GenBank. Nucleic Acids Res (1994) 5.63

The BCL-6 proto-oncogene controls germinal-centre formation and Th2-type inflammation. Nat Genet (1997) 5.55

Protein database searches for multiple alignments. Proc Natl Acad Sci U S A (1990) 5.52

Sites of specific B cell activation in primary and secondary responses to T cell-dependent and T cell-independent antigens. Eur J Immunol (1991) 5.38

The receptor for advanced glycation end products (RAGE) is a cellular binding site for amphoterin. Mediation of neurite outgrowth and co-expression of rage and amphoterin in the developing nervous system. J Biol Chem (1995) 5.36

A kinase-cyclin pair in the RNA polymerase II holoenzyme. Nature (1995) 4.93

Why are stroke patients excluded from TPA therapy? An analysis of patient eligibility. Neurology (2001) 4.84

Large-scale comparative sequence analysis of the human and murine Bruton's tyrosine kinase loci reveals conserved regulatory domains. Genome Res (1997) 4.82

HCFA's racial and ethnic data: current accuracy and recent improvements. Health Care Financ Rev (2000) 4.67

Roles of PLC-beta2 and -beta3 and PI3Kgamma in chemoattractant-mediated signal transduction. Science (2000) 4.61

Generation of a prostate epithelial cell-specific Cre transgenic mouse model for tissue-specific gene ablation. Mech Dev (2001) 4.47

CSF tau/Abeta42 ratio for increased risk of mild cognitive impairment: a follow-up study. Neurology (2007) 4.24

Sequence specificity in the dimerization of transmembrane alpha-helices. Biochemistry (1992) 4.19

An essential role in liver development for transcription factor XBP-1. Genes Dev (2000) 4.17

Haploinsufficiency of Sox9 results in defective cartilage primordia and premature skeletal mineralization. Proc Natl Acad Sci U S A (2001) 4.13

Comparative analysis of 1196 orthologous mouse and human full-length mRNA and protein sequences. Genome Res (1996) 4.11

The transcription factors L-Sox5 and Sox6 are essential for cartilage formation. Dev Cell (2001) 4.11

PowerBLAST: a new network BLAST application for interactive or automated sequence analysis and annotation. Genome Res (1997) 4.10

Alignment of Escherichia coli K12 DNA sequences to a genomic restriction map. Nucleic Acids Res (1990) 4.07

A controlled trial of a formalin-inactivated hepatitis A vaccine in healthy children. N Engl J Med (1992) 4.05

The nuclear receptor CAR mediates specific xenobiotic induction of drug metabolism. Nature (2000) 4.00

Mutations in TNFRSF13B encoding TACI are associated with common variable immunodeficiency in humans. Nat Genet (2005) 3.97

Effect of long-term estrogen deprivation on apoptotic responses of breast cancer cells to 17beta-estradiol. J Natl Cancer Inst (2001) 3.88

Mice deficient in BACE1, the Alzheimer's beta-secretase, have normal phenotype and abolished beta-amyloid generation. Nat Neurosci (2001) 3.85

Rapid and highly efficient transduction by double-stranded adeno-associated virus vectors in vitro and in vivo. Gene Ther (2003) 3.85

The pathogenesis and diagnosis of foot-and-mouth disease. J Comp Pathol (2003) 3.83

Comparative sequence analysis of a gene-rich cluster at human chromosome 12p13 and its syntenic region in mouse chromosome 6. Genome Res (1998) 3.80

Insulin-degrading enzyme regulates extracellular levels of amyloid beta-protein by degradation. J Biol Chem (1998) 3.77

Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. Gene (1997) 3.77

Cryopyrin and pyrin activate caspase-1, but not NF-kappaB, via ASC oligomerization. Cell Death Differ (2006) 3.77

Enhanced cellular oxidant stress by the interaction of advanced glycation end products with their receptors/binding proteins. J Biol Chem (1994) 3.73

The correlation between cotransplantation of mesenchymal stem cells and higher recurrence rate in hematologic malignancy patients: outcome of a pilot clinical study. Leukemia (2008) 3.69

Identification of a new catenin: the tyrosine kinase substrate p120cas associates with E-cadherin complexes. Mol Cell Biol (1994) 3.66

Role of miR-143 targeting KRAS in colorectal tumorigenesis. Oncogene (2009) 3.64

Progression of kidney dysfunction in the community-dwelling elderly. Kidney Int (2006) 3.64

The natural history of recurrent herpes simplex labialis: implications for antiviral therapy. N Engl J Med (1977) 3.61

The estimation of statistical parameters for local alignment score distributions. Nucleic Acids Res (2001) 3.61

Dachshund and eyes absent proteins form a complex and function synergistically to induce ectopic eye development in Drosophila. Cell (1997) 3.58

A local alignment tool for very long DNA sequences. Comput Appl Biosci (1995) 3.58

Use of arsenic trioxide (As2O3) in the treatment of acute promyelocytic leukemia (APL): I. As2O3 exerts dose-dependent dual effects on APL cells. Blood (1997) 3.56

Pten regulates neuronal soma size: a mouse model of Lhermitte-Duclos disease. Nat Genet (2001) 3.48

The beta2-adrenergic receptor/betaarrestin complex recruits the clathrin adaptor AP-2 during endocytosis. Proc Natl Acad Sci U S A (1999) 3.43

The knockout of miR-143 and -145 alters smooth muscle cell maintenance and vascular homeostasis in mice: correlates with human disease. Cell Death Differ (2009) 3.43

Identification of an angiogenic mitogen selective for endocrine gland endothelium. Nature (2001) 3.42

Rapid clearance of fetal DNA from maternal plasma. Am J Hum Genet (1999) 3.39