Empirical statistical estimates for sequence similarity searches.

PubWeight™: 4.14‹?› | Rank: Top 1%

🔗 View Article (PMID 9514730)

Published in J Mol Biol on February 13, 1998

Authors

W R Pearson1

Author Affiliations

1: Department of Biochemistry, University of Virginia, Charlottesville 22908, USA.

Articles citing this

Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res (1998) 23.87

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res (2001) 22.33

Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs. Genome Res (2004) 10.40

Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci (2000) 6.23

The Transporter Classification Database: recent advances. Nucleic Acids Res (2008) 4.26

RSEARCH: finding homologs of single structured RNA sequences. BMC Bioinformatics (2003) 4.14

Sensitivity and selectivity in protein structure comparison. Protein Sci (2004) 3.69

The estimation of statistical parameters for local alignment score distributions. Nucleic Acids Res (2001) 3.61

Composition-based statistics and translated nucleotide searches: improving the TBLASTN module of BLAST. BMC Biol (2006) 2.90

Gain and loss of multiple genes during the evolution of Helicobacter pylori. PLoS Genet (2005) 2.61

Simple fold composition and modular architecture of the nuclear pore complex. Proc Natl Acad Sci U S A (2006) 2.35

Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins. Nucleic Acids Res (2000) 2.17

Quantifying the relationships among drug classes. J Chem Inf Model (2008) 1.92

The limits of protein sequence comparison? Curr Opin Struct Biol (2005) 1.57

Pseudo-messenger RNA: phantoms of the transcriptome. PLoS Genet (2006) 1.46

A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery. Bioinformatics (2009) 1.46

Statistical calibration of the SEQUEST XCorr function. J Proteome Res (2009) 1.43

Identifying mechanism-of-action targets for drugs and probes. Proc Natl Acad Sci U S A (2012) 1.36

KEGG OC: a large-scale automatic construction of taxonomy-based ortholog clusters. Nucleic Acids Res (2012) 1.36

Database searching by flexible protein structure alignment. Protein Sci (2004) 1.34

Hepatitis C virus core protein is a dimeric alpha-helical protein exhibiting membrane protein features. J Virol (2005) 1.34

Kappa-alpha plot derived structural alphabet and BLOSUM-like substitution matrix for rapid search of protein structure database. Genome Biol (2007) 1.30

Comparison of human solute carriers. Protein Sci (2010) 1.29

Accurate statistical model of comparison between multiple sequence alignments. Nucleic Acids Res (2008) 1.28

Chlorite dismutases, DyPs, and EfeB: 3 microbial heme enzyme families comprise the CDE structural superfamily. J Mol Biol (2011) 1.27

BALSA: Bayesian algorithm for local sequence alignment. Nucleic Acids Res (2002) 1.27

Bioinformatic analyses of transmembrane transport: novel software for deducing protein phylogeny, topology, and evolution. J Mol Microbiol Biotechnol (2009) 1.21

Whole-genome analysis of transporters in the plant pathogen Xylella fastidiosa. Microbiol Mol Biol Rev (2002) 1.15

Distribution of introns in fungal histone genes. PLoS One (2011) 1.09

Evaluating synteny for improved comparative studies. Bioinformatics (2014) 1.03

Composition-modified matrices improve identification of homologs of saccharomyces cerevisiae low-complexity glycoproteins. Eukaryot Cell (2006) 1.02

DNA repair and recombination in higher plants: insights from comparative genomics of Arabidopsis and rice. BMC Genomics (2010) 1.00

JACOP: a simple and robust method for the automated classification of protein sequences with modular architecture. BMC Bioinformatics (2005) 0.98

Prediction and evaluation of protein farnesyltransferase inhibition by commercial drugs. J Med Chem (2010) 0.96

The first crystal structure of class III superoxide reductase from Treponema pallidum. J Biol Inorg Chem (2006) 0.95

Position weight matrix, gibbs sampler, and the associated significance tests in motif characterization and prediction. Scientifica (Cairo) (2012) 0.92

A resource for transcriptomic analysis in the mouse brain. PLoS One (2008) 0.91

Evolutionary bases of carbohydrate recognition and substrate discrimination in the ROK protein family. J Mol Evol (2010) 0.90

The transporter-opsin-G protein-coupled receptor (TOG) superfamily. FEBS J (2013) 0.88

Island method for estimating the statistical significance of profile-profile alignment scores. BMC Bioinformatics (2009) 0.87

Origin and fate of pseudogenes in Hemiascomycetes: a comparative analysis. BMC Genomics (2010) 0.87

Where does the alignment score distribution shape come from? Evol Bioinform Online (2010) 0.86

Conserved determinants for membrane association of nonstructural protein 5A from hepatitis C virus and related viruses. J Virol (2006) 0.86

Pairwise statistical significance of local sequence alignment using multiple parameter sets and empirical justification of parameter set change penalty. BMC Bioinformatics (2009) 0.85

Statistical distributions of optimal global alignment scores of random protein sequences. BMC Bioinformatics (2005) 0.85

Evolution of biological sequences implies an extreme value distribution of type I for both global and local pairwise alignment scores. BMC Bioinformatics (2008) 0.84

BSSF: a fingerprint based ultrafast binding site similarity search and function analysis server. BMC Bioinformatics (2010) 0.84

Dynamic use of multiple parameter sets in sequence alignment. Nucleic Acids Res (2006) 0.84

A simple derivation of the distribution of pairwise local protein sequence alignment scores. Evol Bioinform Online (2008) 0.83

Activation of Src and transformation by an RPTPα splice mutant found in human tumours. EMBO J (2011) 0.83

Use of residue pairs in protein sequence-sequence and sequence-structure alignments. Protein Sci (2000) 0.82

Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level. Bioinformatics (2015) 0.79

The distance-profile representation and its application to detection of distantly related protein families. BMC Bioinformatics (2005) 0.77

The intein of the Thermoplasma A-ATPase A subunit: structure, evolution and expression in E. coli. BMC Biochem (2001) 0.77

Improving chemical similarity ensemble approach in target prediction. J Cheminform (2016) 0.76

ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes. Bioinformatics (2010) 0.75

Inhibition of Mycobacterium-RmlA by Molecular Modeling, Dynamics Simulation, and Docking. Adv Bioinformatics (2016) 0.75

Accelerating pairwise statistical significance estimation for local alignment by harvesting GPU's power. BMC Bioinformatics (2012) 0.75

Extraction of tentative mobile introns in fungal histone genes. Mob Genet Elements (2011) 0.75

Molecular Evolutionary Constraints that Determine the Avirulence State of Clostridium botulinum C2 Toxin. J Mol Evol (2017) 0.75

Identification of a specific agonist of human TAS2R14 from Radix Bupleuri through virtual screening, functional evaluation and binding studies. Sci Rep (2017) 0.75

Articles by these authors

Comparison of methods for searching protein sequence databases. Protein Sci (1995) 4.29