String kernels for protein sequence comparisons: improved fold recognition.

PubWeight™: 0.75‹?›

🔗 View Article (PMID 28245816)

Published in BMC Bioinformatics on February 28, 2017

Authors

Saghi Nojoomi1, Patrice Koehl2

Author Affiliations

1: Biotechnology program, University of California, Davis, 1, Shields Avenue, Davis, CA, 95616, USA.
2: Department of Computer Science and Genome Center, 1, Shields Avenue, Davis, CA, 95616, USA. pakoehl@ucdavis.edu.

Articles cited by this

Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A (1988) 193.60

The Protein Data Bank. Nucleic Acids Res (2000) 187.10

A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol (1970) 155.96

Identification of common molecular subsequences. J Mol Biol (1981) 130.53

The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res (2000) 67.44

Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A (1992) 61.33

The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci (1992) 44.38

PatternHunter: faster and more sensitive homology search. Bioinformatics (2002) 35.65

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol (2011) 28.61

Construction of phylogenetic trees. Science (1967) 23.69

UniProt: a hub for protein information. Nucleic Acids Res (2014) 16.72

Phylogenies from molecular sequences: inference and reliability. Annu Rev Genet (1988) 15.73

Twilight zone of protein sequence alignments. Protein Eng (1999) 9.83

Macromolecular modeling with rosetta. Annu Rev Biochem (2008) 8.32

Alignment-free sequence comparison-a review. Bioinformatics (2003) 6.29

A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol Biol Evol (1994) 5.26

Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching. Comput Chem (1996) 5.16

Alignment uncertainty and genomic analysis. Science (2008) 4.09

Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures. J Mol Biol (2005) 4.02

Mismatch string kernels for discriminative protein classification. Bioinformatics (2004) 3.27

Support vector machines and kernels for computational biology. PLoS Comput Biol (2008) 3.16

Structural similarity of DNA-binding domains of bacteriophage repressors and the globin core. Curr Biol (1993) 2.53

A discriminative framework for detecting remote protein homologies. J Comput Biol (2000) 2.30

Protein homology detection using string alignment kernels. Bioinformatics (2004) 2.13

Multiple sequence alignments. Curr Opin Struct Biol (2005) 2.05

Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. J Comput Biol (2003) 2.04

CATH: comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res (2014) 2.03

A novel approach to predicting protein structural classes in a (20-1)-D amino acid composition space. Proteins (1995) 1.84

The average common substring approach to phylogenomic reconstruction. J Comput Biol (2006) 1.79

Intrinsically disordered proteins and intrinsically disordered protein regions. Annu Rev Biochem (2014) 1.79

The protein folding problem: when will it be solved? Curr Opin Struct Biol (2007) 1.70

RASE: recognition of alternatively spliced exons in C.elegans. Bioinformatics (2005) 1.65

Next-generation phylogenomics. Biol Direct (2013) 1.57

Accounting for alignment uncertainty in phylogenomics. PLoS One (2012) 1.35

Freeing phylogenies from artifacts of alignment. Mol Biol Evol (1992) 1.24

Fast alignment-free sequence comparison using spaced-word frequencies. Bioinformatics (2014) 1.13

Is multiple-sequence alignment required for accurate inference of phylogeny? Syst Biol (2007) 1.12

Approximate p-values for local sequence alignments: numerical studies. J Comput Biol (2001) 1.11

Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis. Brief Bioinform (2013) 1.06

Alignment-free phylogenetics and population genetics. Brief Bioinform (2013) 0.97

Using grey dynamic modeling and pseudo amino acid composition to predict protein structural classes. J Comput Chem (2008) 0.95

PHYSEAN: PHYsical SEquence ANalysis for the identification of protein domains on the basis of physical and chemical properties of amino acids. Bioinformatics (1999) 0.95

Optimizing amino acid substitution matrices with a local alignment kernel. BMC Bioinformatics (2006) 0.94

Optimization of a new score function for the detection of remote homologs. Proteins (2000) 0.94

Inferring phylogenies of evolving sequences without multiple sequence alignment. Sci Rep (2014) 0.94

Structural alphabets for protein structure classification: a comparison study. J Mol Biol (2008) 0.90

Multiple sequence alignment modeling: methods and applications. Brief Bioinform (2015) 0.89

Amino acid substitution matrices. Adv Protein Chem (2000) 0.87

Editorial: Alignment-free methods in computational biology. Brief Bioinform (2014) 0.84

Pattern recognition and probabilistic measures in alignment-free sequence analysis. Brief Bioinform (2013) 0.83

3D representations of amino acids-applications to protein sequence comparison and classification. Comput Struct Biotechnol J (2014) 0.77

Phylogenetic Tree Estimation With and Without Alignment: New Distance Methods and Benchmarking. Syst Biol (2016) 0.75