A top-down approach to classify enzyme functional classes and sub-classes using random forest.

PubWeight™: 0.80‹?›

🔗 View Article (PMC 3351021)

Published in EURASIP J Bioinform Syst Biol on February 29, 2012

Authors

Chetan Kumar1, Alok Choudhary

Author Affiliations

1: Department of Electrical Engineering and Computer Science, Northwestern University, Evanston, IL 60201, USA. chetankumar.iisc@gmail.com.

Articles cited by this

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet (2000) 69.26

Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics (2006) 43.68

The ENZYME database in 2000. Nucleic Acids Res (2000) 23.85

Gene selection and classification of microarray data using random forest. BMC Bioinformatics (2006) 12.45

Predicting enzyme function from sequence: a systematic appraisal. Proc Int Conf Intell Syst Mol Biol (1997) 2.11

Pectin degrading glycoside hydrolases of family 28: sequence-structural features, specificities and evolution. Protein Eng (2001) 1.40

EzyPred: a top-down approach for predicting enzyme functional classes and subclasses. Biochem Biophys Res Commun (2007) 1.22

Predicting enzyme class from protein structure without alignments. J Mol Biol (2005) 1.18

Protein function classification via support vector machine approach. Math Biosci (2003) 1.11

Feature selection and the class imbalance problem in predicting protein function from sequence. Appl Bioinformatics (2005) 1.03

Predicting functional family of novel enzymes irrespective of sequence similarity: a statistical learning approach. Nucleic Acids Res (2004) 1.00

Prediction of enzyme family classes. J Proteome Res (2003) 0.97

Prediction of novel archaeal enzymes from sequence-derived features. Protein Sci (2002) 0.95

Analysis on conservation of disulphide bonds and their structural features in homologous protein domain families. BMC Struct Biol (2008) 0.91

Enzyme function prediction with interpretable models. Methods Mol Biol (2009) 0.81

Prediction of enzymes and non-enzymes from protein sequences based on sequence derived features and PSSM matrix using artificial neural network. Bioinformation (2007) 0.79