Towards a semi-automatic functional annotation tool based on decision-tree techniques.

PubWeight™: 0.84‹?›

🔗 View Article (PMC 2654970)

Published in BMC Proc on December 17, 2008

Authors

Jérôme Azé1, Lucie Gentils1, Claire Toffano-Nioche1, Valentin Loux2, Jean-François Gibrat2, Philippe Bessières2, Céline Rouveirol3, Anne Poupon4, Christine Froidevaux1

Author Affiliations

1: LRI - CNRS UMR 8623 - University Paris-Sud 11, F-91405 Orsay Cedex, France.
2: INRA, Unité Mathématique, Informatique et Génome UR1077, F-78352 Jouy-en-Josas, France.
3: LIPN - UMR CNRS 7030 - Institut Galilée - University Paris-Nord, F-93430 Villetaneuse, France.
4: IBBMC - CNRS UMR 8619 - University Paris-Sud 11, F-91405 Orsay Cedex, France.

Articles cited by this

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet (2000) 336.52

The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res (2000) 67.44

InterProScan--an integration platform for the signature-recognition methods in InterPro. Bioinformatics (2001) 28.35

The Universal Protein Resource (UniProt). Nucleic Acids Res (2005) 23.66

The Sequence Ontology: a tool for the unification of genome annotations. Genome Biol (2005) 18.20

Automated annotation of microbial proteomes in SWISS-PROT. Comput Biol Chem (2003) 13.44

AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system. Nucleic Acids Res (2006) 10.39

A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae). Proc Natl Acad Sci U S A (2003) 6.19

Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT. Bioinformatics (2001) 4.35

The complete genome sequence of the meat-borne lactic acid bacterium Lactobacillus sakei 23K. Nat Biotechnol (2005) 3.48

The complete genome sequence of Lactobacillus bulgaricus reveals extensive and ongoing reductive evolution. Proc Natl Acad Sci U S A (2006) 3.11

SubtiList: the reference database for the Bacillus subtilis genome. Nucleic Acids Res (2002) 2.99

Hierarchical multi-label prediction of gene function. Bioinformatics (2006) 2.62

GOPET: a tool for automated predictions of Gene Ontology terms. BMC Bioinformatics (2006) 1.75

Probabilistic annotation of protein sequences based on functional classifications. BMC Bioinformatics (2005) 1.12

Machine learning of functional class from phenotype data. Bioinformatics (2002) 0.94

Beyond the 'best' match: machine learning annotation of protein sequences by integration of different sources of information. Bioinformatics (2008) 0.86