Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae.

PubWeight™: 1.71 | Rank: Top 3%

Article: PMC 535686

Published in Nucleic Acids Res on December 07, 2004

Authors

Yu Chen (1), Dong Xu

Author Affiliations

1: UT-ORNL Graduate School of Genome Science and Technology, Oak Ridge, TN, USA.

Articles citing this

A critical assessment of Mus musculus gene function prediction using integrated genomic evidence. Genome Biol (2008) 4.78

A genomewide functional network for the laboratory mouse. PLoS Comput Biol (2008) 2.25

Information theory applied to the sparse gene ontology annotation network to predict novel gene function. Bioinformatics (2007) 2.15

Predicting gene function in a hierarchical context with an ensemble of classifiers. Genome Biol (2008) 1.78

Inferring mouse gene functions from genomic-scale data using a combined functional network/classification strategy. Genome Biol (2008) 1.54

Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data. BMC Bioinformatics (2007) 1.35

A factor analysis model for functional genomics. BMC Bioinformatics (2006) 1.26

Network-assisted protein identification and data interpretation in shotgun proteomics. Mol Syst Biol (2009) 1.19

Predicting gene function using hierarchical multi-label decision tree ensembles. BMC Bioinformatics (2010) 1.19

Gene function prediction using labeled and unlabeled data. BMC Bioinformatics (2008) 1.19

Predicting eukaryotic transcriptional cooperativity by Bayesian network integration of genome-wide data. Nucleic Acids Res (2009) 1.15

Growing functional modules from a seed protein via integration of protein interaction and gene expression data. BMC Bioinformatics (2007) 1.13

Quantitative assessment of relationship between sequence similarity and function similarity. BMC Genomics (2007) 1.12

Integrative approaches to the prediction of protein functions based on the feature selection. BMC Bioinformatics (2009) 1.01

An in silico method for detecting overlapping functional modules from composite biological networks. BMC Syst Biol (2008) 0.98

A systematic approach to infer biological relevance and biases of gene network structures. Nucleic Acids Res (2006) 0.95

Analysis on multi-domain cooperation for predicting protein-protein interactions. BMC Bioinformatics (2007) 0.95

Functional annotations for the Saccharomyces cerevisiae genome: the knowns and the known unknowns. Trends Microbiol (2009) 0.94

PlasmoDraft: a database of Plasmodium falciparum gene function predictions based on postgenomic data. BMC Bioinformatics (2008) 0.93

Matrix factorization-based data fusion for gene function prediction in baker's yeast and slime mold. Pac Symp Biocomput (2014) 0.90

A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data. Adv Bioinformatics (2015) 0.89

Scoring protein relationships in functional interaction networks predicted from sequence data. PLoS One (2011) 0.88

Genome wide prediction of protein function via a generic knowledge discovery approach based on evidence integration. BMC Bioinformatics (2006) 0.88

Global protein interactome exploration through mining genome-scale data in Arabidopsis thaliana. BMC Genomics (2010) 0.86

Peeling off the hidden genetic heterogeneities of cancers based on disease-relevant functional modules. Mol Med (2006) 0.86

Amino acid metabolic origin as an evolutionary influence on protein sequence in yeast. J Mol Evol (2009) 0.86

Integration of molecular network data reconstructs Gene Ontology. Bioinformatics (2014) 0.85

Protein function assignment through mining cross-species protein-protein interactions. PLoS One (2008) 0.85

An algorithm for finding biologically significant features in microarray data based on a priori manifold learning. PLoS One (2014) 0.84

Proteome-wide discovery of mislocated proteins in cancer. Genome Res (2013) 0.83

Using biological networks to improve our understanding of infectious diseases. Comput Struct Biotechnol J (2014) 0.83

M-BISON: microarray-based integration of data sources using networks. BMC Bioinformatics (2008) 0.81

Gene function hypotheses for the Campylobacter jejuni glycome generated by a logic-based approach. J Mol Biol (2012) 0.80

A novel method to identify cooperative functional modules: study of module coordination in the Saccharomyces cerevisiae cell cycle. BMC Bioinformatics (2011) 0.79

Gene divergence and pathway duplication in the metabolic network of yeast and digital organisms. J R Soc Interface (2009) 0.79

On the detection of functionally coherent groups of protein domains with an extension to protein annotation. BMC Bioinformatics (2007) 0.78

Using multi-instance hierarchical clustering learning system to predict yeast gene function. PLoS One (2014) 0.77

Identifying Significant Features in Cancer Methylation Data Using Gene Pathway Segmentation. Cancer Inform (2016) 0.75

Articles cited by this

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet (2000) 336.52

Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A (1988) 193.60

Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A (1998) 192.97

A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature (2000) 47.15

Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature (2002) 45.19

Life with 6000 genes. Science (1996) 41.51

Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature (2002) 37.66

Genomic expression programs in the response of yeast cells to environmental changes. Mol Biol Cell (2000) 36.09

Global analysis of protein localization in budding yeast. Nature (2003) 33.22

Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc Natl Acad Sci U S A (1999) 22.80

Knowledge-based analysis of microarray gene expression data by using support vector machines. Proc Natl Acad Sci U S A (2000) 19.39

The two-hybrid system: a method to identify and clone genes for proteins that interact with a protein of interest. Proc Natl Acad Sci U S A (1991) 15.52

A Bayesian networks approach for predicting protein-protein interactions from genomic data. Science (2003) 12.07

MIPS: a database for genomes and protein sequences. Nucleic Acids Res (2002) 11.98

Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. Proc Natl Acad Sci U S A (2000) 10.98

A network of protein-protein interactions in yeast. Nat Biotechnol (2000) 10.31

A combined algorithm for genome-wide prediction of protein function. Nature (1999) 9.99

Global protein function prediction from protein-protein interaction networks. Nat Biotechnol (2003) 6.28

A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae). Proc Natl Acad Sci U S A (2003) 6.19

PSORT-B: Improving protein subcellular localization prediction for Gram-negative bacteria. Nucleic Acids Res (2003) 4.25

Predicting protein function from protein/protein interaction data: a probabilistic approach. Bioinformatics (2003) 3.24

Whole-genome annotation by using evidence integration in functional-linkage networks. Proc Natl Acad Sci U S A (2004) 2.93

Assessment of prediction accuracy of protein function from protein-protein interaction data. Yeast (2001) 2.90

Predicting protein complex membership using probabilistic network reliability. Genome Res (2004) 1.79

The constraints protein-protein interactions place on sequence divergence. J Mol Biol (2002) 1.46

Predicting subcellular localization via protein motif co-occurrence. Genome Res (2004) 1.36

Predicting gene function in Saccharomyces cerevisiae. Bioinformatics (2003) 1.32

Understanding protein dispensability through machine-learning analysis of high-throughput data. Bioinformatics (2004) 1.24

Computational analyses of high-throughput protein-protein interaction data. Curr Protein Pept Sci (2003) 1.07

Cellular function prediction and biological pathway discovery in Arabidopsis thaliana using microarray data. Conf Proc IEEE Eng Med Biol Soc (2004) 0.85

Articles by these authors

(truncated to the top 100)

Genome sequence of the palaeopolyploid soybean. Nature (2010) 17.82

A critical assessment of Mus musculus gene function prediction using integrated genomic evidence. Genome Biol (2008) 4.78

Long-term monitoring shows hepatitis B virus resistance to entecavir in nucleoside-naïve patients is rare through 5 years of therapy. Hepatology (2009) 4.00

Enhanced computer vision with Microsoft Kinect sensor: a review. IEEE Trans Cybern (2013) 3.75

Slug antagonizes p53-mediated apoptosis of hematopoietic progenitors by repressing puma. Cell (2005) 3.73

Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born. J Chem Theory Comput (2012) 3.40

Ultradeep bisulfite sequencing analysis of DNA methylation patterns in multiple gene promoters by 454 sequencing. Cancer Res (2007) 3.40

Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans Pattern Anal Mach Intell (2007) 3.06

Transcriptome dynamics of Deinococcus radiodurans recovering from ionizing radiation. Proc Natl Acad Sci U S A (2003) 2.52

Inferring gene regulatory networks from multiple microarray datasets. Bioinformatics (2006) 2.49

B lymphocyte stimulator overexpression in patients with systemic lupus erythematosus: longitudinal observations. Arthritis Rheum (2003) 2.47

An integrated transcriptome atlas of the crop model Glycine max, and its use in comparative analyses in plants. Plant J (2010) 2.41

Visualization of nitric oxide in living cells by a copper-based fluorescent probe. Nat Chem Biol (2006) 2.33

Local production of B lymphocyte stimulator protein and APRIL in arthritic joints of patients with inflammatory arthritis. Arthritis Rheum (2003) 2.21

Ensemble-based virtual screening reveals potential novel antiviral compounds for avian influenza neuraminidase. J Med Chem (2008) 2.14

Understanding the unique characteristics of suicide in China: national psychological autopsy study. Biomed Environ Sci (2005) 1.90

Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees. Bioinformatics (2002) 1.81

Microarray analysis of chitin elicitation in Arabidopsis thaliana. Mol Plant Pathol (2002) 1.80

Genome-wide DNA methylation analysis reveals novel epigenetic changes in chronic lymphocytic leukemia. Epigenetics (2012) 1.80

Epidermal growth factor-like domain 7 protects endothelial cells from hyperoxia-induced cell death. Am J Physiol Lung Cell Mol Physiol (2007) 1.75

Evaluation of 50-mer oligonucleotide arrays for detecting microbial populations in environmental samples. Biotechniques (2004) 1.74

Epigenome-wide inheritance of cytosine methylation variants in a recombinant inbred population. Genome Res (2013) 1.72

Photoaffinity isolation and identification of proteins in cancer cell extracts that bind to platinum-modified DNA. Chembiochem (2009) 1.71

Efficacy of entecavir in chronic hepatitis B patients with mildly elevated alanine aminotransferase and biopsy-proven histological damage. Hepatology (2010) 1.61

Long-term treatment with entecavir induces reversal of advanced fibrosis or cirrhosis in patients with chronic hepatitis B. Clin Gastroenterol Hepatol (2010) 1.60

Musite, a tool for global prediction of general and kinase-specific phosphorylation sites. Mol Cell Proteomics (2010) 1.55

Comparisons among two fertile and three male-sterile mitochondrial genomes of maize. Genetics (2007) 1.53

No supernovae associated with two long-duration gamma-ray bursts. Nature (2006) 1.47

CUBIC: identification of regulatory binding sites through data clustering. J Bioinform Comput Biol (2003) 1.47

catena-Poly[[bis(thiocyanato-kappa N)cadmium(II)]-di-mu-thiourea-kappa(4)S:S]. Acta Crystallogr C (2002) 1.45

SNP discovery by high-throughput sequencing in soybean. BMC Genomics (2010) 1.45

Preoperative rosuvastatin protects patients with coronary artery disease undergoing noncardiac surgery. Cardiology (2015) 1.43

Single feature polymorphism discovery in rice. PLoS One (2007) 1.42

MUFOLD: A new solution for protein 3D structure prediction. Proteins (2010) 1.42

SoyDB: a knowledge database of soybean transcription factors. BMC Plant Biol (2010) 1.40

Characterizing loop dynamics and ligand recognition in human- and avian-type influenza neuraminidases via generalized born molecular dynamics and end-point free energy calculations. J Am Chem Soc (2009) 1.40

Prediction of novel miRNAs and associated target genes in Glycine max. BMC Bioinformatics (2010) 1.39

PRIMEGENS-v2: genome-wide primer design for analyzing DNA methylation patterns of CpG islands. Bioinformatics (2008) 1.35

Computational identification of protein methylation sites through bi-profile Bayes feature extraction. PLoS One (2009) 1.34

BAFF overexpression and accelerated glomerular disease in mice with an incomplete genetic predisposition to systemic lupus erythematosus. Arthritis Rheum (2005) 1.33

P3DB: a plant protein phosphorylation database. Nucleic Acids Res (2008) 1.31

Legume transcription factor genes: what makes legumes so special? Plant Physiol (2009) 1.30

Quantitative relationship between synonymous codon usage bias and GC composition across unicellular genomes. BMC Evol Biol (2004) 1.30

Soybean Knowledge Base (SoyKB): a web resource for soybean translational genomics. BMC Genomics (2012) 1.29

Transcriptional and physiological responses of Bradyrhizobium japonicum to desiccation-induced stress. J Bacteriol (2007) 1.29

Proteomic analysis of soybean root hairs after infection by Bradyrhizobium japonicum. Mol Plant Microbe Interact (2005) 1.27

Genomic and genetic evidence for the loss of umami taste in bats. Genome Biol Evol (2011) 1.27

Using Internet search engines to obtain medical information: a comparative study. J Med Internet Res (2012) 1.25

Genome-scale gene function prediction using multiple sources of high-throughput data in yeast Saccharomyces cerevisiae. OMICS (2004) 1.25

Understanding protein dispensability through machine-learning analysis of high-throughput data. Bioinformatics (2004) 1.24

A protocol for computer-based protein structure and function prediction. J Vis Exp (2011) 1.23

Root hair systems biology. Trends Plant Sci (2010) 1.21

Genome-scale probe and primer design with PRIMEGENS. Methods Mol Biol (2007) 1.20

A genome-wide association study in Han Chinese identifies a susceptibility locus for primary Sjögren's syndrome at 7q11.23. Nat Genet (2013) 1.19

Mosaic: making biological sense of complex networks. Bioinformatics (2012) 1.19

Homolog-specific PCR primer design for profiling splice variants. Nucleic Acids Res (2011) 1.19

PROSPECT II: protein structure prediction program for genome-scale applications. Protein Eng (2003) 1.19

Visual event recognition in videos by learning from Web data. IEEE Trans Pattern Anal Mach Intell (2012) 1.18

An oligonucleotide microarray resource for transcriptional profiling of Bradyrhizobium japonicum. Mol Plant Microbe Interact (2007) 1.17

Regulation of cellular oncosis by uncoupling protein 2. J Biol Chem (2002) 1.17

Contrast-enhanced whole-heart coronary MRA at 3.0T for the evaluation of cardiac venous anatomy. Int J Cardiovasc Imaging (2010) 1.17

ProteinDBS: a real-time retrieval system for protein structure comparison. Nucleic Acids Res (2004) 1.17

Identification of miRNA from Porphyra yezoensis by high-throughput sequencing and bioinformatics analysis. PLoS One (2010) 1.15

Targeted bisulfite sequencing by solution hybrid selection and massively parallel sequencing. Nucleic Acids Res (2011) 1.15

Establishment of a protein reference map for soybean root hair cells. Plant Physiol (2008) 1.14

Automated Tongue Feature Extraction for ZHENG Classification in Traditional Chinese Medicine. Evid Based Complement Alternat Med (2012) 1.12

Improving the performance of DomainParser for structural domain partition using neural network. Nucleic Acids Res (2003) 1.12

Quantitative assessment of relationship between sequence similarity and function similarity. BMC Genomics (2007) 1.12

Transcription profiling of soybean nodulation by Bradyrhizobium japonicum. Mol Plant Microbe Interact (2008) 1.11

Targeting oncogenic miR-335 inhibits growth and invasion of malignant astrocytoma cells. Mol Cancer (2011) 1.11

Multilinear discriminant analysis for face recognition. IEEE Trans Image Process (2007) 1.11

Clinical features and prognosis in adult-onset Still's disease: a study of 104 cases. Clin Rheumatol (2010) 1.08

Soybean knowledge base (SoyKB): a web resource for integration of soybean translational genomics and molecular breeding. Nucleic Acids Res (2013) 1.08

A computational method for assessing peptide-identification reliability in tandem mass spectrometry analysis with SEQUEST. Proteomics (2004) 1.08

Systems analysis of seed filling in Arabidopsis: using general linear modeling to assess concordance of transcript and protein expression. Plant Physiol (2010) 1.07

Computational analyses of high-throughput protein-protein interaction data. Curr Protein Pept Sci (2003) 1.07

Inhibition of MDR1 expression with altritol-modified siRNAs. Nucleic Acids Res (2007) 1.06

PDCD5 interacts with Tip60 and functions as a cooperator in acetyltransferase activity and DNA damage-induced apoptosis. Neoplasia (2009) 1.06

Genome-wide DNA methylation maps in follicular lymphoma cells determined by methylation-enriched bisulfite sequencing. PLoS One (2010) 1.06

ThreaDom: extracting protein domain boundary information from multiple threading alignments. Bioinformatics (2013) 1.05

Phylogenetic analysis using complete signature information of whole genomes and clustered Neighbour-Joining method. Int J Bioinform Res Appl (2006) 1.05

Identification of genes and pathways involved in kidney renal clear cell carcinoma. BMC Bioinformatics (2014) 1.05

A link between SIN1 (MAPKAP1) and poly(rC) binding protein 2 (PCBP2) in counteracting environmental stress. Proc Natl Acad Sci U S A (2008) 1.04

Phosphoproteomic analysis of seed maturation in Arabidopsis, rapeseed, and soybean. Plant Physiol (2012) 1.04

Whole genome co-expression analysis of soybean cytochrome P450 genes identifies nodulation-specific P450 monooxygenases. BMC Plant Biol (2010) 1.04

Quantitative phosphoproteomic analysis of soybean root hairs inoculated with Bradyrhizobium japonicum. Mol Cell Proteomics (2012) 1.03

Domain adaptation from multiple sources: a domain-dependent regularization approach. IEEE Trans Neural Netw Learn Syst (2012) 1.03

Image clustering using local discriminant models and global integration. IEEE Trans Image Process (2010) 1.03

Atypical N-terminal extensions confer novel regulatory properties on GTP cyclohydrolase isoforms in Drosophila melanogaster. J Biol Chem (2006) 1.02

Bioinformatics and its applications in plant biology. Annu Rev Plant Biol (2006) 1.02

Correlation between posttranslational modification and intrinsic disorder in protein. Pac Symp Biocomput (2012) 1.02

MUFOLD-WQA: A new selective consensus method for quality assessment in protein structure prediction. Proteins (2011) 1.01

Domain transfer multiple kernel learning. IEEE Trans Pattern Anal Mach Intell (2012) 0.99

Large-scale analysis of putative soybean regulatory gene expression identifies a Myb gene involved in soybean nodule development. Plant Physiol (2009) 0.99

B lymphocyte stimulator protein-associated increase in circulating autoantibody levels may require CD4+ T cells: lessons from HIV-infected patients. Clin Immunol (2002) 0.98

A computational pipeline for protein structure prediction and analysis at genome scale. Bioinformatics (2003) 0.98

Mechanism of glycan receptor recognition and specificity switch for avian, swine, and human adapted influenza virus hemagglutinins: a molecular dynamics perspective. J Am Chem Soc (2009) 0.97

Numerical study of evaluating the optical quality of supersonic flow fields. Appl Opt (2007) 0.97

A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction. Sci Rep (2013) 0.97

Early biochemical response to ursodeoxycholic acid and long-term prognosis of primary biliary cirrhosis: results of a 14-year cohort study. Hepatology (2013) 0.96