Protein ranking: from local to global structure in the protein similarity network.

PubWeight™: 1.67‹?› | Rank: Top 3%

🔗 View Article (PMC 404084)

Published in Proc Natl Acad Sci U S A on April 15, 2004

Authors

Jason Weston1, Andre Elisseeff, Dengyong Zhou, Christina S Leslie, William Stafford Noble

Author Affiliations

1: NEC Laboratories America, 4 Independence Way, Princeton, NJ 08540, USA.

Articles citing this

Finding friends and enemies in an enemies-only network: a graph diffusion kernel for predicting novel genetic interactions and co-complex membership from yeast genetic interactions. Genome Res (2008) 1.61

"Guilt by association" is the exception rather than the rule in gene networks. PLoS Comput Biol (2012) 1.45

The role of indirect connections in gene networks in predicting function. Bioinformatics (2011) 1.10

Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis. Mol Genet Genomics (2015) 1.00

Efficient computation of k-Nearest Neighbour Graphs for large high-dimensional data sets on GPU clusters. PLoS One (2013) 0.98

Family classification without domain chaining. Bioinformatics (2009) 0.96

RIDDLE: reflective diffusion and local extension reveal functional associations for unannotated gene sets via proximity in a gene network. Genome Biol (2012) 0.95

Protein ranking by semi-supervised network propagation. BMC Bioinformatics (2006) 0.95

RANKPROP: a web server for protein remote homology detection. Bioinformatics (2008) 0.90

A pluralistic account of homology: adapting the models to the data. Mol Biol Evol (2013) 0.89

Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis. BMC Bioinformatics (2011) 0.89

A new method to improve network topological similarity search: applied to fold recognition. Bioinformatics (2015) 0.86

Systematic differences in signal emitting and receiving revealed by PageRank analysis of a human protein interactome. PLoS One (2012) 0.85

Functional enrichment analyses and construction of functional similarity networks with high confidence function prediction by PFP. BMC Bioinformatics (2010) 0.85

Fast k-NNG construction with GPU-based quick multi-select. PLoS One (2014) 0.84

Detecting remote evolutionary relationships among proteins by large-scale semantic embedding. PLoS Comput Biol (2011) 0.82

Labeling nodes using three degrees of propagation. PLoS One (2012) 0.82

A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models. BMC Bioinformatics (2011) 0.80

Adaptive diffusion kernel learning from biological networks for protein function prediction. BMC Bioinformatics (2008) 0.79

Massive fungal biodiversity data re-annotation with multi-level clustering. Sci Rep (2014) 0.79

Optimal scaling of digital transcriptomes. PLoS One (2013) 0.77

Identifying problematic drugs based on the characteristics of their targets. Front Pharmacol (2015) 0.75

Heat-passing framework for robust interpretation of data in networks. PLoS One (2015) 0.75

Learning virulent proteins from integrated query networks. BMC Bioinformatics (2012) 0.75

Finding biomarkers in non-model species: literature mining of transcription factors involved in bovine embryo development. BioData Min (2012) 0.75

Community detection in sequence similarity networks based on attribute clustering. PLoS One (2017) 0.75

Network propagation: a universal amplifier of genetic associations. Nat Rev Genet (2017) 0.75

Articles cited by this

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 665.31

Basic local alignment search tool. J Mol Biol (1990) 659.07

Identification of common molecular subsequences. J Mol Biol (1981) 130.53

The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology (1982) 79.76

SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol (1995) 74.88

Hidden Markov models in computational biology. Applications to protein modeling. J Mol Biol (1994) 31.57

A global geometric framework for nonlinear dimensionality reduction. Science (2000) 23.62

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res (2001) 22.33

Nonlinear dimensionality reduction by locally linear embedding. Science (2000) 19.41

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks. Proc Natl Acad Sci U S A (1994) 18.46

Rapid and sensitive sequence comparison with FASTP and FASTA. Methods Enzymol (1990) 17.64

Profile analysis. Methods Enzymol (1990) 9.40

Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. J Mol Biol (1998) 9.09

Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching. Comput Chem (1996) 5.16

Hidden Markov models for sequence analysis: extension and analysis of the basic method. Comput Appl Biosci (1996) 4.58

Intermediate sequences increase the detection of homology between sequences. J Mol Biol (1997) 2.98

ProtoMap: automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space. Proteins (1999) 2.04

Observation of phase transitions in spreading activation networks. Science (1987) 1.43

Articles by these authors

Assessing computational tools for the discovery of transcription factor binding sites. Nat Biotechnol (2005) 14.29

Quantifying similarity between motifs. Genome Biol (2007) 9.27

Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat Methods (2007) 8.94

FIMO: scanning for occurrences of a given motif. Bioinformatics (2011) 8.89

Searching for statistically significant regulatory modules. Bioinformatics (2003) 5.72

Assigning significance to peptides identified by tandem mass spectrometry using decoy databases. J Proteome Res (2007) 5.57

Matrix2png: a utility for visualizing matrix data. Bioinformatics (2003) 5.31

CSF-1R inhibition alters macrophage polarization and blocks glioma progression. Nat Med (2013) 5.04

Nucleosome positioning signals in genomic DNA. Genome Res (2007) 4.99

The spectrum kernel: a string kernel for SVM protein classification. Pac Symp Biocomput (2002) 4.90

Unsupervised pattern discovery in human chromatin structure through genomic segmentation. Nat Methods (2012) 4.89

Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries. Anal Chem (2006) 4.16

Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res (2012) 3.80

Transfection of small RNAs globally perturbs gene regulation by endogenous microRNAs. Nat Biotechnol (2009) 3.66

A statistical framework for genomic data fusion. Bioinformatics (2004) 3.64

The Forkhead transcription factor Hcm1 regulates chromosome segregation genes and fills the S-phase gap in the transcriptional circuitry of the cell cycle. Genes Dev (2006) 3.58

Transmembrane topology and signal peptide prediction using dynamic bayesian networks. PLoS Comput Biol (2008) 3.56

Kernel methods for predicting protein-protein interactions. Bioinformatics (2005) 3.52

Exploring gene expression data with class scores. Pac Symp Biocomput (2002) 3.51

Target mRNA abundance dilutes microRNA and siRNA activity. Mol Syst Biol (2010) 3.37

Mismatch string kernels for discriminative protein classification. Bioinformatics (2004) 3.27

Learning gene functional classifications from multiple data types. J Comput Biol (2002) 2.65

A stability based method for discovering structure in clustered data. Pac Symp Biocomput (2002) 2.62

Posterior error probabilities and false discovery rates: two sides of the same coin. J Proteome Res (2007) 2.61

Transcriptome-wide miR-155 binding map reveals widespread noncanonical microRNA targeting. Mol Cell (2012) 2.54

The effect of replication on gene expression microarray experiments. Bioinformatics (2003) 2.49

Sequence information for the splicing of human pre-mRNA identified by support vector machine classification. Genome Res (2003) 2.42

Genome-wide RNA-mediated interference screen identifies miR-19 targets in Notch-induced T-cell acute lymphoblastic leukaemia. Nat Cell Biol (2010) 2.40

Choosing negative examples for the prediction of protein-protein interactions. BMC Bioinformatics (2006) 2.36

Sequence and chromatin determinants of cell-type-specific transcription factor binding. Genome Res (2012) 2.24

A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: support vector machine classification of peptide MS/MS spectra and SEQUEST scores. J Proteome Res (2003) 2.16

Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. J Comput Biol (2003) 2.04

Large-scale identification of yeast integral membrane protein interactions. Proc Natl Acad Sci U S A (2005) 2.03

A cooperative microRNA-tumor suppressor gene network in acute T-cell lymphoblastic leukemia (T-ALL). Nat Genet (2011) 1.99

Peptide charge state determination for low-resolution tandem mass spectra. Proc IEEE Comput Syst Bioinform Conf (2005) 1.97

Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data. J Proteome Res (2010) 1.91

Use of shotgun proteomics for the identification, confirmation, and correction of C. elegans gene annotations. Genome Res (2008) 1.90

Learning to predict protein-protein interactions from protein sequences. Bioinformatics (2003) 1.84

Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets. J Proteome Res (2009) 1.80

Ubiquitously transcribed genes use alternative polyadenylation to achieve tissue-specific expression. Genes Dev (2013) 1.74

Predicting human nucleosome occupancy from primary sequence. PLoS Comput Biol (2008) 1.66

Support vector machine classification on the web. Bioinformatics (2004) 1.65

High resolution models of transcription factor-DNA affinities improve in vitro and in vivo binding predictions. PLoS Comput Biol (2010) 1.52

Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification. Bioinformatics (2008) 1.48

Epigenetic priors for identifying active transcription factor binding sites. Bioinformatics (2011) 1.47

Non-parametric estimation of posterior error probabilities associated with peptides identified by tandem mass spectrometry. Bioinformatics (2008) 1.47

Predicting co-complexed protein pairs from heterogeneous data. PLoS Comput Biol (2008) 1.44

Statistical calibration of the SEQUEST XCorr function. J Proteome Res (2009) 1.43

Dichotomous splicing signals in exon flanks. Genome Res (2005) 1.42

Semi-supervised protein classification using cluster kernels. Bioinformatics (2005) 1.42

Ranking predicted protein structures with support vector regression. Proteins (2008) 1.41

QVALITY: non-parametric estimation of q-values and posterior error probabilities. Bioinformatics (2009) 1.38

Inferring transcriptional and microRNA-mediated regulatory programs in glioblastoma. Mol Syst Biol (2012) 1.36

Faster SEQUEST searching for peptide identification from tandem mass spectra. J Proteome Res (2011) 1.33

Computational and experimental identification of mirtrons in Drosophila melanogaster and Caenorhabditis elegans. Genome Res (2010) 1.31

Support vector machine learning from heterogeneous data: an empirical analysis using protein sequence and structure. Bioinformatics (2006) 1.27

Improving tandem mass spectrum identification using peptide retention time prediction across diverse chromatography conditions. Anal Chem (2007) 1.19

Riboproteomics of the hepatitis C virus internal ribosomal entry site. J Proteome Res (2004) 1.19

Consistent probabilistic outputs for protein function prediction. Genome Biol (2008) 1.17

Computational searches for splicing signals. Methods (2005) 1.15

On the assessment of statistical significance of three-dimensional colocalization of sets of genomic elements. Nucleic Acids Res (2012) 1.13

Improved similarity scores for comparing motifs. Bioinformatics (2011) 1.10

Detecting cross-linked peptides by searching against a database of cross-linked peptide pairs. J Proteome Res (2010) 1.09

Crux: rapid open source protein tandem mass spectrometry analysis. J Proteome Res (2014) 1.06

The Genomedata format for storing large-scale functional genomics data. Bioinformatics (2010) 1.05

Protein backbone angle prediction with machine learning approaches. Bioinformatics (2004) 1.04

On using samples of known protein content to assess the statistical calibration of scores assigned to peptide-spectrum matches in shotgun proteomics. J Proteome Res (2011) 1.03

Exploratory analysis of genomic segmentations with Segtools. BMC Bioinformatics (2011) 1.02

Learning kernels from biological networks by maximizing entropy. Bioinformatics (2004) 1.01

Automated mapping of large-scale chromatin structure in ENCODE. Bioinformatics (2008) 0.99

Cooperative control of tumor suppressor genes by a network of oncogenic microRNAs. Cell Cycle (2011) 0.99

Estimating relative abundances of proteins from shotgun proteomics data. BMC Bioinformatics (2012) 0.98

Kernel hierarchical gene clustering from microarray expression data. Bioinformatics (2003) 0.98

Direct maximization of protein identifications from tandem mass spectra. Mol Cell Proteomics (2011) 0.98

A structural alignment kernel for protein structures. Bioinformatics (2007) 0.97

Assessing phylogenetic motif models for predicting transcription factor binding sites. Bioinformatics (2009) 0.95

Protein ranking by semi-supervised network propagation. BMC Bioinformatics (2006) 0.95

A thermodynamic approach to PCR primer design. Nucleic Acids Res (2009) 0.94

RANKPROP: a web server for protein remote homology detection. Bioinformatics (2008) 0.90

Motif-based protein ranking by network propagation. Bioinformatics (2005) 0.90

Improved network-based identification of protein orthologs. Bioinformatics (2008) 0.88

On the importance of well-calibrated scores for identifying shotgun proteomics spectra. J Proteome Res (2014) 0.88

Multiple functional categories of proteins identified in an in vitro cellular ubiquitin affinity extract using shotgun peptide sequencing. J Proteome Res (2003) 0.87

Global profiling of stimulus-induced polyadenylation in cells using a poly(A) trap. Nat Chem Biol (2013) 0.87

Faster mass spectrometry-based protein inference: junction trees are more efficient than sampling and marginalization by enumeration. IEEE/ACM Trans Comput Biol Bioinform (2012) 0.86

Learning a weighted sequence model of the nucleosome core and linker yields more accurate predictions in Saccharomyces cerevisiae and Homo sapiens. PLoS Comput Biol (2010) 0.86

A cross-validation scheme for machine learning algorithms in shotgun proteomics. BMC Bioinformatics (2012) 0.85

A unified multitask architecture for predicting local protein properties. PLoS One (2012) 0.84

A learned comparative expression measure for affymetrix genechip DNA microarrays. Proc IEEE Comput Syst Bioinform Conf (2005) 0.83

Combining classifiers for improved classification of proteins from sequence or structure. BMC Bioinformatics (2008) 0.83

Protein family classification using sparse markov transducers. J Comput Biol (2003) 0.82

Multiple dimensions of epigenetic gene regulation in the malaria parasite Plasmodium falciparum: gene regulation via histone modifications, nucleosome positioning and nuclear architecture in P. falciparum. Bioessays (2014) 0.82

Detecting remote evolutionary relationships among proteins by large-scale semantic embedding. PLoS Comput Biol (2011) 0.82

Efficient identification of DNA hybridization partners in a sequence database. Bioinformatics (2006) 0.81

Learning score function parameters for improved spectrum identification in tandem mass spectrometry experiments. J Proteome Res (2012) 0.77

Using substitution matrices to estimate probability distributions for biological sequences. J Comput Biol (2002) 0.76

Data hoarding is harming proteomics. Nat Biotechnol (2004) 0.75

Automated validation of polymerase chain reaction amplicon melting curves. J Bioinform Comput Biol (2006) 0.75