Published in Genetica on January 01, 2000
The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res (2001) 43.17
OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res (2003) 33.03
MaGe: a microbial genome annotation system supported by synteny results. Nucleic Acids Res (2006) 12.48
Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs). Genome Biol (2000) 6.50
Expanded microbial genome coverage and improved protein family annotation in the COG database. Nucleic Acids Res (2014) 2.37
Comparative analysis of the Borrelia garinii genome. Nucleic Acids Res (2004) 1.96
Sodium ion cycle in bacterial pathogens: evidence from cross-genome comparisons. Microbiol Mol Biol Rev (2001) 1.95
Two C or not two C: recurrent disruption of Zn-ribbons, gene duplication, lineage-specific gene loss, and horizontal gene transfer in evolution of bacterial ribosomal proteins. Genome Biol (2001) 1.94
Evolutionary genomics of lactic acid bacteria. J Bacteriol (2006) 1.84
Genomic determinants of sporulation in Bacilli and Clostridia: towards the minimal set of sporulation-specific genes. Environ Microbiol (2012) 1.59
Congruent evolution of different classes of non-coding DNA in prokaryotic genomes. Nucleic Acids Res (2002) 1.54
Re-annotation of genome microbial coding-sequences: finding new genes and inaccurately annotated genes. BMC Bioinformatics (2002) 1.33
Natural selection of more designable folds: a mechanism for thermophilic adaptation. Proc Natl Acad Sci U S A (2003) 1.24
Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution. J R Soc Interface (2008) 1.24
Large gene overlaps in prokaryotic genomes: result of functional constraints or mispredictions? BMC Genomics (2008) 1.22
Conserved 'hypothetical' proteins: new hints and new puzzles. Comp Funct Genomics (2001) 1.22
PhyloPat: phylogenetic pattern analysis of eukaryotic genes. BMC Bioinformatics (2006) 1.19
A pluralistic account of homology: adapting the models to the data. Mol Biol Evol (2013) 0.89
Complete Genomes of Bacillus coagulans S-lac and Bacillus subtilis TO-A JPC, Two Phylogenetically Distinct Probiotics. PLoS One (2016) 0.87
The Seventh International Conference on the Genetics of Streptococci, Lactococci, and Enterococci. J Bacteriol (2006) 0.85
Comparative genomic analysis of Helicobacter pylori from Malaysia identifies three distinct lineages suggestive of differential evolution. Nucleic Acids Res (2014) 0.84
PhyloPat: an updated version of the phylogenetic pattern database contains gene neighborhood. Nucleic Acids Res (2008) 0.83
Genome sequence of Lactobacillus salivarius SMXD51, a potential probiotic strain isolated from chicken cecum, showing anti-campylobacter activity. J Bacteriol (2012) 0.81
Genome-wide study of the defective sucrose fermenter strain of Vibrio cholerae from the Latin American cholera epidemic. PLoS One (2012) 0.81
Global changes in gene expression by the opportunistic pathogen Burkholderia cenocepacia in response to internalization by murine macrophages. BMC Genomics (2012) 0.80
PairWise Neighbours database: overlaps and spacers among prokaryote genomes. BMC Genomics (2009) 0.79
iTRAQ-based analysis of developmental dynamics in the soybean leaf proteome reveals pathways associated with leaf photosynthetic rate. Mol Genet Genomics (2016) 0.79
Re-annotation of protein-coding genes in 10 complete genomes of Neisseriaceae family by combining similarity-based and composition-based methods. DNA Res (2013) 0.78
Adaptation of the short intergenic spacers between co-directional genes to the Shine-Dalgarno motif among prokaryote genomes. BMC Genomics (2009) 0.78
The mysterious orphans of Mycoplasmataceae. Biol Direct (2016) 0.77
The dam replacing gene product enhances Neisseria gonorrhoeae FA1090 viability and biofilm formation. Front Microbiol (2014) 0.77
Environmental Pressure May Change the Composition Protein Disorder in Prokaryotes. PLoS One (2015) 0.76
Type III Methyltransferase M.NgoAX from Neisseria gonorrhoeae FA1090 Regulates Biofilm Formation and Interactions with Human Cells. Front Microbiol (2015) 0.76
Initial sequencing and analysis of the human genome. Nature (2001) 212.86
A genomic perspective on protein families. Science (1997) 50.51
The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res (2000) 49.22
The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res (2001) 43.17
Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res (1998) 23.87
Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res (2001) 22.33
Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks. Proc Natl Acad Sci U S A (1994) 18.46
A superfamily of conserved domains in DNA damage-responsive cell cycle checkpoint proteins. FASEB J (1997) 15.10
Genome sequence of an obligate intracellular pathogen of humans: Chlamydia trachomatis. Science (1998) 13.64
BRCA1 protein products ... Functional motifs... Nat Genet (1996) 11.50
AAA+: A class of chaperone-like ATPases associated with the assembly, operation, and disassembly of protein complexes. Genome Res (1999) 11.30
Bacterial rhodopsin: evidence for a new type of phototrophy in the sea. Science (2000) 10.82
Two related superfamilies of putative helicases involved in replication, recombination, repair and expression of DNA and RNA genomes. Nucleic Acids Res (1989) 10.03
Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea. Mol Microbiol (1997) 8.69
Novel domains of the prokaryotic two-component signal transduction systems. FEMS Microbiol Lett (2001) 8.45
Iterated profile searches with PSI-BLAST--a tool for discovery in protein databases. Trends Biochem Sci (1998) 8.01
Microbial culturomics: paradigm shift in the human gut microbiome study. Clin Microbiol Infect (2012) 7.97
IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices. Bioinformatics (1999) 6.91
Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs). Genome Biol (2000) 6.50
Genome sequence and comparative analysis of the solvent-producing bacterium Clostridium acetobutylicum. J Bacteriol (2001) 6.46
A minimal gene set for cellular life derived by comparison of complete bacterial genomes. Proc Natl Acad Sci U S A (1996) 6.38
Comparison of the complete protein sets of worm and yeast: orthology and divergence. Science (1998) 6.28
Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement and operon disruption. In Silico Biol (1998) 6.22
Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches. J Mol Biol (1999) 5.90
Cleavage of cohesin by the CD clan protease separin triggers anaphase in yeast. Cell (2000) 5.69
Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli. Curr Biol (1996) 5.50
Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context. Genome Res (2001) 5.37
Beyond complete genomes: from sequence to structure and function. Curr Opin Struct Biol (1998) 5.16
Conserved domains in DNA repair proteins and evolution of repair systems. Nucleic Acids Res (1999) 4.80
Genome of the extremely radiation-resistant bacterium Deinococcus radiodurans viewed from the perspective of comparative genomics. Microbiol Mol Biol Rev (2001) 4.75
Genome trees constructed using five different approaches suggest new major bacterial clades. BMC Evol Biol (2001) 4.59
Positionally cloned human disease genes: patterns of evolutionary conservation and functional motifs. Proc Natl Acad Sci U S A (1997) 4.40
The DNA-repair protein AlkB, EGL-9, and leprecan define new families of 2-oxoglutarate- and iron-dependent dioxygenases. Genome Biol (2001) 4.39
Conserved sequence motifs in the initiator proteins for rolling circle DNA replication encoded by diverse replicons from eubacteria, eucaryotes and archaebacteria. Nucleic Acids Res (1992) 4.39
Cysteine proteases of positive strand RNA viruses and chymotrypsin-like serine proteases. A distinct protein superfamily with a common structural fold. FEBS Lett (1989) 4.29
Common origin of four diverse families of large eukaryotic DNA viruses. J Virol (2001) 4.28
N-terminal domains of putative helicases of flavi- and pestiviruses may be serine proteases. Nucleic Acids Res (1989) 4.27
Identification of paracaspases and metacaspases: two ancient families of caspase-like proteins, one of which plays a key role in MALT lymphoma. Mol Cell (2000) 4.19
The HD domain defines a new superfamily of metal-dependent phosphohydrolases. Trends Biochem Sci (1998) 4.11
SAP - a putative DNA-binding motif involved in chromosomal organization. Trends Biochem Sci (2000) 4.10
Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles. Trends Genet (1998) 4.10
The complete sequence (22 kilobases) of murine coronavirus gene 1 encoding the putative proteases and RNA polymerase. Virology (1991) 4.07
Who's your neighbor? New computational approaches for functional genomics. Nat Biotechnol (2000) 4.04
Construction and analysis of bacterial artificial chromosome libraries from a marine microbial assemblage. Environ Microbiol (2000) 3.99
Viral proteins containing the purine NTP-binding sequence pattern. Nucleic Acids Res (1989) 3.96
Chromosome 2 sequence of the human malaria parasite Plasmodium falciparum. Science (1998) 3.83
Comparative genomics of the Archaea (Euryarchaeota): evolution of conserved protein families, the stable core, and the variable shell. Genome Res (1999) 3.62
SURVEY AND SUMMARY: Holliday junction resolvases and related nucleases: identification of new families, phyletic distribution and evolutionary trajectories. Nucleic Acids Res (2000) 3.56
Evolution of aminoacyl-tRNA synthetases--analysis of unique domain architectures and phylogenetic trees reveals a complex history of horizontal gene transfer events. Genome Res (1999) 3.44
Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis. Nucleic Acids Res (1989) 3.38
DNA polymerase beta-like nucleotidyltransferase superfamily: identification of three new families, classification and evolutionary history. Nucleic Acids Res (1999) 3.32
A new superfamily of putative NTP-binding domains encoded by genomes of small DNA and RNA viruses. FEBS Lett (1990) 3.30
Eukaryotic signalling domain homologues in archaea and bacteria. Ancient ancestry and horizontal gene transfer. J Mol Biol (1999) 3.28
Lineage-specific loss and divergence of functionally linked genes in eukaryotes. Proc Natl Acad Sci U S A (2000) 3.27
Non-orthologous gene displacement. Trends Genet (1996) 3.12
Lineage-specific gene expansions in bacterial and archaeal genomes. Genome Res (2001) 3.10
A novel superfamily of nucleoside triphosphate-binding motif containing proteins which are probably involved in duplex unwinding in DNA and RNA replication and recombination. FEBS Lett (1988) 3.09
Applications of network BLAST server. Methods Enzymol (1996) 3.03
An NTP-binding motif is the most conserved sequence in a highly diverged monophyletic group of proteins involved in positive strand RNA viral replication. J Mol Evol (1989) 3.00
A conserved NTP-motif in putative helicases. Nature (1988) 2.94
Novel families of putative protein kinases in bacteria and archaea: evolution of the "eukaryotic" protein kinase superfamily. Genome Res (1998) 2.91
Detection of new genes in a bacterial genome using Markov models for three gene classes. Nucleic Acids Res (1995) 2.89
Rickettsiae and Chlamydiae: evidence of horizontal gene transfer and gene exchange. Trends Genet (1999) 2.87
Predicting functions from protein sequences--where are the bottlenecks? Nat Genet (1998) 2.86
Toprim--a conserved catalytic domain in type IA and II topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins. Nucleic Acids Res (1998) 2.82
SEALS: a system for easy analysis of lots of sequences. Proc Int Conf Intell Syst Mol Biol (1997) 2.80
Prediction of transcription regulatory sites in Archaea by a comparative genomic approach. Nucleic Acids Res (2000) 2.80
RNA sequence of astrovirus: distinctive genomic organization and a putative retrovirus-like ribosomal frameshifting signal that directs the viral replicase synthesis. Proc Natl Acad Sci U S A (1993) 2.77
Role of CED-4 in the activation of CED-3. Nature (1997) 2.76
The U box is a modified RING finger - a common domain in ubiquitination. Curr Biol (2000) 2.73
Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains. J Mol Biol (2001) 2.68
Adaptations of the helix-grip fold for ligand binding and catalysis in the START domain superfamily. Proteins (2001) 2.65
Phosphoesterase domains associated with DNA polymerases of diverse origins. Nucleic Acids Res (1998) 2.61
The NACHT family - a new group of predicted NTPases implicated in apoptosis and MHC transcription activation. Trends Biochem Sci (2000) 2.58
The genome of molluscum contagiosum virus: analysis and comparison with other poxviruses. Virology (1997) 2.58
The domains of death: evolution of the apoptosis machinery. Trends Biochem Sci (1999) 2.56
DNA-binding proteins and evolution of transcription regulation in the archaea. Nucleic Acids Res (1999) 2.53
A diverse superfamily of enzymes with ATP-dependent carboxylate-amine/thiol ligase activity. Protein Sci (1997) 2.47
Gene order is not conserved in bacterial evolution. Trends Genet (1996) 2.44
Did DNA replication evolve twice independently? Nucleic Acids Res (1999) 2.44
Computer analysis of bacterial haloacid dehalogenases defines a large superfamily of hydrolases with diverse specificity. Application of an iterative approach to database search. J Mol Biol (1994) 2.41
Hedgehog patterning activity: role of a lipophilic modification mediated by the carboxy-terminal autoprocessing domain. Cell (1996) 2.39
The bacterial replicative helicase DnaB evolved from a RecA duplication. Genome Res (2000) 2.39
The catalytic domain of the P-type ATPase has the haloacid dehalogenase fold. Trends Biochem Sci (1998) 2.39
Fold prediction and evolutionary analysis of the POZ domain: structural and evolutionary relationship with the potassium channel tetramerization domain. J Mol Biol (1999) 2.37
Superfamily of UvrA-related NTP-binding proteins. Implications for rational classification of recombination/repair systems. J Mol Biol (1990) 2.34
A novel family of predicted phosphoesterases includes Drosophila prune protein and bacterial RecJ exonuclease. Trends Biochem Sci (1998) 2.33