Knowledge-based expert systems and a proof-of-concept case study for multiple sequence alignment construction and analysis.

PubWeight™: 0.80

Article: PMC 2638625

Published in Brief Bioinform on October 29, 2008

Authors

Mohamed Radhouene Aniba (1), Sophie Siguenza, Anne Friedrich, Frédéric Plewniak, Olivier Poch, Aron Marchler-Bauer, Julie Dawn Thompson

Author Affiliations

1: Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), F-67400 Illkirch, France.

Articles cited by this

MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res (2004) 168.89

T-Coffee: A novel method for fast and accurate multiple sequence alignment. J Mol Biol (2000) 57.88

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res (2002) 47.62

Progressive sequence alignment as a prerequisite to correct phylogenetic trees. J Mol Evol (1987) 41.41

Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res (2003) 38.75

Entrez: molecular biology database and retrieval system. Methods Enzymol (1996) 30.67

BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol Lett (1999) 25.40

SRS: information retrieval system for molecular biology data banks. Methods Enzymol (1996) 24.30

BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data. Mol Biol Evol (1997) 17.52

Pattern-induced multi-sequence alignment (PIMA) algorithm employing secondary structure-dependent gap penalties for use in comparative protein modelling. Protein Eng (1992) 14.40

A comprehensive comparison of multiple sequence alignment programs. Nucleic Acids Res (1999) 12.56

ProbCons: Probabilistic consistency-based multiple sequence alignment. Genome Res (2005) 11.90

BioMOBY: an open source biological web services proposal. Brief Bioinform (2002) 10.06

Comprehensive study on iterative algorithms of multiple sequence alignment. Comput Appl Biosci (1995) 8.39

Clearcut: a fast implementation of relaxed neighbor joining. Bioinformatics (2006) 6.99

BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark. Proteins (2005) 6.57

3DCoffee: combining protein sequences and structures within multiple sequence alignments. J Mol Biol (2004) 6.01

DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches. Nucleic Acids Res (2000) 5.70

Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle. J Comput Biol (2002) 5.50

Integrating 'omic' information: a bridge between genomics and systems biology. Trends Genet (2003) 4.09

PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information. Nucleic Acids Res (2005) 3.15

Towards a reliable objective function for multiple sequence alignments. J Mol Biol (2001) 3.07

RASCAL: rapid scanning and correction of multiple sequence alignments. Bioinformatics (2003) 2.79

An open-source framework for large-scale, flexible evaluation of biomedical text mining systems. J Biomed Discov Collab (2008) 2.07

FIGENIX: intelligent automation of genomic annotation: expertise integration in a new software platform. BMC Bioinformatics (2005) 1.97

PipeAlign: A new toolkit for protein family analysis. Nucleic Acids Res (2003) 1.78

Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure. J Am Med Inform Assoc (2012) 1.72

Computational biology. Bioinformatics--trying to swim in a sea of data. Science (2001) 1.64

A fuzzy logic approach to analyzing gene expression data. Physiol Genomics (2000) 1.54

Automation of in-silico data analysis processes through workflow management systems. Brief Bioinform (2007) 1.51

Case-based reasoning in the health sciences: What's next? Artif Intell Med (2006) 1.29

Capturing expert knowledge with argumentation: a case study in bioinformatics. Bioinformatics (2006) 1.27

Refining multiple sequence alignments with conserved core regions. Nucleic Acids Res (2006) 1.15

Evaluation of iterative alignment algorithms for multiple alignment. Bioinformatics (2004) 1.14

Systems biology by the rules: hybrid intelligent systems for pathway modeling and discovery. BMC Syst Biol (2007) 1.13

The effect of the guide tree on multiple sequence alignments and subsequent phylogenetic analyses. Pac Symp Biocomput (2008) 1.07

Overview of commonly used bioinformatics methods and their applications. Ann N Y Acad Sci (2004) 1.02

Biomediator data integration and inference for functional annotation of anonymous sequences. Pac Symp Biocomput (2007) 1.02

Constructing computational pipelines. Methods Mol Biol (2008) 0.87

Analyzing functional similarity of protein sequences with discrete wavelet transform. Comput Biol Chem (2005) 0.86

Microarray data classified by artificial neural networks. Methods Mol Biol (2007) 0.84

MaSTerClass: a case-based reasoning system for the classification of biomedical terms. Bioinformatics (2005) 0.78

Articles by these authors

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2009) 12.51

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2010) 10.97

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2011) 8.62

MMDB: annotating protein sequences with Entrez's 3D-structure database. Nucleic Acids Res (2006) 7.57

BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark. Proteins (2005) 6.57

EGFR and EphA2 are host factors for hepatitis C virus entry and possible targets for antiviral therapy. Nat Med (2011) 6.21

MMDB: Entrez's 3D-structure database. Nucleic Acids Res (2003) 6.14

The NCBI BioSystems database. Nucleic Acids Res (2009) 4.80

MMDB: Entrez's 3D-structure database. Nucleic Acids Res (2002) 4.42

Seven new loci associated with age-related macular degeneration. Nat Genet (2013) 3.81

Protein subfamily assignment using the Conserved Domain Database. BMC Res Notes (2008) 3.62

An integrated analysis of the genome of the hyperthermophilic archaeon Pyrococcus abyssi. Mol Microbiol (2003) 2.75

BBS10 encodes a vertebrate-specific chaperonin-like protein and is a major BBS locus. Nat Genet (2006) 2.68

Identification and characterization of rod-derived cone viability factor. Nat Genet (2004) 2.63

Identification of a novel BBS gene (BBS12) highlights the major role of a vertebrate-specific branch of chaperonin-related proteins in Bardet-Biedl syndrome. Am J Hum Genet (2006) 2.44

Comparative analysis of ribosomal proteins in complete genomes: an example of reductive evolution at the domain scale. Nucleic Acids Res (2002) 2.24

MACSIMS: multiple alignment of complete sequences information management system. BMC Bioinformatics (2006) 2.23

Identification of genes associated with tumorigenesis and metastatic potential of hypopharyngeal cancer by microarray analysis. Oncogene (2004) 2.19

MMDB: 3D structures and macromolecular interactions. Nucleic Acids Res (2011) 2.06

Issues in bioinformatics benchmarking: the case study of multiple sequence alignment. Nucleic Acids Res (2010) 2.05

ADCK3, an ancestral kinase, is mutated in a form of recessive ataxia associated with coenzyme Q10 deficiency. Am J Hum Genet (2008) 2.04

Inferred Biomolecular Interaction Server--a web server to analyze and predict protein interacting partners and binding sites. Nucleic Acids Res (2009) 1.85

Annotation of functional sites with the Conserved Domain Database. Database (Oxford) (2012) 1.81

MMDB and VAST+: tracking structural similarities between macromolecular complexes. Nucleic Acids Res (2013) 1.79

PipeAlign: A new toolkit for protein family analysis. Nucleic Acids Res (2003) 1.78

A comprehensive benchmark study of multiple sequence alignment methods: current challenges and future perspectives. PLoS One (2011) 1.65

Single-stranded heteroduplex intermediates in lambda Red homologous recombination. BMC Mol Biol (2010) 1.51

Strictosidine synthase: mechanism of a Pictet-Spengler catalyzing enzyme. J Am Chem Soc (2008) 1.50

Identification of a subunit of a novel Kleisin-beta/SMC complex as a potential substrate of protein phosphatase 2A. Curr Biol (2003) 1.41

The evolutionary origin of peroxisomes: an ER-peroxisome connection. Mol Biol Evol (2006) 1.39

Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol. Genome Res (2008) 1.36

Exome sequencing of Bardet-Biedl syndrome patient identifies a null mutation in the BBSome subunit BBIP1 (BBS18). J Med Genet (2013) 1.32

Cell-specific interaction of retinoic acid receptors with target genes in mouse embryonic fibroblasts and embryonic stem cells. Mol Cell Biol (2010) 1.30

Automated hierarchical classification of protein domain subfamilies based on functionally-divergent residue signatures. BMC Bioinformatics (2012) 1.29

Sequence and comparative genomic analysis of actin-related proteins. Mol Biol Cell (2005) 1.28

TBP2 is essential for germ cell development by regulating transcription and chromatin condensation in the oocyte. Genes Dev (2009) 1.27

OrthoInspector: comprehensive orthology analysis and visual exploration. BMC Bioinformatics (2011) 1.25

Whole-exome sequencing identifies mutations in GPR179 leading to autosomal-recessive complete congenital stationary night blindness. Am J Hum Genet (2012) 1.25

Rod-derived Cone Viability Factor-2 is a novel bifunctional-thioredoxin-like protein with therapeutic potential. BMC Mol Biol (2007) 1.24

Identification and functional characterization of a new member of the human Mcm protein family: hMcm8. Nucleic Acids Res (2003) 1.23

Whole-exome sequencing identifies LRIT3 mutations as a cause of autosomal-recessive complete congenital stationary night blindness. Am J Hum Genet (2012) 1.18

PtdIns5P regulation through evolution: roles in membrane trafficking? Trends Biochem Sci (2008) 1.17

A new look towards BAC-based array CGH through a comprehensive comparison with oligo-based array CGH. BMC Genomics (2007) 1.17

IntelliGO: a new vector-based semantic similarity measure including annotation origin. BMC Bioinformatics (2010) 1.16

IBIS (Inferred Biomolecular Interaction Server) reports, predicts and integrates multiple types of conserved interactions for proteins. Nucleic Acids Res (2011) 1.16

Pichia sorbitophila, an Interspecies Yeast Hybrid, Reveals Early Steps of Genome Resolution After Polyploidization. G3 (Bethesda) (2012) 1.12

PeroxisomeDB: a database for the peroxisomal proteome, functional genomics and disease. Nucleic Acids Res (2006) 1.08

Signature of the oligomeric behaviour of nuclear receptors at the sequence and structural level. EMBO Rep (2004) 1.08

Analysis of the retinal gene expression profile after hypoxic preconditioning identifies candidate genes for neuroprotection. BMC Genomics (2008) 1.08

Interrupted coding sequences in Mycobacterium smegmatis: authentic mutations or sequencing errors? Genome Biol (2007) 1.07

RNA polymerase II pausing downstream of core histone genes is different from genes producing polyadenylated transcripts. PLoS One (2012) 1.06

Spectrum of rhodopsin mutations in French autosomal dominant rod-cone dystrophy patients. Invest Ophthalmol Vis Sci (2010) 1.05

Polyglutamine expansion causes neurodegeneration by altering the neuronal differentiation program. Hum Mol Genet (2006) 1.05

ICDS database: interrupted CoDing sequences in prokaryotic genomes. Nucleic Acids Res (2006) 1.04

LEON: multiple aLignment Evaluation Of Neighbours. Nucleic Acids Res (2004) 1.04

Next generation sequencing for molecular diagnosis of neuromuscular diseases. Acta Neuropathol (2012) 1.03

Mitochondrial genome evolution in a single protoploid yeast species. G3 (Bethesda) (2012) 1.02

Defining and characterizing protein surface using alpha shapes. Proteins (2009) 1.01

A new protein linear motif benchmark for multiple sequence alignment software. BMC Bioinformatics (2008) 0.99

Pitfalls of homozygosity mapping: an extended consanguineous Bardet-Biedl syndrome family with two mutant genes (BBS2, BBS10), three mutations, but no triallelism. Eur J Hum Genet (2006) 0.98

Genome-wide evidence for an essential role of the human Staf/ZNF143 transcription factor in bidirectional transcription. Nucleic Acids Res (2010) 0.97

RReportGenerator: automatic reports from routine statistical analysis using R. Bioinformatics (2007) 0.97

The homeobox gene CHX10/VSX2 regulates RdCVF promoter activity in the inner retina. Hum Mol Genet (2009) 0.97

MAO: a Multiple Alignment Ontology for nucleic acid and protein sequences. Nucleic Acids Res (2005) 0.97

Insights into metazoan evolution from Alvinella pompejana cDNAs. BMC Genomics (2010) 0.97

The thioredoxin-like protein rod-derived cone viability factor (RdCVFL) interacts with TAU and inhibits its phosphorylation in the retina. Mol Cell Proteomics (2009) 0.96

Comparison of eukaryotic phylogenetic profiling approaches using species tree aware methods. BMC Bioinformatics (2009) 0.95

Genome-wide in silico identification of new conserved and functional retinoic acid receptor response elements (direct repeats separated by 5 bp). J Biol Chem (2011) 0.94

Density of points clustering, application to transcriptomic data analysis. Nucleic Acids Res (2002) 0.94

Phosphorylation by PKA potentiates retinoic acid receptor alpha activity by means of increasing interaction with and phosphorylation by cyclin H/cdk7. Proc Natl Acad Sci U S A (2006) 0.93

Homozygosity mapping and candidate prioritization identify mutations, missed by whole-exome sequencing, in SMOC2, causing major dental developmental defects. Am J Hum Genet (2011) 0.93

In vivo topoisomerase I inhibition attenuates the expression of hypoxia-inducible factor 1α target genes and decreases tumor angiogenesis. Mol Med (2012) 0.92

MSV3d: database of human MisSense Variants mapped to 3D protein structure. Database (Oxford) (2012) 0.92

Controversies in modern evolutionary biology: the imperative for error detection and quality control. BMC Genomics (2012) 0.90

Nonrandom variations in human cancer ESTs indicate that mRNA heterogeneity increases during carcinogenesis. Proc Natl Acad Sci U S A (2007) 0.89

Differential proteomic analysis of the mouse retina: the induction of crystallin proteins by retinal degeneration in the rd1 mouse. Mol Cell Proteomics (2003) 0.89

M13 endopeptidases: New conserved motifs correlated with structure, and simultaneous phylogenetic occurrence of PHEX and the bony fish. Proteins (2002) 0.88

Proteome adaptation to high temperatures in the ectothermic hydrothermal vent Pompeii worm. PLoS One (2012) 0.88

Evolutionary analysis of the ENTH/ANTH/VHS protein superfamily reveals a coevolution between membrane trafficking and metabolism. BMC Genomics (2012) 0.88

Life in an arsenic-containing gold mine: genome and physiology of the autotrophic arsenite-oxidizing bacterium Rhizobium sp. NT-26. Genome Biol Evol (2013) 0.88

Transcriptomic analysis of human retinal detachment reveals both inflammatory response and photoreceptor death. PLoS One (2011) 0.88

Surface properties and intracellular speciation revealed an original adaptive mechanism to arsenic in the acid mine drainage bio-indicator Euglena mutabilis. Appl Microbiol Biotechnol (2011) 0.87

PromAn: an integrated knowledge-based web server dedicated to promoter analysis. Nucleic Acids Res (2006) 0.87

Complete mitochondrial genome sequence of the yeast Pichia farinosa and comparative analysis of closely related species. Curr Genet (2010) 0.86

RETINOBASE: a web database, data mining and analysis platform for gene expression data on retina. BMC Genomics (2008) 0.86

SM2PH-db: an interactive system for the integrated analysis of phenotypic consequences of missense mutations in proteins involved in human genetic diseases. Hum Mutat (2010) 0.85

M-ORBIS: mapping of molecular binding sites and surfaces. Nucleic Acids Res (2010) 0.85

KD4v: Comprehensible Knowledge Discovery System for Missense Variant. Nucleic Acids Res (2012) 0.85

Prebiotic-like chemistry on Titan. Chem Soc Rev (2012) 0.84

Amylases without known homologues discovered in an acid mine drainage: significance and impact. Sci Rep (2012) 0.84

Structure of the archaeal pab87 peptidase reveals a novel self-compartmentalizing protease family. PLoS One (2009) 0.83

Detecting the molecular scars of evolution in the Mycobacterium tuberculosis complex by analyzing interrupted coding sequences. BMC Evol Biol (2008) 0.83

vALId: validation of protein sequence quality based on multiple alignment data. J Bioinform Comput Biol (2005) 0.83

℮-conome: an automated tissue counting platform of cone photoreceptors for rodent models of retinitis pigmentosa. BMC Ophthalmol (2011) 0.82

Genome-wide analysis of intraspecific transposon diversity in yeast. BMC Genomics (2013) 0.82

Strategies for reliable exploitation of evolutionary concepts in high throughput biology. Evol Bioinform Online (2008) 0.81

An unusual retinal phenotype associated with a novel mutation in RHO. Arch Ophthalmol (2010) 0.81