Bayesian Top-Down Protein Sequence Alignment with Inferred Position-Specific Gap Penalties.

PubWeight™: 0.77

PMID: 27192614

Published in PLoS Comput Biol on May 18, 2016

Authors

Andrew F Neuwald (1), Stephen F Altschul (2)

Author Affiliations

1: Institute for Genome Sciences and Department of Biochemistry & Molecular Biology, University of Maryland School of Medicine, Baltimore, Maryland, United States of America.
2: National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America.

Articles cited by this article

MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res (2004) 168.89

Optimization by simulated annealing. Science (1983) 71.02

Profile hidden Markov models. Bioinformatics (1998) 56.04

MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics (2004) 50.89

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res (2002) 47.62

Progressive sequence alignment as a prerequisite to correct phylogenetic trees. J Mol Evol (1987) 41.41

MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol (2013) 34.34

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol (2011) 28.61

Position-based sequence weights. J Mol Biol (1994) 24.41

Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput Appl Biosci (1996) 19.74

DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment. Bioinformatics (1999) 12.22

Using Dirichlet mixture priors to derive hidden Markov models for protein families. Proc Int Conf Intell Syst Mol Biol (1993) 10.73

CDD: NCBI's conserved domain database. Nucleic Acids Res (2014) 8.25

Kalign--an accurate and fast multiple sequence alignment algorithm. BMC Bioinformatics (2005) 7.01

BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark. Proteins (2005) 6.57

DIALIGN: finding local similarities by multiple sequence alignment. Bioinformatics (1998) 5.11

PartTree: an algorithm to build an approximate tree from a large number of unaligned sequences. Bioinformatics (2006) 3.92

Extracting protein alignment models from the sequence database. Nucleic Acids Res (1997) 3.17

Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features. Nucleic Acids Res (2008) 2.97

MAFFT: iterative refinement and additional methods. Methods Mol Biol (2014) 2.31

Sequence embedding for fast construction of guide trees for multiple sequence alignment. Algorithms Mol Biol (2010) 2.15

Clustal Omega, accurate alignment of very large numbers of sequences. Methods Mol Biol (2014) 2.11

The construction and use of log-odds substitution scores for multiple sequence alignment. PLoS Comput Biol (2010) 1.54

Automated hierarchical classification of protein domain subfamilies based on functionally-divergent residue signatures. BMC Bioinformatics (2012) 1.29

Simple chained guide trees give high-quality protein multiple sequence alignments. Proc Natl Acad Sci U S A (2014) 1.20

Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model. BMC Bioinformatics (2004) 1.12

PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences. J Comput Biol (2014) 0.99

A Bayesian sampler for optimization of protein domain hierarchies. J Comput Biol (2014) 0.84

Dirichlet mixtures, the Dirichlet process, and the structure of protein space. J Comput Biol (2013) 0.79

Protein domain hierarchy Gibbs sampling strategies. Stat Appl Genet Mol Biol (2014) 0.78

Multiple sequence alignment with DIALIGN. Methods Mol Biol (2014) 0.77