Identification of significant sequence patterns in proteins.

PubWeight™: 1.99 | Rank: Top 2%

PMID 2179677

Published in Methods Enzymol on January 01, 1990

Authors

S Karlin, B E Blaisdell, V Brendel

Articles by these authors

Prediction of complete gene structures in human genomic DNA. J Mol Biol (1997) 58.76

Finding the genes in genomic DNA. Curr Opin Struct Biol (1998) 7.31

Linkage and selection: two locus symmetric viability model. Theor Popul Biol (1970) 5.90

Over- and under-representation of short oligonucleotides in DNA sequences. Proc Natl Acad Sci U S A (1992) 5.55

General two-locus selection models: some objectives, results and interpretations. Theor Popul Biol (1975) 5.16

A computer algorithm for testing potential prokaryotic terminators. Nucleic Acids Res (1984) 4.73

Polymorphisms for genetic and ecological systems with weak coupling. Theor Popul Biol (1972) 4.47

Towards a theory of the evolution of modifier genes. Theor Popul Biol (1974) 3.79

Application of method of small parameters to multi-niche population genetic models. Theor Popul Biol (1972) 3.72

Rates and probabilities of fixation for two locus random mating finite populations without selection. Genetics (1968) 3.64

Methods and algorithms for statistical analysis of protein sequences. Proc Natl Acad Sci U S A (1992) 3.63

Numerical studies on two-loci selection models with general viabilities. Theor Popul Biol (1975) 3.20

Optimal spliced alignment of homologous cDNA to a genomic DNA template. Bioinformatics (2000) 3.14

New approaches for computer analysis of nucleic acid sequences. Proc Natl Acad Sci U S A (1983) 2.85

A measure of the similarity of sets of sequences not requiring sequence alignment. Proc Natl Acad Sci U S A (1986) 2.68

Association of charge clusters with functional domains of cellular transcription factors. Proc Natl Acad Sci U S A (1989) 2.58

On mutation selection balance for two-locus haploid and diploid populations. Theor Popul Biol (1971) 2.57

Random temporal variation in selection intensities: case of large population size. Theor Popul Biol (1974) 2.46

Genome signature comparisons among prokaryote, plasmid, and mitochondrial DNA. Proc Natl Acad Sci U S A (1999) 2.38

Pervasive CpG suppression in animal mitochondrial genomes. Proc Natl Acad Sci U S A (1994) 2.35

Genome-scale compositional comparisons in eukaryotes. Genome Res (2001) 2.32

Statistical methods and insights for protein and DNA sequences. Annu Rev Biophys Biophys Chem (1991) 2.32

Why are human G-protein-coupled receptors predominantly intronless? Trends Genet (1999) 2.31

Further analysis of negative assortative mating. Genetics (1968) 2.27

Chance and statistical significance in protein and DNA sequence analysis. Science (1992) 2.25

Human cytomegalovirus origin of DNA replication (oriLyt) resides within a highly complex repetitive region. Proc Natl Acad Sci U S A (1992) 2.09

Heterogeneity of genomes: measures and values. Proc Natl Acad Sci U S A (1994) 1.94

Statistical methods for assessing linkage disequilibrium at the HLA-A, B, C loci. Ann Hum Genet (1981) 1.93

Analysis of biochemical genetic data on Jewish populations: II. Results and interpretations of heterogeneity indices and distance measures with respect to standards. Am J Hum Genet (1979) 1.91

Charge configurations in oncogene products and transforming proteins. Oncogene (1990) 1.85

Strand compositional asymmetry in bacterial and large viral genomes. Proc Natl Acad Sci U S A (1998) 1.84

Detecting alien genes in bacterial genomes. Ann N Y Acad Sci (1999) 1.80

Theoretical models of genetic map functions. Theor Popul Biol (1984) 1.76

A method to identify distinctive charge configurations in protein sequences, with application to human herpesvirus polypeptides. J Mol Biol (1989) 1.75

Evolutionary comparisons of RecA-like proteins across all major kingdoms of living organisms. J Mol Evol (1997) 1.67

Similarities and dissimilarities of phage genomes. Proc Natl Acad Sci U S A (1996) 1.67

Markov chain analysis finds a significant influence of neighboring bases on the occurrence of a base in eucaryotic nuclear DNA sequences both protein-coding and noncoding. J Mol Evol (1985) 1.67

Bacterial classifications derived from recA protein sequence comparisons. J Bacteriol (1995) 1.66

Sibling and parent-offspring correlation estimation with variable family size. Proc Natl Acad Sci U S A (1981) 1.63

Very long charge runs in systemic lupus erythematosus-associated autoantigens. Proc Natl Acad Sci U S A (1991) 1.58

Significant similarity and dissimilarity in homologous proteins. Mol Biol Evol (1992) 1.57

Multi-query sequence BLAST output examination with MuSeqBox. Bioinformatics (2001) 1.51

Clusters of charged residues in protein three-dimensional structures. Proc Natl Acad Sci U S A (1996) 1.50

Identification of biased amino acid substitution patterns in human immunodeficiency virus type 1 isolates from patients treated with protease inhibitors. J Virol (1999) 1.50

Patchiness and correlations in DNA sequences. Science (1993) 1.47

Measures of residue density in protein structures. Proc Natl Acad Sci U S A (1999) 1.46

Analysis of genetic data on Jewish populations. I. Historical background, demographic features, and genetic markers. Am J Hum Genet (1979) 1.43

Analysis of models with homozygote x heterozygote matings. Genetics (1968) 1.42

Gene discovery using the maize genome database ZmDB. Nucleic Acids Res (2000) 1.41

Conservation among HSP60 sequences in relation to structure, function, and evolution. Protein Sci (2000) 1.41

Structured exploratory data analysis (SEDA) for determining mode of inheritance of quantitative traits. I. Simulation studies on the effect of background distributions. Am J Hum Genet (1981) 1.40

Linkage and selection: new equilibrium properties of the two-locus symmetric viability model. Proc Natl Acad Sci U S A (1969) 1.40

Highly expressed and alien genes of the Synechocystis genome. Nucleic Acids Res (2001) 1.33

An efficient algorithm for identifying matches with errors in multiple long molecular sequences. J Mol Biol (1991) 1.32

Central equilibria in multilocus systems. I. Generalized nonepistatic selection regimes. Genetics (1979) 1.32

A symmetric-iterated multiple alignment of protein sequences. J Mol Biol (1998) 1.31

Too many leucine zippers? Nature (1989) 1.27

Quantile distributions of amino acid usage in protein classes. Protein Eng (1992) 1.27

Addendum to a paper of W. Ewens. Theor Popul Biol (1972) 1.24

The evolutionary development of modifier genes. Proc Natl Acad Sci U S A (1972) 1.24

Assessments of DNA inhomogeneities in yeast chromosome III. Nucleic Acids Res (1993) 1.22

Comparisons of positive assortative mating and sexual selection models. Theor Popul Biol (1978) 1.22

Assortative mating based on phenotype. I. Two alleles with dominance. Genetics (1969) 1.22

Assortative mating based on phenotype. II. Two autosomal alleles without dominance. Genetics (1969) 1.15

Genetic analysis of the Stanford LRC family study data. I. Structured exploratory data analysis of height and weight measurements. Am J Epidemiol (1981) 1.14

Molecular characterization of a mutable pigmentation phenotype and isolation of the first active transposable element from Sorghum bicolor. Proc Natl Acad Sci U S A (1999) 1.13

Evolutionary aspects and sensitivity studies of some major gene models. J Theor Biol (1978) 1.13

Geometry of interplanar residue contacts in protein structures. Proc Natl Acad Sci U S A (1994) 1.12

Contrasts in codon usage of latent versus productive genes of Epstein-Barr virus: data and hypotheses. J Virol (1990) 1.10

Index measures for assessing the mode of inheritance of continuously distributed traits: I, theory and justifications. Theor Popul Biol (1979) 1.09

Significant potential secondary structures in the Epstein-Barr virus genome. Proc Natl Acad Sci U S A (1986) 1.09

Models of multifactorial inheritance: II. The covariance structure for a scalar phenotype under selective assortative mating and sex-dependent symmetric parental-transmission. Theor Popul Biol (1979) 1.08

The evolution of dominance: a direct approach through the theory of linkage and selection. Theor Popul Biol (1971) 1.06

Gene structure prediction by spliced alignment of genomic DNA with protein sequences: increased accuracy by differential splice site scoring. J Mol Biol (2000) 1.05

Association arrays for comparing familial total cholesterol, high density lipoprotein cholesterol, and triglyceride similarity in the Israeli population by country of origin. Am J Epidemiol (1982) 1.04

U-richness is a defining feature of plant introns and may function as an intron recognition signal in maize. Plant Mol Biol (1998) 1.04

Association arrays for the study of familial height, weight, lipid, and lipoprotein similarity in three West Coast populations. Am J Epidemiol (1982) 1.03

Structured exploratory data analysis (SEDA) for determining mode of inheritance of quantitative traits. II. Simulation studies on the effect of ascertaining families through high-valued probands. Am J Hum Genet (1981) 1.03

Path analysis in genetic epidemiology: a critique. Am J Hum Genet (1983) 1.02

Genetic analysis of the Stanford LRC family study data. II. Structured exploratory data analysis of lipids and lipoproteins. Am J Epidemiol (1981) 1.02

Significant dispersed recurrent DNA sequences in the Escherichia coli genome. Several new groups. J Mol Biol (1993) 1.01

Test of the combinatorial model of intron recognition in a native maize gene. Plant Mol Biol (1999) 1.00

The use of multiple alphabets in kappa-gene immunoglobulin DNA sequence comparisons. EMBO J (1985) 1.00

Charge configurations in viral proteins. Proc Natl Acad Sci U S A (1988) 0.99

Distinctive charge configurations in proteins of the Epstein-Barr virus and possible functions. Proc Natl Acad Sci U S A (1988) 0.99

Multiple-alphabet amino acid sequence comparisons of the immunoglobulin kappa-chain constant domain. Proc Natl Acad Sci U S A (1985) 0.98

A phenotypic symmetric selection model for three loci, two alleles: the case of tight linkage. Theor Popul Biol (1976) 0.98

Random temporal variation in selection intensities acting on infinite diploid populations: diffusion method analysis. Theor Popul Biol (1975) 0.98

Theoretical studies on sex ratio evolution. Monogr Popul Biol (1986) 0.98

Representation of nonepistatic selection models and analysis of multilocus Hardy-Weinberg equilibrium configurations. J Math Biol (1979) 0.97

How are close residues of protein structures distributed in primary sequence? Proc Natl Acad Sci U S A (1995) 0.96

Gene frequency patterns in the Levene subdivided population model. Theor Popul Biol (1977) 0.96

A prevalent persistent global nonrandomness that distinguishes coding and non-coding eucaryotic nuclear DNA sequences. J Mol Evol (1983) 0.95

Comparative statistics for DNA and protein sequences: single sequence analysis. Proc Natl Acad Sci U S A (1985) 0.94

Incidence of squamous cell carcinoma in hairless mice irradiated with ultraviolet light in relation to intake of ascorbic acid (vitamin C) and of D, L-alpha-tocopheryl acetate (vitamin E). Int J Vitam Nutr Res Suppl (1982) 0.93