Ascertainment bias in studies of human genome-wide polymorphism.

PubWeight™: 6.63 | Rank: Top 1%

View Article (PMC 1310637)

Published in Genome Res on November 01, 2005

Authors

Andrew G Clark (1), Melissa J Hubisz, Carlos D Bustamante, Scott H Williamson, Rasmus Nielsen

Author Affiliations

1: Molecular Biology and Genetics and Computational Biology, Cornell University, Ithaca, New York 14853, USA. ac347@cornell.edu

Articles citing this

(truncated to the top 100)

Genes mirror geography within Europe. Nature (2008) 14.23

Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet (2009) 9.16

Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet (2008) 8.92

Measurement of the human allele frequency spectrum demonstrates greater genetic drift in East Asians than in Europeans. Nat Genet (2007) 6.63

Constructing genomic maps of positive selection in humans: where do we go from here? Genome Res (2009) 5.88

Localizing recent adaptive evolution in the human genome. PLoS Genet (2007) 5.11

The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research. Am J Hum Genet (2008) 4.79

Genome-wide association studies in diverse populations. Nat Rev Genet (2010) 4.68

Recent and ongoing selection in the human genome. Nat Rev Genet (2007) 4.62

Association mapping: critical considerations shift from genotyping to experimental design. Plant Cell (2009) 4.59

Progress and promise of genome-wide association studies for human complex trait genetics. Genetics (2010) 3.81

A new approach for using genome scans to detect recent positive selection in the human genome. PLoS Biol (2007) 3.71

Widespread genomic signatures of natural selection in hominid evolution. PLoS Genet (2009) 3.68

Testing for ancient admixture between closely related populations. Mol Biol Evol (2011) 3.56

A customized and versatile high-density genotyping array for the mouse. Nat Methods (2009) 3.53

TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS One (2014) 3.45

Forces shaping the fastest evolving regions in the human genome. PLoS Genet (2006) 3.32

Genome-wide analysis of single nucleotide polymorphisms uncovers population structure in Northern Europe. PLoS One (2008) 3.09

The population genetics of structural variation. Nat Genet (2007) 3.07

Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia. Proc Natl Acad Sci U S A (2007) 2.81

Crop genomics: advances and applications. Nat Rev Genet (2011) 2.72

Genomic signatures of positive selection in humans and the limits of outlier approaches. Genome Res (2006) 2.56

Single nucleotide polymorphism genotyping in polyploid wheat with the Illumina GoldenGate assay. Theor Appl Genet (2009) 2.56

Ascertainment biases in SNP chips affect measures of population divergence. Mol Biol Evol (2010) 2.56

Robust demographic inference from genomic and SNP data. PLoS Genet (2013) 2.48

Comprehensive genotyping of the USA national maize inbred seed bank. Genome Biol (2013) 2.46

Signatures of environmental genetic adaptation pinpoint pathogens as the main selective pressure through human evolution. PLoS Genet (2011) 2.44

Evolutionary dynamics of human Toll-like receptors and their different contributions to host defense. PLoS Genet (2009) 2.35

Analysis of genomic diversity in Mexican Mestizo populations to develop genomic medicine in Mexico. Proc Natl Acad Sci U S A (2009) 2.32

Methodological challenges of genome-wide association analysis in Africa. Nat Rev Genet (2010) 2.27

2b-RAD: a simple and flexible method for genome-wide genotyping. Nat Methods (2012) 2.24

Fregene: simulation of realistic sequence-level data in populations and ascertained samples. BMC Bioinformatics (2008) 2.23

Genomic runs of homozygosity record population history and consanguinity. PLoS One (2010) 2.22

Accelerated genetic drift on chromosome X during the human dispersal out of Africa. Nat Genet (2008) 2.22

Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing. Nat Genet (2010) 2.08

Population differentiation as a test for selective sweeps. Genome Res (2010) 2.08

Controlling the false-positive rate in multilocus genome scans for selection. Genetics (2006) 2.00

A novel DNA sequence database for analyzing human demographic history. Genome Res (2008) 1.85

Linkage disequilibrium and demographic history of wild and domestic canids. Genetics (2009) 1.79

Genome-wide assessment of worldwide chicken SNP genetic diversity indicates significant absence of rare alleles in commercial breeds. Proc Natl Acad Sci U S A (2008) 1.75

Learning about human population history from ancient and modern genomes. Nat Rev Genet (2011) 1.74

Rapidly developing functional genomics in ecological model systems via 454 transcriptome sequencing. Genetica (2008) 1.74

Estimating and interpreting FST: the impact of rare variants. Genome Res (2013) 1.73

Whole genome association mapping by incompatibilities and local perfect phylogenies. BMC Bioinformatics (2006) 1.68

Human population dispersal "Out of Africa" estimated from linkage disequilibrium and allele frequencies of SNPs. Genome Res (2011) 1.64

Genome-wide insights into the patterns and determinants of fine-scale population structure in humans. Am J Hum Genet (2009) 1.64

Genome-wide footprints of pig domestication and selection revealed through massive parallel sequencing of pooled DNA. PLoS One (2011) 1.60

Evaluating cost efficiency of SNP chips in genome-wide association studies. Genet Epidemiol (2008) 1.60

Diverse splicing patterns of exonized Alu elements in human tissues. PLoS Genet (2008) 1.52

Microsatellites are molecular clocks that support accurate inferences about history. Mol Biol Evol (2009) 1.48

Correcting estimators of theta and Tajima's D for ascertainment biases caused by the single-nucleotide polymorphism discovery process. Genetics (2008) 1.44

Inference of relationships in population data using identity-by-descent and identity-by-state. PLoS Genet (2011) 1.43

APOL1 variants and kidney disease in people of recent African ancestry. Nat Rev Nephrol (2013) 1.43

Relaxed Observance of Traditional Marriage Rules Allows Social Connectivity without Loss of Genetic Diversity. Mol Biol Evol (2015) 1.42

Methods for human demographic inference using haplotype patterns from genomewide single-nucleotide polymorphism data. Genetics (2009) 1.42

Frequency spectrum neutrality tests: one for all and all for one. Genetics (2009) 1.39

Characterizing natural variation using next-generation sequencing technologies. Trends Genet (2009) 1.39

Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias. BMC Genomics (2012) 1.39

Signatures of demographic history and natural selection in the human major histocompatibility complex Loci. Genetics (2006) 1.37

A nuclear phylogenetic analysis: SNPs, indels and SSRs deliver new insights into the relationships in the 'true citrus fruit trees' group (Citrinae, Rutaceae) and the origin of cultivated species. Ann Bot (2012) 1.36

Population genomic inferences from sparse high-throughput sequencing of two populations of Drosophila melanogaster. Genome Biol Evol (2009) 1.36

Inferring genetic ancestry: opportunities, challenges, and implications. Am J Hum Genet (2010) 1.36

The structure of common genetic variation in United States populations. Am J Hum Genet (2007) 1.35

The role and challenges of exome sequencing in studies of human diseases. Front Genet (2013) 1.33

A genome-wide signature of positive selection in ancient and recent invasive expansions of the honey bee Apis mellifera. Proc Natl Acad Sci U S A (2008) 1.30

SNP mining in C. clementina BAC end sequences; transferability in the Citrus genus (Rutaceae), phylogenetic inferences and perspectives for genetic mapping. BMC Genomics (2012) 1.29

Effects of ascertainment bias and marker number on estimations of barley diversity from high-throughput SNP genotype data. Theor Appl Genet (2010) 1.28

Genomic signatures of local directional selection in a high gene flow marine organism; the Atlantic cod (Gadus morhua). BMC Evol Biol (2009) 1.28

Allele frequency matching between SNPs reveals an excess of linkage disequilibrium in genic regions of the human genome. PLoS Genet (2006) 1.27

High-throughput sequencing reveals inbreeding depression in a natural population. Proc Natl Acad Sci U S A (2014) 1.27

Efficient moment-based inference of admixture parameters and sources of gene flow. Mol Biol Evol (2013) 1.26

Intergenic DNA sequences from the human X chromosome reveal high rates of global gene flow. BMC Genet (2008) 1.23

Multiethnic genetic association studies improve power for locus discovery. PLoS One (2010) 1.23

A two-stage pruning algorithm for likelihood computation for a population tree. Genetics (2008) 1.21

SNP ascertainment bias in population genetic analyses: why it is important, and how to correct it. Bioessays (2013) 1.18

Recent human adaptation: genomic approaches, interpretation and insights. Nat Rev Genet (2013) 1.15

The promise and limitations of population exomics for human evolution studies. Genome Biol (2011) 1.13

Genetic diversity in India and the inference of Eurasian population expansion. Genome Biol (2010) 1.13

Prion gene haplotypes of U.S. cattle. BMC Genet (2006) 1.10

Even small SNP clusters are non-randomly distributed: is this evidence of mutational non-independence? Proc Biol Sci (2010) 1.10

Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping. BMC Genomics (2011) 1.09

Patterns of population differentiation of candidate genes for cardiovascular disease. BMC Genet (2007) 1.09

Large-scale in silico mapping of complex quantitative traits in inbred mice. PLoS One (2007) 1.08

Candidate genes and genetic architecture of symbiotic and agronomic traits revealed by whole-genome, sequence-based association genetics in Medicago truncatula. PLoS One (2013) 1.08

Genome-wide prediction of functional gene-gene interactions inferred from patterns of genetic differentiation in mice and men. PLoS One (2008) 1.07

Direct inference of SNP heterozygosity rates and resolution of LOH detection. PLoS Comput Biol (2007) 1.06

Selection for translation efficiency on synonymous polymorphisms in recent human evolution. Genome Biol Evol (2011) 1.05

Megabase-scale inversion polymorphism in the wild ancestor of maize. Genetics (2012) 1.05

The evolution of vertebrate opioid receptors. Front Biosci (Landmark Ed) (2009) 1.05

Population substructure in Finland and Sweden revealed by the use of spatial coordinates and a small number of unlinked autosomal SNPs. BMC Genet (2008) 1.02

Population Genomics of Human Adaptation. Annu Rev Ecol Evol Syst (2013) 1.01

The joint allele-frequency spectrum in closely related species. Genetics (2007) 1.01

Vitis phylogenomics: hybridization intensities from a SNP array outperform genotype calls. PLoS One (2013) 1.01

Targeted capture in evolutionary and ecological genomics. Mol Ecol (2015) 1.01

Adaptive evolution in humans revealed by the negative correlation between the polymorphism and fixation phases of evolution. Proc Natl Acad Sci U S A (2007) 1.00

Multiple advantageous amino acid variants in the NAT2 gene in human populations. PLoS One (2008) 1.00

Genome-wide analysis of natural selection on human cis-elements. PLoS One (2008) 1.00

Patterns of evolutionary constraints on genes in humans. BMC Evol Biol (2008) 0.99

Genome-wide SNP and microsatellite variation illuminate population-level epidemiology in the Leishmania donovani species complex. Infect Genet Evol (2011) 0.98

Inter-chromosomal variation in the pattern of human population genetic structure. Hum Genomics (2011) 0.98

Articles cited by this

Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat Genet (1999) 25.93

A test of neutral molecular evolution based on nucleotide data. Genetics (1987) 24.44

Use of unlinked genetic markers to detect population stratification in association studies. Am J Hum Genet (1999) 23.67

Are rare variants responsible for susceptibility to complex diseases? Am J Hum Genet (2001) 22.46

Whole-genome patterns of common DNA variation in three human populations. Science (2005) 21.22

Hitchhiking under positive Darwinian selection. Genetics (2000) 14.72

Natural selection on protein-coding genes in the human genome. Nature (2005) 10.84

The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol (2005) 10.13

Haplotype variation and linkage disequilibrium in 313 human genes. Science (2001) 8.52

Inferring coalescence times from DNA sequence data. Genetics (1997) 8.08

Simultaneous inference of selection and population growth from patterns of variation in the human genome. Proc Natl Acad Sci U S A (2005) 5.94

Pattern of sequence variation across 213 environmental response genes. Genome Res (2004) 3.93

Haplotype diversity across 100 candidate genes for inflammation, lipid metabolism, and blood pressure regulation in two populations. Am J Hum Genet (2004) 3.91

Usefulness of single nucleotide polymorphism data for estimating population parameters. Genetics (2000) 3.78

Reconstituting the frequency spectrum of ascertained single-nucleotide polymorphism data. Genetics (2004) 3.65

The discovery of single-nucleotide polymorphisms--and inferences about human demographic history. Am J Hum Genet (2001) 3.50

Evidence for population growth in humans is confounded by fine-scale population structure. Trends Genet (2002) 2.79

Correcting for ascertainment biases when analyzing SNP data: applications to the estimation of linkage disequilibrium. Theor Popul Biol (2003) 2.73

Population genetic analysis of ascertained SNP data. Hum Genomics (2004) 2.41

The effect of single nucleotide polymorphism identification strategies on estimates of linkage disequilibrium. Mol Biol Evol (2003) 1.93

Articles by these authors

The diploid genome sequence of an Asian individual. Nature (2008) 46.29

Evolution of genes and genomes on the Drosophila phylogeny. Nature (2007) 18.01

Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science (2012) 17.12

Evolutionary and biomedical insights from the rhesus macaque genome. Science (2007) 16.21

The sequence and de novo assembly of the giant panda genome. Nature (2009) 15.76

Genes mirror geography within Europe. Nature (2008) 14.23

Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res (2009) 12.58

A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature (2012) 11.68

Sequencing of 50 human exomes reveals adaptation to high altitude. Science (2010) 11.27

Bayes empirical Bayes inference of amino acid sites under positive selection. Mol Biol Evol (2005) 11.26

Natural selection on protein-coding genes in the human genome. Nature (2005) 10.84

Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios. Science (2003) 9.20

The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing. PLoS One (2007) 9.17

Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet (2009) 9.16

Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet (2008) 8.92

Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol (2005) 8.74

A high-resolution map of human evolutionary constraint using 29 mammals. Nature (2011) 8.67

Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution. Genome Res (2005) 8.38

A scan for positively selected genes in the genomes of humans and chimpanzees. PLoS Biol (2005) 8.32

Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics (2004) 7.79

Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science (2009) 7.64

Ancient human genome sequence of an extinct Palaeo-Eskimo. Nature (2010) 7.51

Demographic history and rare allele sharing among human populations. Proc Natl Acad Sci U S A (2011) 7.36

Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol Biol Evol (2002) 7.21

The evolution of gene expression levels in mammalian organs. Nature (2011) 7.16

Proportionally more deleterious genetic variation in European than in African populations. Nature (2008) 6.61

Genomic scans for selective sweeps using SNP data. Genome Res (2005) 6.43

Simultaneous inference of selection and population growth from patterns of variation in the human genome. Proc Natl Acad Sci U S A (2005) 5.94

Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics. Proc Natl Acad Sci U S A (2007) 5.64

Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants. Nat Genet (2010) 5.44

Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat Commun (2011) 5.41

Genome-wide patterns of population structure and admixture in West Africans and African Americans. Proc Natl Acad Sci U S A (2009) 5.39

Localizing recent adaptive evolution in the human genome. PLoS Genet (2007) 5.11

Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nat Biotechnol (2011) 4.84

An Aboriginal Australian genome reveals separate human dispersals into Asia. Science (2011) 4.84

The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research. Am J Hum Genet (2008) 4.79

Patterns of positive selection in six Mammalian genomes. PLoS Genet (2008) 4.69

A single IGF1 allele is a major determinant of small size in dogs. Science (2007) 4.68

Genomewide SNP variation reveals relationships among landraces and modern varieties of rice. Proc Natl Acad Sci U S A (2009) 4.65

Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication. Nature (2010) 4.64

ClinGen--the Clinical Genome Resource. N Engl J Med (2015) 4.45

The functional spectrum of low-frequency coding variation. Genome Biol (2011) 4.42

A Markov chain Monte Carlo approach for joint inference of population structure and inbreeding rates from multilocus genotype data. Genetics (2007) 4.31

Accuracy and power of statistical methods for detecting adaptive evolution in protein coding sequences and for identifying positively selected sites. Genetics (2004) 4.22

Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites. Genetics (2003) 3.89

Molecular and evolutionary history of melanism in North American gray wolves. Science (2009) 3.87

Genome-wide patterns of nucleotide polymorphism in domesticated rice. PLoS Genet (2007) 3.83

Comparative and demographic analysis of orang-utan genomes. Nature (2011) 3.83

The cost of inbreeding in Arabidopsis. Nature (2002) 3.80

Ancient biomolecules from deep ice cores reveal a forested southern Greenland. Science (2007) 3.79

Reconstituting the frequency spectrum of ascertained single-nucleotide polymorphism data. Genetics (2004) 3.65

A mutation in the myostatin gene increases muscle mass and enhances racing performance in heterozygote dogs. PLoS Genet (2007) 3.61

Hunter-gatherer genomic diversity suggests a southern African origin for modern humans. Proc Natl Acad Sci U S A (2011) 3.53

Linkage disequilibrium as a signature of selective sweeps. Genetics (2004) 3.52

A simple genetic architecture underlies morphological variation in dogs. PLoS Biol (2010) 3.46

Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature (2013) 3.45

Pervasive adaptive evolution in mammalian fertilization proteins. Mol Biol Evol (2003) 3.42

An expressed fgf4 retrogene is associated with breed-defining chondrodysplasia in domestic dogs. Science (2009) 3.29

Colloquium paper: genome-wide patterns of population structure and admixture among Hispanic/Latino populations. Proc Natl Acad Sci U S A (2010) 3.20

Phased whole-genome genetic risk in a family quartet using a major allele reference sequence. PLoS Genet (2011) 3.20

New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing. Nat Commun (2012) 3.18

Stochastic mapping of morphological characters. Syst Biol (2003) 3.08

Bayesian inference of ancient human demography from individual genome sequences. Nat Genet (2011) 3.03

Darwinian and demographic forces affecting human protein coding genes. Genome Res (2009) 3.01

Great ape genetic diversity and population history. Nature (2013) 2.95

Genetic structure and domestication history of the grape. Proc Natl Acad Sci U S A (2011) 2.89

Global distribution of genomic diversity underscores rich complex history of continental human populations. Genome Res (2009) 2.76

Coat variation in the domestic dog is governed by variants in three genes. Science (2009) 2.70

Complete resequencing of 40 genomes reveals domestication events and genes in silkworm (Bombyx). Science (2009) 2.68

Distinguishing between selective sweeps and demography using DNA polymorphism data. Genetics (2005) 2.65

Demographic histories and patterns of linkage disequilibrium in Chinese and Indian rhesus macaques. Science (2007) 2.62

Species-specific responses of Late Quaternary megafauna to climate and humans. Nature (2011) 2.60

Genomic diversity and introgression in O. sativa reveal the impact of domestication and breeding on the rice genome. PLoS One (2010) 2.59

Context dependence, ancestral misidentification, and spurious signatures of natural selection. Mol Biol Evol (2007) 2.56

Adaptive genic evolution in the Drosophila genomes. Proc Natl Acad Sci U S A (2007) 2.56

Ascertainment biases in SNP chips affect measures of population divergence. Mol Biol Evol (2010) 2.56

Inference of historical changes in migration rate from the lengths of migrant tracts. Genetics (2008) 2.53