Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

PubWeight™: 53.11‹?› | Rank: Top 0.01% | All-Time Top 1000

🔗 View Article (PMC 1462648)

Published in Genetics on August 01, 2003

Authors

Daniel Falush1, Matthew Stephens, Jonathan K Pritchard

Author Affiliations

1: Department of Molecular Biology, Max-Planck Institut für Infektionsbiologie, Schumann Strasse 21/22, 10117 Berlin, Germany. falush@mpiib-berlin.mpg.de

Associated clinical trials:

Family Blood Pressure Program - GenNet Network | NCT00005268

Articles citing this

(truncated to the top 100)

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature (2007) 144.95

Population structure and eigenanalysis. PLoS Genet (2006) 37.21

A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet (2006) 28.32

A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat Genet (2007) 22.96

GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet (2010) 20.73

Fast model-based estimation of ancestry in unrelated individuals. Genome Res (2009) 15.63

Structural variation of chromosomes in autism spectrum disorder. Am J Hum Genet (2008) 15.51

Sex and virulence in Escherichia coli: an evolutionary perspective. Mol Microbiol (2006) 14.53

Genes mirror geography within Europe. Nature (2008) 14.23

Methods for high-density admixture mapping of disease genes. Am J Hum Genet (2004) 12.02

Genetic variation in PNPLA3 confers susceptibility to nonalcoholic fatty liver disease. Nat Genet (2008) 10.87

A high-density admixture map for disease gene discovery in african americans. Am J Hum Genet (2004) 10.87

Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour (2009) 10.81

The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol (2005) 10.13

Inference of population structure using multilocus genotype data: dominant markers and null alleles. Mol Ecol Notes (2007) 10.11

Reconstructing Indian population history. Nature (2009) 9.28

A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1). Nat Genet (2009) 8.39

Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet (2010) 7.67

Design and analysis of admixture mapping studies. Am J Hum Genet (2004) 7.20

Genomewide rapid association using mixed model and regression: a fast and simple method for genomewide pedigree-based quantitative trait loci association analysis. Genetics (2007) 7.09

An Arabidopsis example of association mapping in structured samples. PLoS Genet (2006) 7.03

An African origin for the intimate association between humans and Helicobacter pylori. Nature (2007) 6.91

Informativeness of genetic markers for inference of ancestry. Am J Hum Genet (2003) 6.90

Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet (2005) 6.73

A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics (2008) 6.62

Analysis and application of European genetic substructure using 300 K SNP information. PLoS Genet (2008) 6.42

Sensitive detection of chromosomal segments of distinct ancestry in admixed populations. PLoS Genet (2009) 6.36

Reconstructing genetic ancestry blocks in admixed individuals. Am J Hum Genet (2006) 5.92

Estimating local ancestry in admixed populations. Am J Hum Genet (2008) 5.90

Genetic structure and diversity in Oryza sativa L. Genetics (2005) 5.74

Building the sequence map of the human pan-genome. Nat Biotechnol (2009) 5.53

Common variants at 7p21 are associated with frontotemporal lobar degeneration with TDP-43 inclusions. Nat Genet (2010) 5.52

Genome-wide patterns of population structure and admixture in West Africans and African Americans. Proc Natl Acad Sci U S A (2009) 5.39

European population substructure: clustering of northern and southern populations. PLoS Genet (2006) 5.27

Contribution of SHANK3 mutations to autism spectrum disorder. Am J Hum Genet (2007) 5.04

Web-based genome-wide association study identifies two novel loci and a substantial genetic component for Parkinson's disease. PLoS Genet (2011) 5.01

Ancient admixture in human history. Genetics (2012) 4.99

Genetic design and statistical power of nested association mapping in maize. Genetics (2008) 4.92

Genetic variation and population structure in native Americans. PLoS Genet (2007) 4.87

Inference of population structure using dense haplotype data. PLoS Genet (2012) 4.87

Genome-wide association studies in diverse populations. Nat Rev Genet (2010) 4.68

Ancestry informative marker sets for determining continental origin and admixture proportions in common populations in America. Hum Mutat (2009) 4.65

Statistical tests for admixture mapping with case-control and cases-only data. Am J Hum Genet (2004) 4.52

Social supports and serotonin transporter gene moderate depression in maltreated children. Proc Natl Acad Sci U S A (2004) 4.42

A Markov chain Monte Carlo approach for joint inference of population structure and inbreeding rates from multilocus genotype data. Genetics (2007) 4.31

Genome-wide association study confirms SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease. Ann Hum Genet (2010) 4.26

Genome-wide association mapping in Arabidopsis identifies previously known flowering time and pathogen resistance genes. PLoS Genet (2005) 4.25

Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations. BMC Bioinformatics (2008) 4.25

Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics (2005) 4.19

A spatial statistical model for landscape genetics. Genetics (2004) 4.11

Measuring European population stratification with microarray genotype data. Am J Hum Genet (2007) 4.03

Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis. Hum Mol Genet (2008) 3.93

A new perspective on Listeria monocytogenes evolution. PLoS Pathog (2008) 3.85

A genomewide single-nucleotide-polymorphism panel with high ancestry information for African American admixture mapping. Am J Hum Genet (2006) 3.77

Genome-wide association study of bipolar disorder in European American and African American individuals. Mol Psychiatry (2009) 3.64

Multiplexed shotgun genotyping for rapid and efficient genetic mapping. Genome Res (2011) 3.63

Clonal origin and evolution of a transmissible cancer. Cell (2006) 3.60

A shared susceptibility locus in PLCE1 at 10q23 for gastric adenocarcinoma and esophageal squamous cell carcinoma. Nat Genet (2010) 3.53

Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars. Proc Natl Acad Sci U S A (2013) 3.48

A simple genetic architecture underlies morphological variation in dogs. PLoS Biol (2010) 3.46

Inference of population structure under a Dirichlet process model. Genetics (2007) 3.42

Accounting for ancestry: population substructure and genome-wide association studies. Hum Mol Genet (2008) 3.39

A genomewide single-nucleotide-polymorphism panel for Mexican American admixture mapping. Am J Hum Genet (2007) 3.39

Multilocus patterns of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations. Genome Res (2005) 3.38

Multilocus sequence typing as a replacement for serotyping in Salmonella enterica. PLoS Pathog (2012) 3.37

Loci on 20q13 and 21q22 are associated with pediatric-onset inflammatory bowel disease. Nat Genet (2008) 3.36

Genetic control of human brain transcript expression in Alzheimer disease. Am J Hum Genet (2009) 3.35

Evolution under domestication: ongoing artificial selection and divergence of wild and managed Stenocereus pruinosus (Cactaceae) populations in the Tehuacan Valley, Mexico. Ann Bot (2010) 3.34

The genetic structure of Pacific Islanders. PLoS Genet (2008) 3.31

Stacks: an analysis tool set for population genomics. Mol Ecol (2013) 3.29

Biodiversity in the Cladosporium herbarum complex (Davidiellaceae, Capnodiales), with standardisation of methods for Cladosporium taxonomy and diagnostics. Stud Mycol (2007) 3.25

Colloquium paper: genome-wide patterns of population structure and admixture among Hispanic/Latino populations. Proc Natl Acad Sci U S A (2010) 3.20

Population stratification confounds genetic association studies among Latinos. Hum Genet (2005) 3.19

Recombinational landscape and population genomics of Caenorhabditis elegans. PLoS Genet (2009) 3.14

Origin, spread and demography of the Mycobacterium tuberculosis complex. PLoS Pathog (2008) 3.10

A genetic atlas of human admixture history. Science (2014) 3.09

Genome-wide analysis of single nucleotide polymorphisms uncovers population structure in Northern Europe. PLoS One (2008) 3.09

Geographic patterns of genome admixture in Latin American Mestizos. PLoS Genet (2008) 3.07

Admixture mapping of white cell count: genetic locus responsible for lower white blood cell count in the Health ABC and Jackson Heart studies. Am J Hum Genet (2008) 3.06

Geographical structure and differential natural selection among North European populations. Genome Res (2009) 3.03

A unified association analysis approach for family and unrelated samples correcting for stratification. Am J Hum Genet (2008) 3.00

A genome-wide genotyping study in patients with ischaemic stroke: initial analysis and data release. Lancet Neurol (2007) 2.99

Logistic regression protects against population structure in genetic association studies. Genome Res (2005) 2.94

Recent history of artificial outcrossing facilitates whole-genome association mapping in elite inbred crop varieties. Proc Natl Acad Sci U S A (2006) 2.90

Sequence typing and comparison of population biology of Campylobacter coli and Campylobacter jejuni. J Clin Microbiol (2005) 2.88

Chromosome-scale selective sweeps shape Caenorhabditis elegans genomic diversity. Nat Genet (2012) 2.85

A fast method for computing high-significance disease association in large population-based studies. Am J Hum Genet (2006) 2.84

A cryptic subgroup of Anopheles gambiae is highly susceptible to human malaria parasites. Science (2011) 2.81

The history of African gene flow into Southern Europeans, Levantines, and Jews. PLoS Genet (2011) 2.81

Population-based risk assessment of APOL1 on renal disease. J Am Soc Nephrol (2011) 2.81

Prospects for admixture mapping of complex traits. Am J Hum Genet (2004) 2.73

Examination of ancestry and ethnic affiliation using highly informative diallelic DNA markers: application to diverse and admixed populations and implications for clinical epidemiology and forensic medicine. Hum Genet (2005) 2.67

Inferring human colonization history using a copying model. PLoS Genet (2008) 2.64

A genome-wide survey of R gene polymorphisms in Arabidopsis. Plant Cell (2006) 2.64

Tracing the source of campylobacteriosis. PLoS Genet (2008) 2.64

Genomic diversity and introgression in O. sativa reveal the impact of domestication and breeding on the rice genome. PLoS One (2010) 2.59

The protein kinase Pstol1 from traditional rice confers tolerance of phosphorus deficiency. Nature (2012) 2.57

Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour (2015) 2.56

Inference of locus-specific ancestry in closely related populations. Bioinformatics (2009) 2.54

Inference of historical changes in migration rate from the lengths of migrant tracts. Genetics (2008) 2.53

Articles cited by this

A new statistical method for haplotype reconstruction from population data. Am J Hum Genet (2001) 59.30

Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc Natl Acad Sci U S A (1979) 41.08

MEGA2: molecular evolutionary genetics analysis software. Bioinformatics (2001) 38.50

Genetic structure of human populations. Science (2002) 30.91

A high-resolution recombination map of the human genome. Nat Genet (2002) 28.66

Estimating African American admixture proportions by use of population-specific alleles. Am J Hum Genet (1998) 22.90

High-resolution haplotype structure in the human genome. Nat Genet (2001) 20.51

Comprehensive human genetic maps: individual and sex-specific variation in recombination. Am J Hum Genet (1998) 19.35

Traces of human migrations in Helicobacter pylori populations. Science (2003) 11.92

Dwarf8 polymorphisms associate with variation in flowering time. Nat Genet (2001) 9.63

A model-based method for identifying species hybrids using multilocus genetic data. Genetics (2002) 8.07

Mapping genes that underlie ethnic differences in disease risk: methods for detecting linkage in admixed populations, by conditioning on parental admixture. Am J Hum Genet (1998) 7.96

Mapping by admixture linkage disequilibrium in human populations: limits and guidelines. Am J Hum Genet (1994) 7.16

Gm3;5,13,14 and type 2 diabetes mellitus: an association in American Indians with genetic admixture. Am J Hum Genet (1988) 7.00

Adaptation, speciation and hybrid zones. Nature (1989) 6.26

Estimation of admixture and detection of linkage in admixed populations by a Bayesian approach: application to African-American populations. Ann Hum Genet (2000) 6.13

Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. Am J Hum Genet (2001) 5.66

The genetic structure of admixed populations. Genetics (1991) 5.63

A Bayesian approach to the identification of panmictic populations and the assignment of individuals. Genet Res (2001) 5.61

Hybrid zones and the genetic architecture of a barrier to gene flow between two sunflower species. Genetics (1999) 4.81

Population structure in admixed populations: effect of admixture dynamics on the pattern of linkage disequilibrium. Am J Hum Genet (2000) 4.74

Inferring admixture proportions from molecular data. Mol Biol Evol (1998) 4.04

Genetic diversity and introgression in the Scottish wildcat. Mol Ecol (2001) 3.58

Estimation of admixture proportions: a likelihood-based approach using Markov chain Monte Carlo. Genetics (2001) 3.45

Genome scan among Nigerians linking blood pressure to chromosomes 2, 3, and 19. Hypertension (2002) 2.89

A genome-wide linkage analysis investigating the determinants of blood pressure in whites and African Americans. Am J Hypertens (2003) 2.76

The Icelandic admixture problem. Ann Hum Genet (1973) 2.67

Uralic genes in Europe. Am J Phys Anthropol (1990) 2.57

Microsatellite variation in natural Drosophila melanogaster populations from New South Wales (Australia) and Tasmania. Mol Ecol (2001) 2.41

Conditions which govern the Growth of the Bacillus of "Gas Gangrene" in Artificial Culture Media, in the Blood Fluids in vitro, and in the Dead and Living Organism. Proc R Soc Med (1917) 2.32

Articles by these authors

A second generation human haplotype map of over 3.1 million SNPs. Nature (2007) 85.39

RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res (2008) 62.07

Genetic structure of human populations. Science (2002) 30.91

A map of recent positive selection in the human genome. PLoS Biol (2006) 29.19

A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet (2006) 28.32

Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data. Genetics (2003) 17.73

Genome-wide detection and characterization of positive selection in human populations. Nature (2007) 17.27

A high-resolution survey of deletion polymorphism in the human genome. Nat Genet (2005) 16.99

Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature (2010) 16.86

Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genet (2007) 15.55

Genes mirror geography within Europe. Nature (2008) 14.23

A comparison of phasing algorithms for trios and unrelated individuals. Am J Hum Genet (2006) 12.45

A systematic survey of loss-of-function variants in human protein-coding genes. Science (2012) 12.25

Traces of human migrations in Helicobacter pylori populations. Science (2003) 11.92

Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet (2012) 11.29

Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour (2009) 10.81

Inference of population structure using multilocus genotype data: dominant markers and null alleles. Mol Ecol Notes (2007) 10.11

High-resolution mapping of expression-QTLs yields insight into human gene regulation. PLoS Genet (2008) 9.68

Convergent adaptation of human lactase persistence in Africa and Europe. Nat Genet (2006) 9.44

Genotype imputation with thousands of genomes. G3 (Bethesda) (2011) 8.77

Interpreting principal component analyses of spatial population genetic variation. Nat Genet (2008) 8.49

A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat Genet (2006) 8.46

Signals of recent positive selection in a worldwide sample of human populations. Genome Res (2009) 8.38

Sequencing and analysis of Neanderthal genomic DNA. Science (2006) 8.06

Evidence for substantial fine-scale variation in recombination rates across the human genome. Nat Genet (2004) 6.99

Overcoming the winner's curse: estimating penetrance parameters from case-control data. Am J Hum Genet (2007) 6.99

Informativeness of genetic markers for inference of ancestry. Am J Hum Genet (2003) 6.90

Practical issues in imputation-based association mapping. PLoS Genet (2008) 6.76

Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet (2005) 6.73

Genome-wide efficient mixed-model analysis for association studies. Nat Genet (2012) 6.62

DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines. Genome Biol (2011) 6.48

DNase I sensitivity QTLs are a major determinant of human expression variation. Nature (2012) 6.17

Revealing the architecture of gene regulation: the promise of eQTL studies. Trends Genet (2008) 5.78

Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics (2009) 5.62

Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data. Genome Res (2010) 5.40

Coalescent-based association mapping and fine mapping of complex trait loci. Genetics (2004) 5.26

Haplotype blocks and linkage disequilibrium in the human genome. Nat Rev Genet (2003) 5.16

High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans. Science (2008) 5.02

Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet (2012) 4.85

Statistical tests for admixture mapping with case-control and cases-only data. Am J Hum Genet (2004) 4.52

Completing the map of human genetic variation. Nature (2007) 4.38

Polymorphisms of the HNF1A gene encoding hepatocyte nuclear factor-1 alpha are associated with C-reactive protein. Am J Hum Genet (2008) 3.61

Confounding from cryptic relatedness in case-control association studies. PLoS Genet (2005) 3.61

Sex-specific and lineage-specific alternative splicing in primates. Genome Res (2009) 3.61

Clonal origin and evolution of a transmissible cancer. Cell (2006) 3.60

The role of geography in human adaptation. PLoS Genet (2009) 3.41

Noisy splicing drives mRNA isoform diversity in human cells. PLoS Genet (2010) 3.19

Assigning African elephant DNA to geographic region of origin: applications to the ivory trade. Proc Natl Acad Sci U S A (2004) 3.17

Identification of genetic variants that affect histone modifications in human cells. Science (2013) 3.11

Comment on "Widespread RNA and DNA sequence differences in the human transcriptome". Science (2012) 2.79

Adaptations to climate in candidate genes for common metabolic disorders. PLoS Genet (2008) 2.68

Genomics: ENCODE explained. Nature (2012) 2.62

Automating resequencing-based detection of insertion-deletion polymorphisms. Nat Genet (2006) 2.61

Polygenic modeling with bayesian sparse linear mixed models. PLoS Genet (2013) 2.59

Assessing the performance of the haplotype block model of linkage disequilibrium. Am J Hum Genet (2003) 2.59

Using environmental correlations to identify loci underlying local adaptation. Genetics (2010) 2.52

Dissecting the regulatory architecture of gene expression QTLs. Genome Biol (2012) 2.51

A statin-dependent QTL for GATM expression is associated with statin-induced myopathy. Nature (2013) 2.46

A statistical framework for joint eQTL analysis in multiple tissues. PLoS Genet (2013) 2.46

msHOT: modifying Hudson's ms simulator to incorporate crossover and gene conversion hotspots. Bioinformatics (2006) 2.45

Genome-wide association of lipid-lowering response to statins in combined study populations. PLoS One (2010) 2.43

Absence of the TAP2 human recombination hotspot in chimpanzees. PLoS Biol (2004) 2.22

DNA sequence-dependent compartmentalization and silencing of chromatin at the nuclear lamina. Cell (2012) 2.20

Primate transcript and protein expression levels evolve under compensatory selection pressures. Science (2013) 2.18

Adaptations to climate-mediated selective pressures in humans. PLoS Genet (2011) 2.15

Controls of nucleosome positioning in the human genome. PLoS Genet (2012) 2.14

Conservation of hotspots for recombination in low-copy repeats associated with the NF1 microdeletion. Nat Genet (2006) 2.11

Analysis of population structure: a unifying framework and novel methods based on sparse factor analysis. PLoS Genet (2010) 2.06

Global effect of PEG-IFN-alpha and ribavirin on gene expression in PBMC in vitro. J Interferon Cytokine Res (2004) 1.98

A genome-wide study of DNA methylation patterns and gene expression levels in multiple human and chimpanzee tissues. PLoS Genet (2011) 1.92

Evidence for extensive transmission distortion in the human genome. Am J Hum Genet (2003) 1.86

False positive peaks in ChIP-seq and other sequencing-based functional assays caused by unannotated high copy number regions. Bioinformatics (2011) 1.85

Efficient counting of k-mers in DNA sequences using a bloom filter. BMC Bioinformatics (2011) 1.76

Gene expression levels are a target of recent natural selection in the human genome. Mol Biol Evol (2008) 1.63

Using DNA to track the origin of the largest ivory seizure since the 1989 trade ban. Proc Natl Acad Sci U S A (2007) 1.56

Interactions between glucocorticoid treatment and cis-regulatory polymorphisms contribute to cellular response phenotypes. PLoS Genet (2011) 1.49

Next generation analytic tools for large scale genetic epidemiology studies of complex diseases. Genet Epidemiol (2011) 1.47

Adaptive evolution of conserved noncoding elements in mammals. PLoS Genet (2007) 1.45

Characterizing natural variation using next-generation sequencing technologies. Trends Genet (2009) 1.39

Variation in human recombination rates and its genetic determinants. PLoS One (2011) 1.39

The genetic architecture of adaptations to high altitude in Ethiopia. PLoS Genet (2012) 1.38

Comparative RNA sequencing reveals substantial genetic variation in endangered primates. Genome Res (2011) 1.36

Combating the illegal trade in African elephant ivory with DNA forensics. Conserv Biol (2008) 1.34

Fast and accurate estimation of the population-scaled mutation rate, theta, from microsatellite genotype data. Genetics (2007) 1.32

The contribution of RNA decay quantitative trait loci to inter-individual variation in steady-state gene expression levels. PLoS Genet (2012) 1.25

Genome-wide association study of d-amphetamine response in healthy volunteers identifies putative associations, including cadherin 13 (CDH13). PLoS One (2012) 1.14

Integrated enrichment analysis of variants and pathways in genome-wide association studies indicates central role for IL-2 signaling genes in type 1 diabetes, and cytokine signaling genes in Crohn's disease. PLoS Genet (2013) 1.14

USING LINEAR PREDICTORS TO IMPUTE ALLELE FREQUENCIES FROM SUMMARY OR POOLED GENOTYPE DATA. Ann Appl Stat (2010) 1.12

Haplotype variation and genotype imputation in African populations. Genet Epidemiol (2011) 1.10

Functional comparison of innate immune signaling pathways in primates. PLoS Genet (2010) 1.09

Haplotypic background of a private allele at high frequency in the Americas. Mol Biol Evol (2009) 1.08