Please use this identifier to cite or link to this item: https://doi.org/10.1214/17-BA1093
Title: Modeling Population Structure Under Hierarchical Dirichlet Processes
Authors: ELLIOTT, LLOYD T
DE IORIO, MARIA 
FAVARO, STEFANO
ADHIKARI, KAUSTUBH
TEH, YEE WHYE 
Keywords: Science & Technology
Physical Sciences
Mathematics, Interdisciplinary Applications
Statistics & Probability
Mathematics
admixture modeling
Bayesian nonparametrics
hierarchical Dirichlet process
linkage disequilibrium
population stratification
single nucleotide polymorphism data
MCMC algorithm
MONTE-CARLO METHODS
BAYESIAN-ANALYSIS
GENETIC-STRUCTURE
INFERENCE
DIFFERENTIATION
ADMIXTURE
ANCESTRY
CRITERIA
DISEASE
HAIR
Issue Date: 1-Jun-2019
Publisher: INT SOC BAYESIAN ANALYSIS
Citation: ELLIOTT, LLOYD T, DE IORIO, MARIA, FAVARO, STEFANO, ADHIKARI, KAUSTUBH, TEH, YEE WHYE (2019-06-01). Modeling Population Structure Under Hierarchical Dirichlet Processes. BAYESIAN ANALYSIS 14 (2) : 313-339. ScholarBank@NUS Repository. https://doi.org/10.1214/17-BA1093
Abstract: We propose a Bayesian nonparametric model to infer population admixture, extending the Hierarchical Dirichlet Process to allow for correlation between loci due to Linkage Disequilibrium. Given multilocus genotype data from a sample of individuals, the model allows inferring classifying individuals as unadmixed or admixed, inferring the number of subpopulations ancestral to an admixed population and the population of origin of chromosomal regions. Our model does not assume any specific mutation process and can be applied to most of the commonly used genetic markers. We present a MCMC algorithm to perform posterior inference from the model and discuss methods to summarise the MCMC output for the analysis of population admixture. We demonstrate the performance of the proposed model in simulations and in a real application, using genetic data from the EDAR gene, which is considered to be ancestry-informative due to well-known variations in allele frequency as well as phenotypic effects across ancestry. The structure analysis of this dataset leads to the identification of a rare haplotype in Europeans.
Source Title: BAYESIAN ANALYSIS
URI: https://scholarbank.nus.edu.sg/handle/10635/155349
ISSN: 1931-6690
1936-0975
DOI: 10.1214/17-BA1093
Appears in Collections:Staff Publications
Elements

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
1503.08278v1.pdf842.5 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.