Please use this identifier to cite or link to this item:
Title: Estimation Based on Pooled Data in Human Biomonitoring and Statistical Genetics
Authors: LI XIANG
Keywords: biomonitoring, collapsed data, EM algorithm, Gaussian likelihood, haplotype frequency estimation, pooled samples
Issue Date: 6-May-2014
Citation: LI XIANG (2014-05-06). Estimation Based on Pooled Data in Human Biomonitoring and Statistical Genetics. ScholarBank@NUS Repository.
Abstract: Pooling is a cost-effective way to collect data. However, estimation is complicated by the often intractable distributions of the observed pool averages. In this thesis, we consider two applications involving pooled data. The first is to use aggregate data collected from pools of individuals to estimate the levels of individual exposure for various environmental biochemicals. We propose a quasi empirical Bayes estimation approach based on a Gaussian working likelihood which enables pooling of information across different demographic groups. The new estimator out-performs an existing estimator in simulation studies. We consider haplotype frequency estimation from pooled genotype data in our second application. A quick collapsed data estimator is proposed which does not lose much efficiency for rare genetic variants. For more efficient estimates, we propose a way to construct a data-based list of possible haplotypes to be used in conjunction with the expectation-maximization (EM) algorithm to make it more feasible computationally.
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
LiX.pdf936.03 kBAdobe PDF



Page view(s)

checked on Feb 17, 2020


checked on Feb 17, 2020

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.