Please use this identifier to cite or link to this item:
Title: Model selection methods and their application in genome-wide association studies
Keywords: High dimensional model selection,Dimension reduction,SCAD penalty,Extended Bayes information criterion,Genome-wide association study,Gene interactions
Issue Date: 29-Dec-2008
Citation: ZHAO JINGYUAN (2008-12-29). Model selection methods and their application in genome-wide association studies. ScholarBank@NUS Repository.
Abstract: As the data where the number of features greatly exceeds the sample size frequently appear in many areas, high dimensional model selection problem has become common and imminent. We propose the generalized tournament approach cum EBIC for generalized linear models with both main effects and interactions. In screening step, main effects and interactions are screened in consecutive stages until the dimension of features is reduced to a desirable level. In final selection, the modified SCAD method combined with EBIC is developed to select causal features. We show that the modified SCAD method guarantees finite parameter estimates in case of separation. To cater the demand for statistical methods to identify causal SNPs with interaction structure, we apply the generalized tournament procedure in genome-wide association studies. In simulated data, we demonstrate that the generalized tournament approach cum EBIC enjoys higher positive selection rate and lower false discovery rate than interaction-based multiple testing.
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
ZhaoJY.pdf354.87 kBAdobe PDF



Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.