Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/155345
DC Field | Value | |
---|---|---|
dc.title | A semi-automatic method to guide the choice of ridge parameter in ridge regression | |
dc.contributor.author | CULE, ERIKA | |
dc.contributor.author | IORIO, MARIA DE | |
dc.date.accessioned | 2019-06-07T01:54:31Z | |
dc.date.available | 2019-06-07T01:54:31Z | |
dc.date.issued | 2012 | |
dc.identifier.citation | CULE, ERIKA, IORIO, MARIA DE (2012). A semi-automatic method to guide the choice of ridge parameter in ridge regression. Annals of Applied Statistics. ScholarBank@NUS Repository. | |
dc.identifier.issn | 1932-6157 | |
dc.identifier.issn | 1941-7330 | |
dc.identifier.uri | https://scholarbank.nus.edu.sg/handle/10635/155345 | |
dc.description.abstract | We consider the application of a popular penalised regression method, Ridge Regression, to data with very high dimensions and many more covariates than observations. Our motivation is the problem of out-of-sample prediction and the setting is high-density genotype data from a genome-wide association or resequencing study. Ridge regression has previously been shown to offer improved performance for prediction when compared with other penalised regression methods. One problem with ridge regression is the choice of an appropriate parameter for controlling the amount of shrinkage of the coefficient estimates. Here we propose a method for choosing the ridge parameter based on controlling the variance of the predicted observations in the model. Using simulated data, we demonstrate that our method outperforms subset selection based on univariate tests of association and another penalised regression method, HyperLasso regression, in terms of improved prediction error. We extend our approach to regression problems when the outcomes are binary (representing cases and controls, as is typically the setting for genome-wide association studies) and demonstrate the method on a real data example consisting of case-control and genotype data on Bipolar Disorder, taken from the Wellcome Trust Case Control Consortium and the Genetic Association Information Network. | |
dc.source | Elements | |
dc.subject | stat.AP | |
dc.subject | q-bio.GN | |
dc.type | Article | |
dc.date.updated | 2019-06-03T23:49:02Z | |
dc.contributor.department | YALE-NUS COLLEGE | |
dc.description.sourcetitle | Annals of Applied Statistics | |
dc.published.state | Unpublished | |
Appears in Collections: | Staff Publications Elements |
Show simple item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
1205.0686v1.pdf | 500.55 kB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.