Please use this identifier to cite or link to this item: https://doi.org/10.1080/00031305.2013.817356
DC FieldValue
dc.titleOn the effect and remedies of shrinkage on classification probability estimation
dc.contributor.authorZhang, C.
dc.contributor.authorLiu, Y.
dc.contributor.authorWu, Z.
dc.date.accessioned2014-10-28T05:13:47Z
dc.date.available2014-10-28T05:13:47Z
dc.date.issued2013
dc.identifier.citationZhang, C., Liu, Y., Wu, Z. (2013). On the effect and remedies of shrinkage on classification probability estimation. American Statistician 67 (3) : 134-142. ScholarBank@NUS Repository. https://doi.org/10.1080/00031305.2013.817356
dc.identifier.issn00031305
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/105270
dc.description.abstractShrinkage methods have been shown to be effective for classification problems. As a form of regularization, shrinkage through penalization helps to avoid overfitting and produces accurate classifiers for prediction, especially when the dimension is relatively high. Despite the benefit of shrinkage on classification accuracy of resulting classifiers, in this article, we demonstrate that shrinkage creates biases on classification probability estimation. In many cases, this bias can be large and consequently yield poor class probability estimation when the sample size is small or moderate. We offer some theoretical insights into the effect of shrinkage and provide remedies for better class probability estimation. Using penalized logistic regression and proximal support vector machines as examples, we demonstrate that our proposed refit method gives similar classification accuracy and remarkable improvements on probability estimation on several simulated and real data examples. © 2013 American Statistical Association.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1080/00031305.2013.817356
dc.sourceScopus
dc.subjectBias
dc.subjectHigh dimension
dc.subjectRefit
dc.subjectRegularization
dc.typeArticle
dc.contributor.departmentSTATISTICS & APPLIED PROBABILITY
dc.description.doi10.1080/00031305.2013.817356
dc.description.sourcetitleAmerican Statistician
dc.description.volume67
dc.description.issue3
dc.description.page134-142
dc.identifier.isiut000325911700003
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.