Please use this identifier to cite or link to this item: https://doi.org/10.1198/016214504000001466
DC FieldValue
dc.titleOutlier labeling with boxplot procedures
dc.contributor.authorSim, C.H.
dc.contributor.authorGan, F.F.
dc.contributor.authorChang, T.C.
dc.date.accessioned2014-10-28T05:14:11Z
dc.date.available2014-10-28T05:14:11Z
dc.date.issued2005-06
dc.identifier.citationSim, C.H., Gan, F.F., Chang, T.C. (2005-06). Outlier labeling with boxplot procedures. Journal of the American Statistical Association 100 (470) : 642-652. ScholarBank@NUS Repository. https://doi.org/10.1198/016214504000001466
dc.identifier.issn01621459
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/105291
dc.description.abstractIn this article we focus on the detection of possible outliers based on the widely used boxplot procedures. The outliers in a set of data are defined to be a subset of observations that appear to be inconsistent with the remaining observations. We identify the outliers by constructing a boxplot with its lower fence (LF) and upper fence (UF) either (a) satisfying the requirement that if the given sample is outlier-free, then the probability that one or more of the sample data would fall outside the region (LF, UF) is equal to a prescribed small value α, or (b) taken to be the tolerance limits, derived from an outlier-free random sample, within which a specified large proportion β of the sampled population would be asserted to fall with a given large probability γ. Exact expressions that can be routinely used to evaluate the constants needed in the construction of the boxplot's outlier region for samples taken from the family of location-scale distributions are obtained for both procedures. This article shows that the commonly constructed boxplot is in general inappropriate for detecting outliers in the normal and especially the exponential samples. We recommend that the graphical boxplot be constructed based on the knowledge of the underlying distribution of the dataset and by condoling the risk of labeling regular observations as outliers. © 2005 American Statistical Association.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1198/016214504000001466
dc.sourceScopus
dc.subjectExploratory data analysis
dc.subjectOlder statistics
dc.subjectOutlier region
dc.subjectTolerance limits
dc.typeArticle
dc.contributor.departmentSTATISTICS & APPLIED PROBABILITY
dc.description.doi10.1198/016214504000001466
dc.description.sourcetitleJournal of the American Statistical Association
dc.description.volume100
dc.description.issue470
dc.description.page642-652
dc.identifier.isiut000233311300025
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

SCOPUSTM   
Citations

46
checked on Dec 6, 2022

WEB OF SCIENCETM
Citations

42
checked on Dec 6, 2022

Page view(s)

216
checked on Dec 8, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.