Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.patter.2020.100025
DC FieldValue
dc.titleAvoid Oversimplifications in Machine Learning: Going beyond the Class-Prediction Accuracy
dc.contributor.authorHo, Sung Yang
dc.contributor.authorWong, Limsoon
dc.contributor.authorBin Goh, Wilson Wen
dc.date.accessioned2023-06-12T05:02:29Z
dc.date.available2023-06-12T05:02:29Z
dc.date.issued2020-05-08
dc.identifier.citationHo, Sung Yang, Wong, Limsoon, Bin Goh, Wilson Wen (2020-05-08). Avoid Oversimplifications in Machine Learning: Going beyond the Class-Prediction Accuracy. PATTERNS 1 (2). ScholarBank@NUS Repository. https://doi.org/10.1016/j.patter.2020.100025
dc.identifier.issn2666-3899
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/241856
dc.description.abstractClass-prediction accuracy provides a quick but superficial way of determining classifier performance. It does not inform on the reproducibility of the findings or whether the selected or constructed features used are meaningful and specific. Furthermore, the class-prediction accuracy oversummarizes and does not inform on how training and learning have been accomplished: two classifiers providing the same performance in one validation can disagree on many future validations. It does not provide explainability in its decision-making process and is not objective, as its value is also affected by class proportions in the validation set. Despite these issues, this does not mean we should omit the class-prediction accuracy. Instead, it needs to be enriched with accompanying evidence and tests that supplement and contextualize the reported accuracy. This additional evidence serves as augmentations and can help us perform machine learning better while avoiding naive reliance on oversimplified metrics. There is a huge potential for machine learning, but blind reliance on oversimplified metrics can mislead. Class-prediction accuracy is a common metric used for determining classifier performance. This article provides examples to show how the class-prediction accuracy is superficial and even misleading. We propose some augmentative measures to supplement the class-prediction accuracy. This in turn helps us to better understand the quality of learning of the classifier. Class-prediction accuracy is an evaluative method for machine-learning classifiers. However, this method is simple and may produce spurious interpretations when used without caution. Contextualization, dimensionality reduction approaches, and bootstrapping with Jaccard coefficients are possible strategies that can be used to better inform the learning outcome.
dc.language.isoen
dc.publisherELSEVIER
dc.sourceElements
dc.subjectScience & Technology
dc.subjectTechnology
dc.subjectComputer Science, Artificial Intelligence
dc.subjectComputer Science, Information Systems
dc.subjectComputer Science, Interdisciplinary Applications
dc.subjectComputer Science
dc.typeReview
dc.date.updated2023-06-06T02:10:19Z
dc.contributor.departmentNUS GRADUATE SCHOOL
dc.description.doi10.1016/j.patter.2020.100025
dc.description.sourcetitlePATTERNS
dc.description.volume1
dc.description.issue2
dc.published.statePublished
Appears in Collections:Staff Publications
Elements

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Avoid Oversimplifications in Machine Learning Going beyond the Class-Prediction Accuracy.pdf763.31 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.