Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/163550
Title: EXAMINING REPLICABILITY OF MACHINE LEARNING METHODS FOR VARIABLE IMPORTANCE AND PERFORMANCE MEASURES
Authors: RANJITH VIJAYAKUMAR
Keywords: machine learning, replicability, variable selection, predictive accuracy, data mining, model comparison
Issue Date: 5-Aug-2019
Citation: RANJITH VIJAYAKUMAR (2019-08-05). EXAMINING REPLICABILITY OF MACHINE LEARNING METHODS FOR VARIABLE IMPORTANCE AND PERFORMANCE MEASURES. ScholarBank@NUS Repository.
Abstract: This thesis attempts to examine the replicability of machine learning methods in two different aspects. The first set of studies examined the replicability of variable importance using empirical and simulated data with linear interaction and square terms. Machine learning performed similarly to regression for linear and interaction on predictive accuracy and variable importance; for square data, machine learning methods performed better in both predictive accuracy and variable importance. However, for simulations on empirical data, this congruency was lost: better predictive accuracy did not lead to better replicability of variables. The second set of studies assessed the replicability of a common effect size of interest in machine learning methods, adapting replication procedures in psychology for these means. An empirical dataset was used to illustrate these methods; the results tentatively suggest that replication of explained variance is not guaranteed for machine learning, and different goals of replication can lead to divergent findings.
URI: https://scholarbank.nus.edu.sg/handle/10635/163550
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
VijayakumarR.pdf3.68 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.