Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.ebiom.2021.103800
Title: Functional coding haplotypes and machine-learning feature elimination identifies predictors of Methotrexate Response in Rheumatoid Arthritis patients
Authors: Lim, Ashley JW 
Lim, Lee Jin 
Ooi, Brandon NS 
Koh, Ee Tzun
Tan, Justina Wei Lynn
Chong, Samuel S 
Khor, Chiea Chuen 
Tucker-Kellogg, Lisa 
Leong, Khai Pang
Lee, Caroline G 
Keywords: Science & Technology
Life Sciences & Biomedicine
Medicine, General & Internal
Medicine, Research & Experimental
General & Internal Medicine
Research & Experimental Medicine
Rheumatoid Arthritis
Methotrexate
Genetic polymorphism
Machine learning
Feature selection
Haplotypes
WHOLE-GENOME ASSOCIATION
QUALITY-OF-LIFE
PHYSICAL FUNCTION
DISEASE-ACTIVITY
WORK DISABILITY
CLASSIFICATION
SELECTION
SENSITIVITY
MANAGEMENT
DURATION
Issue Date: 1-Jan-2022
Publisher: ELSEVIER
Citation: Lim, Ashley JW, Lim, Lee Jin, Ooi, Brandon NS, Koh, Ee Tzun, Tan, Justina Wei Lynn, Chong, Samuel S, Khor, Chiea Chuen, Tucker-Kellogg, Lisa, Leong, Khai Pang, Lee, Caroline G (2022-01-01). Functional coding haplotypes and machine-learning feature elimination identifies predictors of Methotrexate Response in Rheumatoid Arthritis patients. EBIOMEDICINE 75. ScholarBank@NUS Repository. https://doi.org/10.1016/j.ebiom.2021.103800
Abstract: Background: Major challenges in large scale genetic association studies include not only the identification of causative single nucleotide polymorphisms (SNPs), but also accounting for SNP-SNP interactions. This study thus proposes a novel feature engineering approach integrating potentially functional coding haplotypes (pfcHap) with machine-learning (ML) feature selection to identify biologically meaningful, possibly causative genetic factors, that take into consideration potential SNP-SNP interactions within the pfcHap, to best predict for methotrexate (MTX) response in rheumatoid arthritis (RA) patients. Methods: Exome sequencing from 349 RA patients were analysed, of which they were split into training and unseen test set. Inferred pfcHaps were combined with 30 non-genetic features to undergo ML recursive feature elimination with cross-validation using the training set. Predictive capacity and robustness of the selected features were assessed using six popular machine learning models through a train set cross-validation and evaluated in an unseen test set. Findings: Significantly, 100 features (95 pfcHaps, 5 non-genetic factors) were identified to have good predictive performance (AUC: 0.776-0.828; Sensitivity: 0.656-0.813; Specificity: 0.684-0.868) across all six ML models in an unseen test dataset for the prediction of MTX response in RA patients. Interpretation: Majority of the predictive pfcHap SNPs were predicted to be potentially functional and some of the genes in which the pfcHap resides in were identified to be associated with previously reported MTX/RA pathways. Funding: Singapore Ministry of Health's National Medical Research Council (NMRC) [NMRC/CBRG/0095/2015; CG12Aug17; CGAug16M012; NMRC/CG/017/2013]; National Cancer Center Research Fund and block funding Duke-NUS Medical School.; Singapore Ministry of Education Academic Research Fund Tier 2 grant MOE2019-T2-1-138.
Source Title: EBIOMEDICINE
URI: https://scholarbank.nus.edu.sg/handle/10635/226777
ISSN: 23523964
DOI: 10.1016/j.ebiom.2021.103800
Appears in Collections:Staff Publications
Elements

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Ashley_EBioMedicine.pdfPublished version3.37 MBAdobe PDF

OPEN

PublishedView/Download
Functional coding haplotypes and machine-learning feature elimination identifies predictors of Methotrexate Response in Rheu.pdfPublished version3.37 MBAdobe PDF

OPEN

PublishedView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.