Please use this identifier to cite or link to this item: https://doi.org/10.1038/s41746-020-0247-1
DC FieldValue
dc.titleTechnical and imaging factors influencing performance of deep learning systems for diabetic retinopathy
dc.contributor.authorYip, M.Y.T.
dc.contributor.authorLim, G.
dc.contributor.authorLim, Z.W.
dc.contributor.authorNguyen, Q.D.
dc.contributor.authorChong, C.C.Y.
dc.contributor.authorYu, M.
dc.contributor.authorBellemo, V.
dc.contributor.authorXie, Y.
dc.contributor.authorLee, X.Q.
dc.contributor.authorHamzah, H.
dc.contributor.authorHo, J.
dc.contributor.authorTan, T.-E.
dc.contributor.authorSabanayagam, C.
dc.contributor.authorGrzybowski, A.
dc.contributor.authorTan, G.S.W.
dc.contributor.authorHsu, W.
dc.contributor.authorLee, M.L.
dc.contributor.authorWong, T.Y.
dc.contributor.authorTing, D.S.W.
dc.date.accessioned2021-08-23T03:15:20Z
dc.date.available2021-08-23T03:15:20Z
dc.date.issued2020-03-23
dc.identifier.citationYip, M.Y.T., Lim, G., Lim, Z.W., Nguyen, Q.D., Chong, C.C.Y., Yu, M., Bellemo, V., Xie, Y., Lee, X.Q., Hamzah, H., Ho, J., Tan, T.-E., Sabanayagam, C., Grzybowski, A., Tan, G.S.W., Hsu, W., Lee, M.L., Wong, T.Y., Ting, D.S.W. (2020-03-23). Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy. npj Digital Medicine 3 (1) : 40. ScholarBank@NUS Repository. https://doi.org/10.1038/s41746-020-0247-1
dc.identifier.issn23986352
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/198637
dc.description.abstractDeep learning (DL) has been shown to be effective in developing diabetic retinopathy (DR) algorithms, possibly tackling financial and manpower challenges hindering implementation of DR screening. However, our systematic review of the literature reveals few studies studied the impact of different factors on these DL algorithms, that are important for clinical deployment in real-world settings. Using 455, 491 retinal images, we evaluated two technical and three image-related factors in detection of referable DR. For technical factors, the performances of four DL models (VGGNet, ResNet, DenseNet, Ensemble) and two computational frameworks (Caffe, TensorFlow) were evaluated while for image-related factors, we evaluated image compression levels (reducing image size, 350, 300, 250, 200, 150 KB), number of fields (7-field, 2-field, 1-field) and media clarity (pseudophakic vs phakic). In detection of referable DR, four DL models showed comparable diagnostic performance (AUC 0.936-0.944). To develop the VGGNet model, two computational frameworks had similar AUC (0.936). The DL performance dropped when image size decreased below 250 KB (AUC 0.936, 0.900, p < 0.001). The DL performance performed better when there were increased number of fields (dataset 1: 2-field vs 1-field—AUC 0.936 vs 0.908, p < 0.001; dataset 2: 7-field vs 2-field vs 1-field, AUC 0.949 vs 0.911 vs 0.895). DL performed better in the pseudophakic than phakic eyes (AUC 0.918 vs 0.833, p < 0.001). Various image-related factors play more significant roles than technical factors in determining the diagnostic performance, suggesting the importance of having robust training and testing datasets for DL training and deployment in the real-world settings. © 2020, The Author(s).
dc.publisherNature Research
dc.rightsAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourceScopus OA2020
dc.typeArticle
dc.contributor.departmentDUKE-NUS MEDICAL SCHOOL
dc.contributor.departmentDEPARTMENT OF COMPUTER SCIENCE
dc.description.doi10.1038/s41746-020-0247-1
dc.description.sourcetitlenpj Digital Medicine
dc.description.volume3
dc.description.issue1
dc.description.page40
Appears in Collections:Staff Publications
Elements

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1038_s41746_020_0247_1.pdf1.89 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons