Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/204930
Title: TOWARDS A CLINICAL PATHOGEN IDENTIFICATION SYSTEM USING DEEP LEARNING AND BAYESIAN INFERENCE ON HIGH PERFORMANCE COMPUTING PLATFORMS
Authors: MA HAORAN
Keywords: Pathogen identification, high performance computing, convolutional neural networks, taxonomic assignment, human viruses, Bayesian inference
Issue Date: 18-Jan-2021
Citation: MA HAORAN (2021-01-18). TOWARDS A CLINICAL PATHOGEN IDENTIFICATION SYSTEM USING DEEP LEARNING AND BAYESIAN INFERENCE ON HIGH PERFORMANCE COMPUTING PLATFORMS. ScholarBank@NUS Repository.
Abstract: The advent of next generation sequencing (NGS) has provided a powerful approach to identify pathogens through bioinformatics pipelines. However, there are several limitations of current bioinformatics tools: (i) the scalability is usually not optimized (ii) the accuracy on divergent species is limited and (iii) the interpretation of a pathogen report may not be intuitive for clinicians. To address the scalability of bioinformatics pipelines, we applied profiling and different optimization strategies for high performance computing (HPC) systems on SURPI pipeline and successfully accelerated it and demonstrated its scalability. To address the accuracy of taxonomic assignment, we developed a multi-task convolutional neural network (MT-CNN) to perform both pathogen identification and genomic coverage estimation for human viruses which outperformed current state-of-the-art tools. Finally, we explored how the interpretability of pathogen reports could be enhanced by incorporating a Bayesian approach to assigning the likelihood of a pathogen based on relevant factors.
URI: https://scholarbank.nus.edu.sg/handle/10635/204930
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Thesis_MaHaoran_A0161142W.pdf6.56 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.