Please use this identifier to cite or link to this item:
Title: Dealing with missing values in DNA microarray
Authors: CAO YI
Keywords: microarray data mining, missing value imputation, non-parametric regression, principal component analysis, robust regression, clustering analysis
Issue Date: 19-Aug-2008
Citation: CAO YI (2008-08-19). Dealing with missing values in DNA microarray. ScholarBank@NUS Repository.
Abstract: Microarray data has been used in a large number of studies covering a broad range of areas in biology. Missing values are often encountered when analyzing microarray gene expression data. It is essential that the estimates for the missing gene expression values are accurate to make the subsequent analysis as informative as possible.In this study, we first develop nonparametric regression approach (NPRA) for imputation, which can capture both linear and non-linear relations between genes. We further take advantage of relations between arrays to improve imputation accuracy. In order to deal with outliers in microarray, we employ robust principal component analysis (RPCA) imputation method.Furthermore, we construct a missing value imputation framework, by combining the estimates from NPRA and RPCA respectively. Finally, we focus on investigating the impact of missing values and imputation on gene clustering analysis, and justify that clustering accuracy is also a measure to assess imputation methods.
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
CaoY.pdf2.92 MBAdobe PDF



Page view(s)

checked on Apr 18, 2019


checked on Apr 18, 2019

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.