Please use this identifier to cite or link to this item: https://doi.org/10.1186/s12918-018-0538-1
Title: SPSNet: Subpopulation-sensitive network-based analysis of heterogeneous gene expression data
Authors: Belorkar, A
Vadigepalli, R
Wong, L 
Keywords: acute lymphoblastic leukemia
false positive result
gene expression profiling
genetics
human
liver cell carcinoma
liver tumor
phenotype
procedures
Carcinoma, Hepatocellular
False Positive Reactions
Gene Expression Profiling
Humans
Liver Neoplasms
Phenotype
Precursor Cell Lymphoblastic Leukemia-Lymphoma
Issue Date: 2018
Citation: Belorkar, A, Vadigepalli, R, Wong, L (2018). SPSNet: Subpopulation-sensitive network-based analysis of heterogeneous gene expression data. BMC Systems Biology 12 : 28. ScholarBank@NUS Repository. https://doi.org/10.1186/s12918-018-0538-1
Abstract: Background: Transcriptomic datasets often contain undeclared heterogeneity arising from biological variation such as diversity of disease subtypes, treatment subgroups, time-series gene expression, nested experimental conditions, as well as technical variation due to batch effects, platform differences in integrated meta-analyses, etc. However, current analysis approaches are primarily designed to handle comparisons between experimental conditions represented by homogeneous samples, thus precluding the discovery of underlying subphenotypes. Unsupervised methods for subtype identification are typically based on individual gene level analysis, which often result in irreproducible gene signatures for potential subtypes. Emerging methods to study heterogeneity have been largely developed in the context of single-cell datasets containing hundreds to thousands of samples, limiting their use to select contexts. Results: We present a novel analysis method, SPSNet, which identifies subtype-specific gene expression signatures based on the activity of subnetworks in biological pathways. SPSNet identifies the gene subnetworks capturing the diversity of underlying biological mechanisms, indicating potential sample subphenotypes. In the presence of extrinsic or non-biological heterogeneity (e.g. batch effects), SPSNet identifies subnetworks that are particularly affected by such variation, thus helping eliminate factors irrelevant to the biology of the phenotypes under study. Conclusion: Using multiple publicly available datasets, we illustrate that SPSNet is able to consistently uncover patterns within gene expression data that correspond to meaningful heterogeneity of various origins. We also demonstrate the performance of SPSNet as a sensitive and reliable tool for understanding the structure and nature of such heterogeneity. © 2018 The Author(s).
Source Title: BMC Systems Biology
URI: https://scholarbank.nus.edu.sg/handle/10635/173733
ISSN: 17520509
DOI: 10.1186/s12918-018-0538-1
Appears in Collections:Staff Publications
Elements

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1186_s12918-018-0538-1.pdf2.89 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.