Please use this identifier to cite or link to this item:
https://doi.org/10.1038/sdata.2018.136
DC Field | Value | |
---|---|---|
dc.title | A merged lung cancer transcriptome dataset for clinical predictive modeling | |
dc.contributor.author | Lim, S.B | |
dc.contributor.author | Tan, S.J | |
dc.contributor.author | Lim, W.-T | |
dc.contributor.author | Lim, C.T | |
dc.date.accessioned | 2020-09-09T03:09:35Z | |
dc.date.available | 2020-09-09T03:09:35Z | |
dc.date.issued | 2018 | |
dc.identifier.citation | Lim, S.B, Tan, S.J, Lim, W.-T, Lim, C.T (2018). A merged lung cancer transcriptome dataset for clinical predictive modeling. Scientific data 5 : 180136. ScholarBank@NUS Repository. https://doi.org/10.1038/sdata.2018.136 | |
dc.identifier.issn | 20524463 | |
dc.identifier.uri | https://scholarbank.nus.edu.sg/handle/10635/175051 | |
dc.description.abstract | The Gene Expression Omnibus (GEO) database is an excellent public source of whole transcriptomic profiles of multiple cancers. The main challenge is the limited accessibility of such large-scale genomic data to people without a background in bioinformatics or computer science. This presents difficulties in data analysis, sharing and visualization. Here, we present an integrated bioinformatics pipeline and a normalized dataset that has been preprocessed using a robust statistical methodology; allowing others to perform large-scale meta-analysis, without having to conduct time-consuming data mining and statistical correction. Comprising 1,118 patient-derived samples, the normalized dataset includes primary non-small cell lung cancer (NSCLC) tumors and paired normal lung tissues from ten independent GEO datasets, facilitating differential expression analysis. The data has been merged, normalized, batch effect-corrected and filtered for genes with low variance via multiple open source R packages integrated into our workflow. Overall this dataset (with associated clinical metadata) better represents the diseased population and serves as a powerful tool for early predictive biomarker discovery. | |
dc.source | Unpaywall 20200831 | |
dc.subject | transcriptome | |
dc.subject | biology | |
dc.subject | data analysis | |
dc.subject | factual database | |
dc.subject | gene expression profiling | |
dc.subject | genetics | |
dc.subject | human | |
dc.subject | lung tumor | |
dc.subject | non small cell lung cancer | |
dc.subject | procedures | |
dc.subject | Carcinoma, Non-Small-Cell Lung | |
dc.subject | Computational Biology | |
dc.subject | Data Analysis | |
dc.subject | Databases, Factual | |
dc.subject | Gene Expression Profiling | |
dc.subject | Humans | |
dc.subject | Lung Neoplasms | |
dc.subject | Transcriptome | |
dc.type | Article | |
dc.contributor.department | DUKE-NUS MEDICAL SCHOOL | |
dc.contributor.department | BIOENGINEERING | |
dc.description.doi | 10.1038/sdata.2018.136 | |
dc.description.sourcetitle | Scientific data | |
dc.description.volume | 5 | |
dc.description.page | 180136 | |
Appears in Collections: | Elements Staff Publications |
Show simple item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
10_1038_sdata_2018_136.pdf | 2.71 MB | Adobe PDF | OPEN | None | View/Download |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.