Please use this identifier to cite or link to this item: https://doi.org/10.1145/3408308.3427612
DC FieldValue
dc.titleBalancing thermal comfort datasets: We GAN, but should we?
dc.contributor.authorQuintana, M
dc.contributor.authorSchiavon, S
dc.contributor.authorTham, KW
dc.contributor.authorMiller, C
dc.date.accessioned2021-04-15T04:58:13Z
dc.date.available2021-04-15T04:58:13Z
dc.date.issued2020-11-18
dc.identifier.citationQuintana, M, Schiavon, S, Tham, KW, Miller, C (2020-11-18). Balancing thermal comfort datasets: We GAN, but should we?. BuildSys 2020 - Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation : 120-129. ScholarBank@NUS Repository. https://doi.org/10.1145/3408308.3427612
dc.identifier.isbn9781450380614
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/189363
dc.description.abstractThermal comfort assessment for the built environment has become more available to analysts and researchers due to the proliferation of sensors and subjective feedback methods. These data can be used for modeling comfort behavior to support design and operations towards energy efficiency and well-being. By nature, occupant subjective feedback is imbalanced as indoor conditions are designed for comfort, and responses indicating otherwise are less common. This situation creates a scenario for the machine learning workflow where class balancing as a pre-processing step might be valuable for developing predictive thermal comfort classification models with high-performance. This paper investigates the various thermal comfort dataset class balancing techniques from the literature and proposes a modified conditional Generative Adversarial Network (GAN), comfortGAN, to address this imbalance scenario. These approaches are applied to three publicly available datasets, ranging from 30 and 67 participants to a global collection of thermal comfort datasets, with 1,474; 2,067; and 66,397 data points, respectively. This work finds that a classification model trained on a balanced dataset, comprised of real and generated samples from comfortGAN, has higher performance (increase between 4% and 17% in classification accuracy) than other augmentation methods tested. However, when classes representing discomfort are merged and reduced to three, better imbalanced performance is expected, and the additional increase in performance by comfortGAN shrinks to 1 - 2%. These results illustrate that class balancing for thermal comfort modeling is beneficial using advanced techniques such as GANs, but its value is diminished in certain scenarios. A discussion is provided to assist potential users in determining which scenarios this process is useful and which method works best.
dc.publisherACM
dc.sourceElements
dc.subjectData Augmentation
dc.subjectClass Balancing
dc.subjectThermal Comfort
dc.subjectGenerative Adversarial Network
dc.subjectSurvey
dc.subjectBuildings
dc.subjectMachine Learning
dc.typeArticle
dc.date.updated2021-04-15T02:33:26Z
dc.contributor.departmentBUILDING
dc.description.doi10.1145/3408308.3427612
dc.description.sourcetitleBuildSys 2020 - Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation
dc.description.page120-129
dc.published.statePublished
Appears in Collections:Elements
Staff Publications

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
2009.13154v1.pdfAccepted version1.28 MBAdobe PDF

OPEN

Post-printView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.