Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/231419
DC FieldValue
dc.titleVARIATIONAL DISTRIBUTION DESIGNS FOR APPROXIMATE THOMPSON SAMPLING IN DEEP REINFORCEMENT LEARNING
dc.contributor.authorSIDDHARTH ARAVINDAN
dc.date.accessioned2022-09-27T18:00:26Z
dc.date.available2022-09-27T18:00:26Z
dc.date.issued2022-01-23
dc.identifier.citationSIDDHARTH ARAVINDAN (2022-01-23). VARIATIONAL DISTRIBUTION DESIGNS FOR APPROXIMATE THOMPSON SAMPLING IN DEEP REINFORCEMENT LEARNING. ScholarBank@NUS Repository.
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/231419
dc.description.abstractExploration is a vital ingredient in reinforcement learning algorithms that has largely contributed to its success in various applications. Standard naive exploration strategies used in deep reinforcement learning are effective in simple tasks, but do not perform well in tasks with high dimensional state-action spaces as they are undirected. Thompson sampling is a directed, well-known and principled approach for balancing exploration and exploitation. But it requires the posterior distribution over the action-value functions or environment models to be maintained; this is generally computationally intractable for tasks that have a high dimensional state-action space. In this thesis, we argue that incorporating domain knowledge during the formulation of variational distributions for approximating these posterior distributions is useful in reinforcement learning. We explore this assertion by designing variational distributions, namely SANE and EVaDE for two different scenarios in the model-free and model-based reinforcement learning settings respectively.
dc.language.isoen
dc.subjectExploration, Thompson Sampling, Reinforcement Learning, Variational Learning, Deep Learning, Artifical Intelligence
dc.typeThesis
dc.contributor.departmentCOMPUTER SCIENCE
dc.contributor.supervisorWee Sun Lee
dc.description.degreePh.D
dc.description.degreeconferredDOCTOR OF PHILOSOPHY (SOC)
dc.identifier.orcid0000-0002-1782-5936
Appears in Collections:Ph.D Theses (Open)

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
AravindanS.pdf10.73 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.