Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/119798
Title: Online Learning and Planning of Dynamical Systems Using Gaussian Processes
Authors: ANKIT GOYAL
Keywords: Reinforcement learning, Gaussian processes, online search based planning, structured HIV treatment, Swing-up pendulum control, Machine learning
Issue Date: 13-Jan-2015
Citation: ANKIT GOYAL (2015-01-13). Online Learning and Planning of Dynamical Systems Using Gaussian Processes. ScholarBank@NUS Repository.
Abstract: Decision-making problems with complicated and/or partially unknown underlying generative process and limited data has been quite pervasive in several research areas including robotics, automatic control, operations research, artificial intelligence, economics, medicine etc. In such areas, we can take great advantage from algorithms that learn from data and aid decision making. Over years, Reinforcement learning (RL) has been emerged as a general computational framework to the goal-directed experience-based learning for sequential decision making under uncertainty. However, with no task-specific knowledge, it often lacks efficiency in terms of the number of required samples. This lack of sample efficiency makes RL inapplicable to many real world problems. Thus, a central challenge in RL is how to extract more information from available experience to facilitate fast learning with little data. The contribution of this dissertation are: ? Proposal of (online) sequential (or non-episodic) reinforcement learning frame- work for modeling a variety of single agent problems and algorithms. ? Systematic treatment of model bias for sample efficiency by using Gaussian processes for model learning and using the uncertainty information for long term prediction in the planning algorithms. ? Empirical evaluation of the results for the swing-up control of simple pendulum and designing suitable (interrupted) drug strategies for HIV infected patient.
URI: http://scholarbank.nus.edu.sg/handle/10635/119798
Appears in Collections:Master's Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
AnkitGoyal.pdf2.15 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.