Please use this identifier to cite or link to this item: https://doi.org/10.1109/icpr48806.2021.9411917
Title: Learning with Delayed Feedback
Authors: Pranavan, Theivendiram
SIM MONG CHENG,TERENCE 
Issue Date: 10-Jan-2021
Publisher: IEEE
Citation: Pranavan, Theivendiram, SIM MONG CHENG,TERENCE (2021-01-10). Learning with Delayed Feedback. 2020 25th International Conference on Pattern Recognition (ICPR). ScholarBank@NUS Repository. https://doi.org/10.1109/icpr48806.2021.9411917
Abstract: We propose a novel supervised machine learning strategy, inspired by human learning, that enables an Agent to learn continually over its lifetime. A natural consequence is that the Agent must be able to handle an input whose label is delayed until a later time, or may not arrive at all. Our Agent learns in two steps: a short Seeding phase, in which the Agent’s model is initialized with labelled inputs, and an indefinitely long Growing phase, in which the Agent refines and assesses its model if the label is given for an input, but stores the input in a finitelength queue if the label is missing. Queued items are matched against future input-label pairs that arrive, and the model is then updated. Our strategy also allows for the delayed feedback to take a different form. For example, in an image captioning task, the feedback could be a semantic segmentation rather than a textual caption. We show with many experiments that our strategy enables an Agent to learn flexibly and efficiently.
Source Title: 2020 25th International Conference on Pattern Recognition (ICPR)
URI: https://scholarbank.nus.edu.sg/handle/10635/191903
DOI: 10.1109/icpr48806.2021.9411917
Appears in Collections:Staff Publications
Elements

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Learning_with_Delayed_Feedback.pdf614.26 kBAdobe PDF

OPEN

Post-printView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.