Please use this identifier to cite or link to this item: https://doi.org/10.1109/icpr48806.2021.9411917
DC Field: Value
dc.title: Learning with Delayed Feedback
dc.contributor.author: Pranavan, Theivendiram
dc.contributor.author: SIM MONG CHENG, TERENCE
dc.date.accessioned: 2021-06-09T01:21:27Z
dc.date.available: 2021-06-09T01:21:27Z
dc.date.issued: 2021-01-10
dc.identifier.citation: Pranavan, Theivendiram, SIM MONG CHENG, TERENCE (2021-01-10). Learning with Delayed Feedback. 2020 25th International Conference on Pattern Recognition (ICPR). ScholarBank@NUS Repository. https://doi.org/10.1109/icpr48806.2021.9411917
dc.identifier.uri: https://scholarbank.nus.edu.sg/handle/10635/191903
dc.description.abstract: We propose a novel supervised machine learning strategy, inspired by human learning, that enables an Agent to learn continually over its lifetime. A natural consequence is that the Agent must be able to handle an input whose label is delayed until a later time, or may not arrive at all. Our Agent learns in two steps: a short Seeding phase, in which the Agent’s model is initialized with labelled inputs, and an indefinitely long Growing phase, in which the Agent refines and assesses its model if the label is given for an input, but stores the input in a finite-length queue if the label is missing. Queued items are matched against future input-label pairs that arrive, and the model is then updated. Our strategy also allows for the delayed feedback to take a different form. For example, in an image captioning task, the feedback could be a semantic segmentation rather than a textual caption. We show with many experiments that our strategy enables an Agent to learn flexibly and efficiently.
dc.publisher: IEEE
dc.source: Elements
dc.type: Conference Paper
dc.date.updated: 2021-06-08T09:11:18Z
dc.contributor.department: DEPARTMENT OF COMPUTER SCIENCE
dc.description.doi: 10.1109/icpr48806.2021.9411917
dc.description.sourcetitle: 2020 25th International Conference on Pattern Recognition (ICPR)
dc.published.state: Published
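
The two-phase procedure described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the record does not say which model is used or how queued items are matched, so the nearest-centroid model, the Euclidean-distance matching rule, and the names DelayedFeedbackAgent, seed, grow, queue_size and match_radius are all assumptions made for illustration.

```python
from collections import deque
import math


class DelayedFeedbackAgent:
    """Toy agent: a nearest-centroid model (assumed) plus a bounded queue."""

    def __init__(self, queue_size=3, match_radius=1.0):
        self.sums = {}    # label -> per-dimension sum of inputs seen
        self.counts = {}  # label -> number of inputs seen
        # Finite-length queue: the oldest unlabelled input is dropped when full.
        self.queue = deque(maxlen=queue_size)
        self.match_radius = match_radius  # hypothetical matching threshold

    def _update(self, x, label):
        if label not in self.sums:
            self.sums[label] = [0.0] * len(x)
            self.counts[label] = 0
        self.sums[label] = [s + v for s, v in zip(self.sums[label], x)]
        self.counts[label] += 1

    def predict(self, x):
        # Requires at least one seeded label.
        def dist_to(label):
            centroid = [s / self.counts[label] for s in self.sums[label]]
            return math.dist(x, centroid)
        return min(self.sums, key=dist_to)

    def seed(self, labelled):
        """Seeding phase: initialize the model from labelled inputs."""
        for x, label in labelled:
            self._update(x, label)

    def grow(self, x, label=None):
        """Growing phase: queue unlabelled inputs; assess and refine otherwise."""
        if label is None:
            self.queue.append(x)
            return None
        correct = self.predict(x) == label  # assess before refining
        self._update(x, label)
        # Assumed matching rule: a queued input close enough to the newly
        # labelled input inherits its label and also updates the model.
        unmatched = deque(maxlen=self.queue.maxlen)
        for q in self.queue:
            if math.dist(q, x) <= self.match_radius:
                self._update(q, label)
            else:
                unmatched.append(q)
        self.queue = unmatched
        return correct


agent = DelayedFeedbackAgent(queue_size=2, match_radius=1.0)
agent.seed([((0.0, 0.0), "a"), ((10.0, 10.0), "b")])
agent.grow((0.5, 0.5))                  # label missing: queued
agent.grow((9.5, 9.5))                  # label missing: queued
result = agent.grow((0.4, 0.6), "a")    # label arrives; nearby queued item is matched
```

In this sketch a bounded deque stands in for the finite-length queue, so inputs whose labels never arrive are eventually discarded. Feedback that takes a different form from the label, such as the segmentation example in the abstract, is not modelled here.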
Appears in Collections: Staff Publications; Elements

Files in This Item:
File: Learning_with_Delayed_Feedback.pdf
Size: 614.26 kB
Format: Adobe PDF
Access Settings: OPEN
Version: Post-print
