Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/134430
Title: SCALE-ROBUST DEEP LEARNING FOR VISUAL RECOGNITION
Authors: JIE ZEQUN
Keywords: deep learning, visual recognition
Issue Date: 17-Aug-2016
Citation: JIE ZEQUN (2016-08-17). SCALE-ROBUST DEEP LEARNING FOR VISUAL RECOGNITION. ScholarBank@NUS Repository.
Abstract: In recent years, deep learning has achieved great progress in almost all the visual recognition tasks. Nevertheless, deep learning lacks both image-level and object-level scale-robustness, making it difficult to handle the recognition tasks where testing images are in wide range of scales or contain objects with significantly diverse scales. In this thesis, we focus on improving both image-level and object-level scale-robustness for deep learning, leading to better recognition performance faced with the images and objects having large scale ranges. First, scene recognition requires scale invariance for better recognizing the captured images of diverse scales. To achieve scale invariance for scene recognition, we proposed a framework integrating the recent powerful deep convolutional networks and locality-constrained linear coding. Second, we proposed an end-to-end object detection framework based on fully convolutional networks (FCN) to detect vehicles and pedestrians. Third, existing localization strategies generally fail in producing satisfying localization accuracy for small objects. We thus proposed a novel scale-aware pixel-wise object proposal network to tackle the challenges. Fourth, in object detection, it is common that multiple objects are shown in one captured image. Existing localization algorithms usually search for possible object regions over multiple locations and scales separately, which ignore the interdependency among different objects. To incorporate global interdependency between objects into localization, we propose an effective Tree-structured Reinforcement Learning (Tree-RL) approach to sequentially search for objects by fully exploiting both the current observation and historical search paths.
URI: http://scholarbank.nus.edu.sg/handle/10635/134430
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
JieZ.pdf27.87 MBAdobe PDF

OPEN

NoneView/Download

Page view(s)

167
checked on Dec 6, 2018

Download(s)

2,048
checked on Dec 6, 2018

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.