Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/159893
Title: EFFICIENT DEEP CONVOLUTIONAL ARCHITECTURES FOR IMAGE AND VIDEO REPRESENTATION LEARNING
Authors: CHEN YUNPENG
Keywords: Computer Vision, Neural Networks, Deep Learning, CNN, Classification, Recognition
Issue Date: 17-Apr-2019
Citation: CHEN YUNPENG (2019-04-17). EFFICIENT DEEP CONVOLUTIONAL ARCHITECTURES FOR IMAGE AND VIDEO REPRESENTATION LEARNING. ScholarBank@NUS Repository.
Abstract: This thesis focuses on improving the quality of the learned deep feature representations by designing more efficient network architectures. It starts from improving the micro building blocks of existing deep convolutional neural networks, follows by improving the global topologies of deep convolutional neural networks, and ends with introducing non-convolutional features to compensate the limitations of convolutional networks. The thesis includes Mulit-fiber Unit, Octave Convolution Operator, Dual Path Networks, Group Orthogonal Networks, Collective Learning Unit, Double Attention Unit, and Graph-based Global Reasoning Unit, all of which are proposed during my Ph.D. period. These proposed methods are extensively evaluated on standard benchmark datasets include ImageNet, MS-COCO, Pascal VOC, Cityscapes for 2D image recognition tasks, and Kinetics for 3D video recognition tasks.
URI: https://scholarbank.nus.edu.sg/handle/10635/159893
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
ChenYunpeng.pdf6.39 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.