Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/159893
Title: | EFFICIENT DEEP CONVOLUTIONAL ARCHITECTURES FOR IMAGE AND VIDEO REPRESENTATION LEARNING | Authors: | CHEN YUNPENG | Keywords: | Computer Vision, Neural Networks, Deep Learning, CNN, Classification, Recognition | Issue Date: | 17-Apr-2019 | Citation: | CHEN YUNPENG (2019-04-17). EFFICIENT DEEP CONVOLUTIONAL ARCHITECTURES FOR IMAGE AND VIDEO REPRESENTATION LEARNING. ScholarBank@NUS Repository. | Abstract: | This thesis focuses on improving the quality of the learned deep feature representations by designing more efficient network architectures. It starts from improving the micro building blocks of existing deep convolutional neural networks, follows by improving the global topologies of deep convolutional neural networks, and ends with introducing non-convolutional features to compensate the limitations of convolutional networks. The thesis includes Mulit-fiber Unit, Octave Convolution Operator, Dual Path Networks, Group Orthogonal Networks, Collective Learning Unit, Double Attention Unit, and Graph-based Global Reasoning Unit, all of which are proposed during my Ph.D. period. These proposed methods are extensively evaluated on standard benchmark datasets include ImageNet, MS-COCO, Pascal VOC, Cityscapes for 2D image recognition tasks, and Kinetics for 3D video recognition tasks. | URI: | https://scholarbank.nus.edu.sg/handle/10635/159893 |
Appears in Collections: | Ph.D Theses (Open) |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
ChenYunpeng.pdf | 6.39 MB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.