EFFICIENT DEEP CONVOLUTIONAL ARCHITECTURES FOR IMAGE AND VIDEO REPRESENTATION LEARNING

Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/159893

Title:	EFFICIENT DEEP CONVOLUTIONAL ARCHITECTURES FOR IMAGE AND VIDEO REPRESENTATION LEARNING
Authors:	CHEN YUNPENG
Keywords:	Computer Vision, Neural Networks, Deep Learning, CNN, Classification, Recognition
Issue Date:	17-Apr-2019
Citation:	CHEN YUNPENG (2019-04-17). EFFICIENT DEEP CONVOLUTIONAL ARCHITECTURES FOR IMAGE AND VIDEO REPRESENTATION LEARNING. ScholarBank@NUS Repository.
Abstract:	This thesis focuses on improving the quality of the learned deep feature representations by designing more efficient network architectures. It starts from improving the micro building blocks of existing deep convolutional neural networks, follows by improving the global topologies of deep convolutional neural networks, and ends with introducing non-convolutional features to compensate the limitations of convolutional networks. The thesis includes Mulit-fiber Unit, Octave Convolution Operator, Dual Path Networks, Group Orthogonal Networks, Collective Learning Unit, Double Attention Unit, and Graph-based Global Reasoning Unit, all of which are proposed during my Ph.D. period. These proposed methods are extensively evaluated on standard benchmark datasets include ImageNet, MS-COCO, Pascal VOC, Cityscapes for 2D image recognition tasks, and Kinetics for 3D video recognition tasks.
URI:	https://scholarbank.nus.edu.sg/handle/10635/159893
Appears in Collections:	Ph.D Theses (Open)

File	Description	Size	Format	Access Settings	Version
ChenYunpeng.pdf		6.39 MB	Adobe PDF	OPEN	None	View/Download

Check