Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis | ScholarBank@NUS

Please use this identifier to cite or link to this item: https://doi.org/10.1007/s11263-023-01805-x

Title:	Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis
Authors:	Zhang, Xuanmeng Zheng, Zhedong Gao, Daiheng Zhang, Bang Yang, Yi Chua, Tat-Seng
Keywords:	Science & Technology Technology Computer Science, Artificial Intelligence Computer Science Generative adversarial networks Neural radiance fields 3D-aware image synthesis
Issue Date:	Aug-2023
Publisher:	SPRINGER
Citation:	Zhang, Xuanmeng, Zheng, Zhedong, Gao, Daiheng, Zhang, Bang, Yang, Yi, Chua, Tat-Seng (2023-08). Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis. INTERNATIONAL JOURNAL OF COMPUTER VISION 131 (8) : 2219-2242. ScholarBank@NUS Repository. https://doi.org/10.1007/s11263-023-01805-x
Abstract:	This paper studies compositional 3D-aware image synthesis for both single-object and multi-object scenes. We observe that two challenges remain in this field: existing approaches (1) lack geometry constraints and thus compromise the multi-view consistency of the single object, and (2) can not scale to multi-object scenes with complex backgrounds. To address these challenges coherently, we propose multi-view consistent generative adversarial networks (MVCGAN) for compositional 3D-aware image synthesis. First, we build the geometry constraints on the single object by leveraging the underlying 3D information. Specifically, we enforce the photometric consistency between pairs of views, encouraging the model to learn the inherent 3D shape. Second, we adapt MVCGAN to multi-object scenarios. In particular, we formulate the multi-object scene generation as a “decompose and compose” process. During training, we adopt the top-down strategy to decompose training images into objects and backgrounds. When rendering, we deploy a reverse bottom-up manner by composing the generated objects and background into the holistic scene. Extensive experiments on both single-object and multi-object datasets show that the proposed method achieves competitive performance for 3D-aware image synthesis.
Source Title:	INTERNATIONAL JOURNAL OF COMPUTER VISION
URI:	https://scholarbank.nus.edu.sg/handle/10635/245843
ISSN:	0920-5691 1573-1405
DOI:	10.1007/s11263-023-01805-x
Appears in Collections:	Staff Publications Elements

Show full item record

Files in This Item:

File	Description	Size	Format	Access Settings	Version
Zhang_MVCGAN.pdf	Published version	34.32 MB	Adobe PDF	OPEN	Published	View/Download

Google Scholar^TM

Check

Altmetric

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.