3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Zhang, Frank; Zhang, Yibo; Zheng, Quan; Ma, Rui; Hua, Wei; Bao, Hujun; Xu, Weiwei; Zou, Changqing

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.09439 (cs)

[Submitted on 14 Mar 2024]

Title:3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Authors:Frank Zhang, Yibo Zhang, Quan Zheng, Rui Ma, Wei Hua, Hujun Bao, Weiwei Xu, Changqing Zou

View PDF HTML (experimental)

Abstract:Text-driven 3D scene generation techniques have made rapid progress in recent years. Their success is mainly attributed to using existing generative models to iteratively perform image warping and inpainting to generate 3D scenes. However, these methods heavily rely on the outputs of existing models, leading to error accumulation in geometry and appearance that prevent the models from being used in various scenarios (e.g., outdoor and unreal scenarios). To address this limitation, we generatively refine the newly generated local views by querying and aggregating global 3D information, and then progressively generate the 3D scene. Specifically, we employ a tri-plane features-based NeRF as a unified representation of the 3D scene to constrain global 3D consistency, and propose a generative refinement network to synthesize new contents with higher quality by exploiting the natural image prior from 2D diffusion model as well as the global 3D information of the current scene. Our extensive experiments demonstrate that, in comparison to previous methods, our approach supports wide variety of scene generation and arbitrary camera trajectories with improved visual quality and 3D consistency.

Comments:	11 pages, 7 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.09439 [cs.CV]
	(or arXiv:2403.09439v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.09439

Submission history

From: Zhang Songchun [view email]
[v1] Thu, 14 Mar 2024 14:31:22 UTC (6,998 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators