Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

Xie, Kangyang; Yang, Binbin; Chen, Hao; Wang, Meng; Zou, Cheng; Xue, Hui; Yang, Ming; Shen, Chunhua

arXiv:2403.11077 (cs)

[Submitted on 17 Mar 2024 (v1), last revised 19 Mar 2024 (this version, v2)]

Abstract: Beyond the strength of text-to-image diffusion models in generating high-quality images, recent studies have attempted to adapt their learned semantic knowledge to visual perception tasks. In this work, instead of translating a generative diffusion model into a visual perception model, we explore how to retain the generative ability while adding the perceptive adaptation. To accomplish this, we present Zippo, a unified framework that zips the color and transparency distributions into a single diffusion model by expanding the diffusion latent into a joint representation of RGB images and alpha mattes. By alternately selecting one modality as the condition and applying the diffusion process to the counterpart modality, Zippo is capable of generating RGB images from alpha mattes and predicting transparency from input images. In addition to single-modality prediction, we propose a modality-aware noise reassignment strategy that further enables Zippo to jointly generate RGB images and their corresponding alpha mattes under text guidance. Our experiments showcase Zippo's ability to perform efficient text-conditioned transparent image generation and present plausible results for Matte-to-RGB and RGB-to-Matte translation.
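
The idea of a joint latent in which one modality serves as a clean condition while the other is diffused can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `noise_joint_latent`, the mode strings, the channel-wise concatenation, and the single scalar `noise_level` (a stand-in for a real noise schedule) are all assumptions made for illustration, and the abstract does not specify how the modality-aware noise reassignment is actually carried out.

```python
import numpy as np


def noise_joint_latent(rgb_latent, alpha_latent, noise_level, mode, rng):
    """Build a joint RGB + alpha latent, adding noise per modality.

    Hypothetical illustration only (not taken from the paper's code).
    mode:
      "matte_to_rgb": alpha stays clean (condition), RGB is noised
      "rgb_to_matte": RGB stays clean (condition), alpha is noised
      "joint":        both modalities are noised
    noise_level: float in [0, 1]; 0 keeps the signal, 1 is pure noise.
    """
    def perturb(x):
        # Variance-preserving mix of signal and Gaussian noise.
        eps = rng.standard_normal(x.shape)
        return np.sqrt(1.0 - noise_level) * x + np.sqrt(noise_level) * eps

    if mode == "matte_to_rgb":
        rgb, alpha = perturb(rgb_latent), alpha_latent
    elif mode == "rgb_to_matte":
        rgb, alpha = rgb_latent, perturb(alpha_latent)
    elif mode == "joint":
        rgb, alpha = perturb(rgb_latent), perturb(alpha_latent)
    else:
        raise ValueError(f"unknown mode: {mode!r}")

    # Assumed layout: the two modalities are stacked along the channel axis.
    return np.concatenate([rgb, alpha], axis=0)
```

For example, calling `noise_joint_latent(rgb, alpha, 0.5, "matte_to_rgb", np.random.default_rng(0))` on two latents of shape `(4, 8, 8)` returns an array of shape `(8, 8, 8)` whose last four channels are the untouched alpha latent, so a denoiser would see the matte as a clean condition while only the RGB half is diffused.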

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.11077 [cs.CV]
	(or arXiv:2403.11077v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.11077

Submission history

From: Chunhua Shen
[v1] Sun, 17 Mar 2024 04:02:39 UTC (46,042 KB)
[v2] Tue, 19 Mar 2024 08:21:02 UTC (28,350 KB)
