[Community Pipeline] MagicMix #1839

daspartho · 2022-12-26T20:27:40Z

Community pipeline based on my implementation of the MagicMix: Semantic Mixing with Diffusion Models paper.

This Diffusion Pipeline allows for the semantic mixing of an image and a text prompt to create a new concept while preserving the spatial layout and geometry of the subject in the image.

Here are some examples I reproduced from the paper using my implementation:

Input Image:

Prompt: "Bed"

Output Image:

Input Image:

Prompt: "Family"

Output Image:

Input Image:

Prompt: "ice-cream"

Output Image:

Input Image:

Prompt: "Cake"

Output Image:

@patrickvonplaten could you please take a look at it? Looking forward to any comments!
Thanks :)

Reference: #841

HuggingFaceDocBuilderDev · 2022-12-26T20:31:52Z

The documentation is not available anymore as the PR was closed or merged.

aengusng8 · 2022-12-27T03:55:09Z

Cool! But why did you put it in the community pipeline instead of the internal pipeline?

patil-suraj

Very cool, @daspartho! The PR looks good; I just left some nits.

patil-suraj · 2022-12-27T13:51:27Z

examples/community/README.md

+pipe = DiffusionPipeline.from_pretrained(
+    "CompVis/stable-diffusion-v1-4",
+    custom_pipeline="magic_mix",
+    scheduler = DDIMScheduler(beta_start=0.00085, beta_end=0.012, beta_schedule="scaled_linear", clip_sample=False, set_alpha_to_one=False),


The scheduler can be loaded using

DDIMScheduler.from_pretrained("CompVis/stable-diffusion-v1-4", subfolder="scheduler")

patil-suraj · 2022-12-27T13:52:04Z

examples/community/README.md

+pipe = DiffusionPipeline.from_pretrained(
+    "CompVis/stable-diffusion-v1-4",
+    custom_pipeline="magic_mix",


Also, maybe load the pipeline in fp16, by passing the torch_dtype argument, to make inference faster.
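Taken together, the two suggestions above could look roughly like the sketch below. It is illustrative rather than the README's final text: the helper name `build_magic_mix_pipeline` and its `use_fp16` flag are hypothetical, and the imports are deferred into the function only so that the snippet can be defined without loading any libraries or weights.

```python
def build_magic_mix_pipeline(model_id="CompVis/stable-diffusion-v1-4", use_fp16=True):
    """Hypothetical helper: load the MagicMix community pipeline with a pretrained DDIM scheduler."""
    # Heavy imports are deferred so that defining this helper has no side effects.
    import torch
    from diffusers import DDIMScheduler, DiffusionPipeline

    # Reuse the scheduler config stored with the checkpoint instead of
    # spelling out beta_start, beta_end, etc. by hand.
    scheduler = DDIMScheduler.from_pretrained(model_id, subfolder="scheduler")

    # torch_dtype=torch.float16 loads the weights in half precision.
    dtype = torch.float16 if use_fp16 else torch.float32
    return DiffusionPipeline.from_pretrained(
        model_id,
        custom_pipeline="magic_mix",
        scheduler=scheduler,
        torch_dtype=dtype,
    )
```

Half precision is generally intended for GPU inference, so a caller would typically follow this with `pipe.to("cuda")`.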

patil-suraj · 2022-12-27T13:54:08Z

examples/community/magic_mix.py

+        prompt: str,
+        kmin: float = 0.3,
+        kmax: float = 0.6,
+        v: float = 0.5,


could we use a more descriptive name for this argument? One letter variables aren't informative

daspartho · 2022-12-28T07:46:57Z

@patil-suraj made some changes :)

daspartho · 2022-12-28T07:54:49Z

> could we use a more descriptive name for this argument? One letter variables aren't informative

The v parameter is the interpolation constant used in the layout generation process, so I settled for mix_factor as the new parameter name.
It is clear and more descriptive, and it helps to convey the purpose of the parameter in the context of the code.

What do you think, @patil-suraj?
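
For readers unfamiliar with the role of this constant, here is a minimal sketch of the kind of interpolation it controls. It is not the pipeline's code: it uses plain Python lists in place of latent tensors, the function and argument names are hypothetical, and it assumes the constant weights the prompt-conditioned (denoised) latent while the remainder goes to the noised layout latent.

```python
def mix_latents(denoised, noised_layout, mix_factor=0.5):
    """Linearly interpolate two equally sized sequences of latent values.

    Assumption: mix_factor weights the prompt-conditioned (denoised) values,
    and (1 - mix_factor) weights the noised layout values.
    """
    if not 0.0 <= mix_factor <= 1.0:
        raise ValueError("mix_factor must lie in [0, 1]")
    if len(denoised) != len(noised_layout):
        raise ValueError("both inputs must have the same length")
    return [
        mix_factor * d + (1.0 - mix_factor) * n
        for d, n in zip(denoised, noised_layout)
    ]


# With mix_factor=0.5 both inputs contribute equally.
print(mix_latents([1.0, 2.0], [3.0, 4.0], mix_factor=0.5))  # [2.0, 3.0]
```

Under this assumption, a value near 1 lets the prompt dominate and a value near 0 keeps the result close to the layout of the input image.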

patil-suraj

Thanks a lot! mix_factor sounds good to me.

daspartho added 5 commits December 26, 2022 03:43

initial

87732e2

type hints

d930ced

update scheduler type hint

6847d59

add to README

a3095f1

add example generation to README

cbf4ca5

patil-suraj approved these changes Dec 27, 2022


daspartho added 2 commits December 28, 2022 11:08

v -> mix_factor

28e1a3c

load scheduler from pretrained

573dec8

daspartho requested a review from patil-suraj December 28, 2022 07:55

patil-suraj approved these changes Dec 28, 2022


patil-suraj merged commit 2ba42aa into huggingface:main Dec 28, 2022

daspartho deleted the magic_mix branch January 3, 2023 17:58
