
create faces with stylegan and dalle? #27

Open
molo32 opened this issue Jan 24, 2021 · 12 comments
@molo32

molo32 commented Jan 24, 2021

amazing job! I was wondering if anyone has a colab notebook to create faces from text, with or without stylegan2, for example.

@lucidrains
Owner

@molo32 yes! it is now possible, but DALLE is not needed. by simply combining CLIP with Stylegan2, we can now summon images from the latent space of a trained generator. I will add it to my stylegan2 repository https://github.com/lucidrains/stylegan2-pytorch in due time :) focused on equivariant attention for alphafold2 at the moment

@lucidrains
Owner

@molo32 somebody else already did it :) you can try it at https://twitter.com/advadnoun/status/1353453719510163459?s=20

@powderblock

> @molo32 somebody else already did it :) you can try it at https://twitter.com/advadnoun/status/1353453719510163459?s=20

wow this is awesome!!! any idea how to use this for faces?

@lucidrains
Owner

@powderblock it'll work with any generator! the latent space has suddenly become infinitely more traversable, by way of another neural network as the guide :)

@rom1504
Contributor

rom1504 commented Jan 25, 2021

Is this done simply by generating a batch of images with a generator, ranking them with CLIP, and trying again randomly until the dot product is high enough?
Or could this instead be done by backpropagating the error, as stylegan-encoder does? (https://github.com/Puzer/stylegan-encoder)

@lucidrains
Owner

@rom1504 both!
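Both approaches can be sketched with toy stand-ins (everything below is hypothetical: `G` is a made-up linear "generator", `clip_score` a made-up cosine similarity playing the role of CLIP; a real implementation would use StyleGAN2 plus CLIP, with autograd instead of numerical gradients):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins: a linear "generator" G mapping latents to image embeddings,
# and a fixed "text embedding" for the prompt. In practice these would be
# StyleGAN2 and CLIP's image/text encoders.
LATENT_DIM, EMBED_DIM = 16, 8
G = rng.normal(size=(LATENT_DIM, EMBED_DIM))
text = rng.normal(size=EMBED_DIM)
text /= np.linalg.norm(text)

def clip_score(z):
    """Cosine similarity between the generated image embedding and the text."""
    img = z @ G
    return float(img @ text / np.linalg.norm(img))

# Approach 1: sample-and-rank. Draw a batch of random latents, score each
# with "CLIP", keep the best (and repeat until the score is high enough).
batch = rng.normal(size=(256, LATENT_DIM))
scores = np.array([clip_score(z) for z in batch])
z_ranked = batch[scores.argmax()]

# Approach 2: optimize a single latent by gradient ascent on the score,
# the way stylegan-encoder backpropagates a reconstruction error.
# Central differences stand in for autograd here.
def grad(z, eps=1e-4):
    g = np.zeros_like(z)
    for i in range(z.size):
        dz = np.zeros_like(z)
        dz[i] = eps
        g[i] = (clip_score(z + dz) - clip_score(z - dz)) / (2 * eps)
    return g

z_opt = rng.normal(size=LATENT_DIM)
start = clip_score(z_opt)
for _ in range(500):
    z_opt += 0.5 * grad(z_opt)

print(f"random start {start:+.3f}  ranked {clip_score(z_ranked):+.3f}  "
      f"optimized {clip_score(z_opt):+.3f}")
```

Ranking is embarrassingly parallel but bounded by the best sample in the batch; the gradient route can push the score much closer to its maximum from a single starting latent.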

@rom1504
Contributor

rom1504 commented Jan 25, 2021

Using a multimodal encoder trained for similarity together with a generator in that way seems like a really powerful idea.
I wonder if CLIP would also work to generate very accurate descriptions (take a picture, use a language model to generate text, backpropagate... until you get a good enough dot product with the picture).
And more generally, if we had more such multimodal encoders (text-audio, text-3D-model, ...), it seems to open the gate to generating almost anything.

@lucidrains
Owner

lucidrains commented Jan 25, 2021

@rom1504 if you follow Mario @ quasimodo on twitter, he was able to coax text out of CLIP. i think the surer thing to do is to rank text generations from existing captioning transformers, as an alternative to beam search
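That ranking idea fits in a few lines: sample several candidate captions from a captioning model, embed each alongside the image, and keep the highest-scoring one instead of the single beam-search output. Everything below is a toy stand-in: `embed` is a hypothetical hash-based encoder, not CLIP, and the candidate list stands in for sampled transformer outputs, so the "winning" caption here is arbitrary — only the selection mechanism is real.

```python
import numpy as np

EMBED_DIM = 8

def embed(text):
    # Hypothetical stand-in for a CLIP encoder: map a string to a
    # deterministic unit vector (char-sum seed avoids Python's salted hash).
    seed = sum(ord(c) for c in text)
    r = np.random.default_rng(seed)
    v = r.normal(size=EMBED_DIM)
    return v / np.linalg.norm(v)

# Pretend image embedding; in practice this comes from CLIP's image encoder.
image = embed("a portrait photo of a person")

# Candidate captions, e.g. sampled from a caption transformer instead of
# taking only the beam-search argmax.
candidates = [
    "a dog running on grass",
    "a photo of a person's face",
    "an abstract painting",
    "a bowl of fruit",
]

# Rank by CLIP-style image-text similarity and keep the best.
scores = {c: float(embed(c) @ image) for c in candidates}
best = max(scores, key=scores.get)
print(best, scores[best])
```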

@lucidrains
Owner

@rom1504 yes, multimodal is here, attention was all we need

@rom1504
Contributor

rom1504 commented Jan 25, 2021

Is that https://twitter.com/Quasimodo ? Seems to be unavailable

@lucidrains
Owner

oops, @ quasimondo
