video support #12

ehartford · 2023-08-19T21:46:45Z

(rewriting sloppy request)
I was wondering if video support can be added?

At first I found lucidrains' video-diffusion-pytorch
https://github.com/lucidrains/video-diffusion-pytorch

But after some research, it seems like zeroscope might be the right model to use
https://huggingface.co/cerspense/zeroscope_v2_576w

leejet · 2023-08-20T04:13:05Z

This model appears to be significantly different from stable-diffusion, so there are no plans to support it currently. If there's time in the future, I will consider providing support for it.

ehartford · 2023-08-20T11:55:11Z

I didn't necessarily mean this specific model, more "video" in general.

I think zeroscope would probably be the right place to start.

Sorry for being sloppy.

https://huggingface.co/cerspense/zeroscope_v2_576w

leejet · 2023-08-21T00:28:30Z

It looks like this needs some work, and there are no plans to support it currently. Maybe in the future?

Green-Sky · 2023-11-21T20:06:11Z

Stable Video Diffusion (SVD) models from Stability were released!

> SVD was trained to generate 14 frames at resolution 576x1024 given a context frame of the same size. We use the standard image encoder from SD 2.1, but replace the decoder with a temporally-aware deflickering decoder.

https://stability.ai/news/stable-video-diffusion-open-ai-video-model
https://huggingface.co/stabilityai/stable-video-diffusion-img2vid / https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt
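
To make the numbers in the quote above concrete, here is a rough sketch of the latent tensor shape for one clip. It assumes the usual Stable Diffusion VAE settings (8x spatial downsampling, 4 latent channels) and that 576 is the height and 1024 the width; `latent_shape` is a hypothetical helper written for this illustration, not part of any library.

```python
def latent_shape(frames, height, width, vae_factor=8, latent_channels=4):
    """Return (frames, channels, latent_height, latent_width) for one clip.

    vae_factor and latent_channels are assumptions based on the SD VAE.
    """
    if height % vae_factor or width % vae_factor:
        raise ValueError("height and width must be multiples of the VAE factor")
    return (frames, latent_channels, height // vae_factor, width // vae_factor)


# 14 frames at 576x1024, as stated in the quote.
print(latent_shape(frames=14, height=576, width=1024))
```

Under these assumptions a single clip (batch size 1) is a 4D latent of shape (14, 4, 72, 128).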

leejet · 2023-11-22T12:52:32Z

The SVD demo looks quite good. I'll make time in the next few days to study it, starting by running the official code to see its performance.

Amin456789 · 2023-11-27T17:47:21Z

Patiently waiting for SVD support to be released and quantized!

FSSRepo · 2023-11-28T03:23:21Z

@leejet
It seems to have almost the same architecture as SD 2.1 but includes some temporal consistency blocks called "time_stack." We'll need to see how they work and whether new functions need to be added to ggml. The conversion program works with this model; however, please note that we'll need to implement the vision version of CLIP to generate embeddings from images.

leejet · 2023-11-28T12:54:58Z

I'm currently reviewing the SVD implementation code in comfyui. Perhaps I can learn how to conveniently implement SVD within sd.cpp from this.

FSSRepo · 2023-11-28T12:56:49Z

> I'm currently reviewing the SVD implementation code in comfyui. Perhaps I can learn how to conveniently implement SVD within sd.cpp from this.

Amazing! Good luck! Unfortunately, my time is limited as I am a student; otherwise, I would be more than happy to help.

Amin456789 · 2023-11-28T13:00:22Z

Bless you guys! SVD in cpp will be a dream! Good luck to all of you!

Amin456789 · 2023-12-21T08:22:46Z

@leejet Any update or progress on SVD and inpainting? Really excited to try them out in cpp!

leejet · 2023-12-21T13:04:24Z

I've got a basic understanding of the SVD model architecture. Once I merge #104 and #117, I'll attempt to implement SVD.

Amin456789 · 2023-12-21T14:10:46Z

niceee! so excited, thanks

Jonathhhan · 2023-12-29T23:51:59Z

Hotshot-XL looks interesting too, and it works with SDXL models: https://huggingface.co/hotshotco/Hotshot-XL

Amin456789 · 2024-01-01T15:15:53Z

@leejet It would be great if you supported the fp16 version of SVD once it is done:
https://huggingface.co/becausecurious/stable-video-diffusion-img2vid-fp16/tree/main

They are smaller and probably more RAM-friendly.
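
The size claim follows from simple arithmetic: an fp16 weight takes 2 bytes where an fp32 weight takes 4. A minimal sketch is below; the parameter count is a made-up placeholder, not the real size of SVD, and `weight_bytes` is a hypothetical helper.

```python
# Bytes used to store one weight in each format.
BYTES_PER_ELEMENT = {"f32": 4, "f16": 2}


def weight_bytes(n_params, dtype):
    """Raw storage for n_params weights, ignoring file headers and metadata."""
    return n_params * BYTES_PER_ELEMENT[dtype]


n_params = 1_000_000_000  # hypothetical parameter count, for illustration only
for dtype in ("f32", "f16"):
    gib = weight_bytes(n_params, dtype) / 2**30
    print(f"{dtype}: {gib:.2f} GiB")
```

Whatever the true parameter count is, the fp16 weights need half the storage of the fp32 ones.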

engineer1109 · 2024-04-15T10:53:29Z

I need this as well.

Amin456789 · 2024-07-15T09:43:47Z

@leejet any update on svd please?

mirix · 2024-10-24T16:05:13Z

I don't know if this is even remotely related to the SD architecture, but it would be cool to support the new kid on the block:

https://huggingface.co/genmo/mochi-1-preview

https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main

patrickjonesdotca · 2024-12-09T18:38:27Z

Any updates on SVD?

Zctoylm0927 · 2024-12-10T13:27:17Z

Any updates on SVD?

bombless · 2024-12-18T15:26:38Z

There are more img2vid and txt2vid models coming:
https://github.com/THUDM/CogVideo
https://huggingface.co/IamCreateAI/Ruyi-Mini-7B
https://github.com/Tencent/HunyuanVideo

delldu · 2025-01-17T16:47:48Z

Good luck to you! In my opinion, applying ggml to video (AI) may be a nightmare, because it does not support tensors with more than 4 dimensions, so conv3d, batchnorm3d, etc. will drive you crazy!

stduhpf · 2025-01-17T17:47:39Z

> Good luck to you! In my opinion, applying ggml to video (AI) may be a nightmare, because it does not support tensors with more than 4 dimensions, so conv3d, batchnorm3d, etc. will drive you crazy!

I can confirm
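
One general way around a missing conv3d is to build it from 2D convolutions: each output frame is the sum, over the temporal kernel slices, of a 2D convolution of the matching input frame. The sketch below shows this on plain nested lists for a single channel, with no padding and stride 1. It is only an illustration of the decomposition, not how ggml or sd.cpp handles it, and all function names are hypothetical. As in deep-learning frameworks, "convolution" here means cross-correlation.

```python
def conv2d_valid(image, kernel):
    """2D cross-correlation on nested lists (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[y + i][x + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for x in range(out_w)
        ]
        for y in range(out_h)
    ]


def conv3d_via_conv2d(video, kernel):
    """3D cross-correlation built only from 2D ones.

    video is [T][H][W], kernel is [kT][kH][kW]. Output frame t is the
    sum over dt of conv2d_valid(video[t + dt], kernel[dt]).
    """
    kt = len(kernel)
    result = []
    for t in range(len(video) - kt + 1):
        acc = conv2d_valid(video[t], kernel[0])
        for dt in range(1, kt):
            part = conv2d_valid(video[t + dt], kernel[dt])
            acc = [[a + b for a, b in zip(row_a, row_b)]
                   for row_a, row_b in zip(acc, part)]
        result.append(acc)
    return result


def conv3d_direct(video, kernel):
    """Reference: the same operation written as one 3D loop nest."""
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out_t = len(video) - kt + 1
    out_h = len(video[0]) - kh + 1
    out_w = len(video[0][0]) - kw + 1
    return [
        [
            [
                sum(video[t + a][y + b][x + c] * kernel[a][b][c]
                    for a in range(kt) for b in range(kh) for c in range(kw))
                for x in range(out_w)
            ]
            for y in range(out_h)
        ]
        for t in range(out_t)
    ]


# Small integer example: 3 frames of 3x3 pixels, 2x2x2 kernel.
video = [[[t * 9 + y * 3 + x for x in range(3)] for y in range(3)] for t in range(3)]
kernel = [[[1, 0], [0, -1]], [[2, 1], [1, 0]]]
print(conv3d_via_conv2d(video, kernel) == conv3d_direct(video, kernel))
```

Because every intermediate here is at most a stack of 2D frames, the same idea can in principle be expressed with tensors of 4 dimensions or fewer, at the cost of extra passes over the data.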
