-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Any plan support cogvideo v1.5 lora and full parameter finetune? #85
Comments
Hi, yes we are going to be running some experiments now that diffusers adaptation is complete. I think it should already work out-of-the-box and can be finetuned just by changing model_id parameter in the script. We're waiting for the authors to validate implementation and open source the diffusers format weights at the moment |
@a-r-r-o-w I also believe that it need the diffusion adaptation is complete. But I found that there is some change in the cogvideo v1.5:
following the introduction in: https://hub.baai.ac.cn/view/40956
|
I think these all correspond to the pre-training strategy and deviate a bit from the original question. I will let @zRzRzRzRzRzRzR answer them. |
seems just changing the model_id isn't enough, I got this error when attempting. updated to diffusers 0.32 just incase and tried making sure it was set to bfloat16.
|
@Cubey42 same error. Let me know if you found a fix. |
|
@Cubey42 @TrickyBarrel The diffusion adaptation is complete? |
1.5-5B and 1.5-5B-I2V are on hugging face with diffusers updated to 0.32 |
got past the tensor size but stuck again here
EDIT: so I don't overspam, I've found the route to be --load_tensors but without it, I can't actually train |
This could be because we might have to do the data reshaping accordingly. What I would recommend doing is:
It would take a bit of time for us to do it ourselves until the CogVideoX 1.5 PR is actually merged into I can understand if this is frustrating but we cannot do much as of now. |
I'll have to examine this more but thanks for the response, great work so far! |
anyone experienced the following error? |
continuous attention! |
I've been trying to get this pr to work, but it seems to my cuda not findable. everything worked before but now I get: I've rebuilt the venv and rechecked cuda to make sure
|
That is super weird as I'm able to run it. Are you launching with the training script with |
I found the GPU ID was different between them and adjusting this value resolved the issue, I'm still getting
if I do I2V instead, I get the
|
Okay with the help of a fellow discord user, it seems I needed to do 56 frames instead of 49 line things up. |
Sorry for the late response, I got busy with something. Yes Cog 1.5 required number of latent frames to be divisible by 2. So, for 49 sample frames, you get 13 latent frames (calculated as (x - 1) / 4 + 1). For 53, it is 14 latent frames which would work. Btw, would love to join any servers you're discussing on to gather feedback! We have many improvements, with further memory savings, planned in the coming weeks as we try to find time outside of normal work. |
#92 added support. Feel free to share your results, too. Do you think this issue can be closed now? |
No problem, this is the server (Banodoco) here, we have a wide variety of users and its mainly about animation/video AI. https://discord.gg/qjSMmUKG we have channels for cogvideox along with many other models. More than welcome to join us! |
The invite link seems to be invalid. |
https://discord.gg/P9EbDYFT try this, if not, search Banodoco on google or try https://discord.gg/eKQm3uHKx2 |
Hi, have you tried training with 56 frames in I2V, I used diffusers0.32 but Timesteps should be a 1d-array |
I still have the same error even after setting max_num_frames= 56 for I2V. Any suggestions to solve it? |
No description provided.
The text was updated successfully, but these errors were encountered: