train process problem #6

CS123n · 2024-03-26T09:40:17Z

Hi, I used your code to train SD+T5 on my own.
However, the results deteriorated rapidly after only 500 steps.

Here's what the training loss looks like:

Do you have any advice? I tried changing the learning rate to 1e-5, but it didn't solve the problem.

ShihaoZhaoZSH · 2024-03-26T13:24:10Z

Thank you for your interest in our LaVi-Bridge! We haven't encountered such a situation in our experiment, and the released training and inference code has undergone thorough testing to ensure its correctness. We suggest checking the following points: 1. Adjust the learning rate appropriately. 2. Train using full precision. 3. Double-check the inference process to ensure the correct loading of LoRA and proper input of (un)conditional text embeddings into the adapter.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train process problem #6

train process problem #6

CS123n commented Mar 26, 2024 •

edited

Loading

ShihaoZhaoZSH commented Mar 26, 2024

train process problem #6

train process problem #6

Comments

CS123n commented Mar 26, 2024 • edited Loading

ShihaoZhaoZSH commented Mar 26, 2024

CS123n commented Mar 26, 2024 •

edited

Loading