
About the training of CUB200 dataset #41

Open
DZY-irene opened this issue Jan 9, 2024 · 5 comments

Comments

@DZY-irene

Regarding training on CUB200: I followed all the parameter settings of the source code. I first loaded cc_learned.pth and started training; according to the paper, this should be the VQ-Diffusion-F model. After training for 300 epochs I observed that the validation loss had almost stabilized, so I tested the checkpoint from epoch 299. But the test result is very poor, with an FID of 28. I am curious why this is the case.

My test command is:

VQ_Diffusion_model.inference_generate_sample_with_condition(
    data,
    truncation_rate=1.0,
    save_root="pre/ep299_tr1_gs5",
    batch_size=1,
    guidance_scale=5.0,
)
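For anyone comparing settings, the call above can be wrapped in a small grid sweep over the two sampling hyperparameters. This is a minimal sketch: the method name and keyword arguments are taken from the command above, whether that signature matches your checkout is an assumption, and the grid values below are illustrative, not recommendations.

```python
from itertools import product

def sweep_inference(model, data, truncation_rates, guidance_scales, root="pre"):
    """Call the repo's sampling entry point over a grid of
    (truncation_rate, guidance_scale) pairs, saving each run under a
    distinct directory so the FIDs can be compared afterwards."""
    for tr, gs in product(truncation_rates, guidance_scales):
        model.inference_generate_sample_with_condition(
            data,
            truncation_rate=tr,
            save_root=f"{root}/tr{tr}_gs{gs}",  # one folder per setting
            batch_size=1,
            guidance_scale=gs,
        )
```

This would help isolate whether the poor FID is sensitive to the sampler settings (e.g. the fairly high guidance_scale=5.0) rather than to the training itself.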

Here is the tensorboard and the visualization of test result:

[screenshots: tensorboard loss curves; generated test samples]

I also tried training VQ-Diffusion-B, which has no pretrained model, but the results were even worse.

Does anyone encounter the same problem?

@623851394

I ran into this problem too. The images I trained with VQ-Diffusion-B on CUB look very abstract, while the faces I trained on CelebA-HQ are acceptable. I don't know how CUB is supposed to be trained — how did you train to get your results?

@DZY-irene
Author


Mine actually counts as VQ-Diffusion-F: it is the result of roughly 300 epochs of training starting from the pretrained model, but the test FID is about 10 points higher than in the paper. I have also run VQ-Diffusion-B, and the results were even worse. I personally suspect CUB is too small a dataset, so training from scratch gives poor results. But it is puzzling that the result is still this bad even with the pretrained model. I plan to try COCO next to see how much the amount of data matters.

@pangPhD

pangPhD commented May 7, 2024


Where is this file: OUTPUT/pretrained_model/taming_dvae/taming_f8_8192_openimages_last.pth? I have searched for a long time without finding it. Thanks!

@pangPhD

pangPhD commented May 7, 2024


Do you have this file: OUTPUT/pretrained_model/taming_dvae/taming_f8_8192_openimages_last.pth? I couldn't find where it is. Thanks!

@623851394

@pangPhD You need to download it via the pretrain.sh he provides:
if [ -f ithq_vqvae.pth ]; then
    echo "ithq_vqvae.pth exists"
else
    echo "Downloading ithq_vqvae.pth"
    wget https://github.com/tzco/storage/releases/download/vqdiffusion/ithq_vqvae.pth
fi

if [ -f taming_f8_8192_openimages_last.pth ]; then
    echo "taming_f8_8192_openimages_last.pth exists"
else
    echo "Downloading taming_f8_8192_openimages_last.pth"
    wget https://github.com/tzco/storage/releases/download/vqdiffusion/taming_f8_8192_openimages_last.pth
fi

if [ -f vqgan_imagenet_f16_16384.pth ]; then
    echo "vqgan_imagenet_f16_16384.pth exists"
else
    echo "Downloading vqgan_imagenet_f16_16384.pth"
    wget https://github.com/tzco/storage/releases/download/vqdiffusion/vqgan_imagenet_f16_16384.pth
fi

if [ -f ViT-B-32.pt ]; then
    echo "ViT-B-32.pt exists"
else
    echo "Downloading ViT-B-32.pt"
    wget https://github.com/tzco/storage/releases/download/vqdiffusion/ViT-B-32.pt
fi
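The download-if-missing pattern above can also be sketched in Python, which avoids depending on wget. The URL is copied from pretrain.sh; placing the file under OUTPUT/pretrained_model/taming_dvae/ follows the path quoted earlier in this thread and is an assumption about the repo layout.

```python
import os
import urllib.request

# URL -> local target path. URL copied from pretrain.sh above; the
# OUTPUT/... prefix is an assumption based on the path cited in this thread.
ASSETS = {
    "https://github.com/tzco/storage/releases/download/vqdiffusion/"
    "taming_f8_8192_openimages_last.pth":
        "OUTPUT/pretrained_model/taming_dvae/taming_f8_8192_openimages_last.pth",
}

def fetch(url, path):
    """Download url to path unless it already exists; return True if downloaded."""
    if os.path.exists(path):
        print(f"{os.path.basename(path)} exists")
        return False
    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
    print(f"Downloading {os.path.basename(path)}")
    urllib.request.urlretrieve(url, path)
    return True

# Uncomment to actually download:
# for url, path in ASSETS.items():
#     fetch(url, path)
```

Like the shell script, the helper is idempotent: re-running it skips files that are already present.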
