QLoRA fin-tunes a custom model with 4-bits, and inference the video, then we got : #106

BUAACY · 2024-10-20T11:10:29Z

RuntimeError: Error(s) in loading state_dict for Videollama2MistralForCausalLM:
size mismatch for model.mm_projector.readout.0.weight: copying a param with shape torch.Size([4096, 4096]) from checkpoint, the shape in current model is torch.Size([8388608, 1]).
size mismatch for model.mm_projector.readout.2.weight: copying a param with shape torch.Size([4096, 4096]) from checkpoint, the shape in current model is torch.Size([8388608, 1]).

LiangMeng89 · 2024-11-13T18:02:23Z

RuntimeError: Error(s) in loading state_dict for Videollama2MistralForCausalLM: size mismatch for model.mm_projector.readout.0.weight: copying a param with shape torch.Size([4096, 4096]) from checkpoint, the shape in current model is torch.Size([8388608, 1]). size mismatch for model.mm_projector.readout.2.weight: copying a param with shape torch.Size([4096, 4096]) from checkpoint, the shape in current model is torch.Size([8388608, 1]).

Hello,I'm a phD student from ZJU, I also use videollama2 to do my own research,we create a WeChat group to discuss some issues of videollama2 and help each other,could you join us? Please contact me: WeChat number == LiangMeng19357260600, phone number == +86 19357260600,e-mail == [email protected].

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QLoRA fin-tunes a custom model with 4-bits, and inference the video, then we got : #106

QLoRA fin-tunes a custom model with 4-bits, and inference the video, then we got : #106

BUAACY commented Oct 20, 2024

LiangMeng89 commented Nov 13, 2024

QLoRA fin-tunes a custom model with 4-bits, and inference the video, then we got : #106

QLoRA fin-tunes a custom model with 4-bits, and inference the video, then we got : #106

Comments

BUAACY commented Oct 20, 2024

LiangMeng89 commented Nov 13, 2024