Add support for adapter loading in mllama #669
Merged
Adds support for adapter loading in VLMs, specifically mllama models. This turned out to be more complex than initially anticipated: a VLM is, under the hood, a composite model with multiple trunks, so the LoRA weights have to be separated and loaded independently for each trunk. Further, we currently have no LoRA adapters for certain layers in one of the trunks (the cross-attention layers), which requires careful handling. Finally, the linear layer classes currently in use don't support adapter weights (still investigating). A rough sketch of the per-trunk weight separation is shown below.
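The snippet below is a minimal sketch, not the code from this PR, of how a flat LoRA state dict might be partitioned per trunk before loading into a composite model like mllama. The trunk prefixes (`language_model.`, `vision_model.`), the `cross_attn` marker, and the helper name are assumptions for illustration only; the actual module paths and handling in this PR may differ.

```python
from collections import defaultdict
from typing import Dict

import torch

# Hypothetical trunk prefixes and cross-attention marker, chosen only for
# illustration; the real mllama module names may differ.
TRUNK_PREFIXES = ("language_model.", "vision_model.")
CROSS_ATTN_MARKER = "cross_attn"


def split_lora_weights_by_trunk(
    lora_weights: Dict[str, torch.Tensor],
) -> Dict[str, Dict[str, torch.Tensor]]:
    """Partition a flat LoRA state dict into one dict per trunk, skipping
    layers (e.g. cross-attention) for which the adapter has no weights."""
    per_trunk: Dict[str, Dict[str, torch.Tensor]] = defaultdict(dict)
    for name, tensor in lora_weights.items():
        if CROSS_ATTN_MARKER in name:
            # The adapter ships no weights for these layers; skip them
            # rather than failing at load time.
            continue
        for prefix in TRUNK_PREFIXES:
            if name.startswith(prefix):
                # Strip the trunk prefix so each trunk sees local module names.
                per_trunk[prefix.rstrip(".")][name[len(prefix):]] = tensor
                break
        else:
            # Anything outside a known trunk goes into a shared bucket.
            per_trunk["shared"][name] = tensor
    return dict(per_trunk)
```

Each trunk's dict can then be handed to that trunk's own adapter-loading path, which is what makes the composite case different from loading a single-trunk LLM.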