[Multimodal model] Add the decoder of the multimodal model #626

fduwjj · 2024-10-18T05:22:05Z

Following #589, this PR adds the decoder model to torchtitan, along with a simple unit test.

torchtitan/models/llama_multimodal/model.py

tianyu-l

As discussed offline, let's create separate files (in the folder) for the encoder, the decoder, and the shared parts. This would make the code more readable than a monolithic 1.5k LoC file.

I had a similar suggestion for ModelArgs, since it seems to be shared by encoder and decoder.
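A minimal sketch of what this suggestion could look like, assuming a single `ModelArgs` dataclass that both halves of the model read from. This is not the code from the PR: the file names in the comment, every field name, and the `Encoder`/`Decoder` stand-in classes are hypothetical and only illustrate the shared-config idea.

```python
# Hypothetical layout (file names are illustrative, not taken from the PR):
#   torchtitan/models/llama_multimodal/
#       args.py      <- shared ModelArgs
#       encoder.py   <- encoder
#       decoder.py   <- decoder
#       utils.py     <- parts shared by both
from dataclasses import dataclass


@dataclass
class ModelArgs:
    """One config object shared by encoder and decoder (all fields are assumptions)."""

    # Settings only the encoder reads
    encoder_embed_dim: int = 1024
    encoder_num_layers: int = 4
    # Settings only the decoder reads
    decoder_embed_dim: int = 2048
    decoder_num_layers: int = 8
    # Settings both halves read
    norm_eps: float = 1e-5


class Encoder:
    """Stand-in for the real encoder module; it only records its config."""

    def __init__(self, args: ModelArgs) -> None:
        self.embed_dim = args.encoder_embed_dim
        self.num_layers = args.encoder_num_layers
        self.norm_eps = args.norm_eps


class Decoder:
    """Stand-in for the real decoder module; it only records its config."""

    def __init__(self, args: ModelArgs) -> None:
        self.embed_dim = args.decoder_embed_dim
        self.num_layers = args.decoder_num_layers
        self.norm_eps = args.norm_eps


# Both halves are built from the same args instance, so shared settings
# such as norm_eps cannot drift apart between encoder and decoder.
args = ModelArgs(norm_eps=1e-6)
encoder = Encoder(args)
decoder = Decoder(args)
```

With this shape, each file imports `ModelArgs` from one place, and a change to a shared field is picked up by both the encoder and the decoder.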

test/multimodal_model/test_multimodal_model.py

torchtitan/models/llama_multimodal/__init__.py

torchtitan/models/llama_multimodal/model.py

tianyu-l

Looks good to me. Please update the ModelArgs before merging.

kwen2501

Nice work!

Add the decoder of the multimodal model

5ac043b

facebook-github-bot added the CLA Signed label Oct 18, 2024

fduwjj requested review from tianyu-l, awgu, kwen2501, wz337 and lessw2020 October 18, 2024 05:22

fduwjj mentioned this pull request Oct 18, 2024

add Llama 3.2 support #625

Open

awgu reviewed Oct 21, 2024

torchtitan/models/llama_multimodal/model.py (outdated, resolved)

torchtitan/models/llama_multimodal/model.py (outdated, resolved)

tianyu-l reviewed Oct 21, 2024

Address comments from reviewer

ec337c3

fduwjj requested review from awgu and tianyu-l October 22, 2024 16:33

tianyu-l approved these changes Oct 23, 2024

Address comments from reviewers

74ec97e

kwen2501 approved these changes Oct 25, 2024

fduwjj merged commit ccfc02b into main Oct 25, 2024
5 checks passed

fduwjj deleted the vision_decoder branch October 25, 2024 03:34

mori360 pushed a commit to mori360/torchtitan that referenced this pull request Nov 26, 2024

[Multimodal model] Add the decoder of the multimodal model (pytorch#626)

57ba1b8
