
Granitemoe #33207
Merged — 100 commits merged into huggingface:main from the granitemoe branch, Sep 20, 2024

Conversation

@mayank31398 (Contributor) commented on Aug 29, 2024:

This PR adds support for IBM's PowerMoE model (3B).
It will also form the basis for IBM's upcoming MoE models, expected by the end of this month.

text models: @ArthurZucker and @younesbelkada
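For context, here is a minimal, hedged usage sketch of loading and generating with a model of this type through the standard auto classes; the checkpoint id `ibm/PowerMoE-3b` and the prompt are illustrative assumptions, not values taken from this PR.

```python
# Hedged usage sketch: load an assumed PowerMoE checkpoint and run greedy generation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ibm/PowerMoE-3b"  # assumed checkpoint name, for illustration only
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=20, do_sample=False)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```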

@mayank31398 marked this pull request as ready for review on September 3, 2024 at 19:22
@mayank31398 (Contributor, Author) commented:

Hi @ArthurZucker, any updates on this?

@ArthurZucker (Collaborator) left a review comment:

Let's just add an integration test with generation!
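As a rough sketch of what such a test could look like, below is a slow integration test with generation in the usual transformers style; the checkpoint id and the asserted output are placeholders, not the values that were actually added in this PR.

```python
# Hedged sketch of an integration test with generation (checkpoint and expected
# text are placeholders; the real test pins exact values).
import unittest

from transformers import AutoModelForCausalLM, AutoTokenizer
from transformers.testing_utils import require_torch, slow, torch_device


@require_torch
class GraniteMoeIntegrationTest(unittest.TestCase):
    @slow
    def test_model_generation(self):
        model_id = "ibm/PowerMoE-3b"  # assumed checkpoint name
        tokenizer = AutoTokenizer.from_pretrained(model_id)
        model = AutoModelForCausalLM.from_pretrained(model_id).to(torch_device)

        prompt = "Simply put, the theory of relativity states that"
        inputs = tokenizer(prompt, return_tensors="pt").to(torch_device)
        output_ids = model.generate(**inputs, max_new_tokens=10, do_sample=False)
        text = tokenizer.decode(output_ids[0], skip_special_tokens=True)

        # Placeholder assertion: a real integration test compares against a pinned string.
        self.assertTrue(text.startswith(prompt))
```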

@ArthurZucker (Collaborator) commented:

Thanks, let's merge!

@ArthurZucker merged commit e472e07 into huggingface:main on Sep 20, 2024
18 of 20 checks passed
@mayank31398 (Contributor, Author) commented:

Thanks

@mayank31398 deleted the granitemoe branch on September 21, 2024 at 20:16
@mayank31398 restored the granitemoe branch on September 24, 2024 at 21:28
@mayank31398 deleted the granitemoe branch on September 24, 2024 at 21:45
amyeroberts pushed a commit to amyeroberts/transformers that referenced this pull request Oct 2, 2024
* first commit
* drop tokenizer
* drop tokenizer
* drop tokenizer
* drop convert
* granite
* drop tokenization test
* mup
* fix
* reformat
* reformat
* reformat
* fix docs
* stop checking for checkpoint
* update support
* attention multiplier
* update model
* tiny drop
* saibo drop
* skip test
* fix test
* fix test
* drop
* drop useless imports
* update docs
* drop flash function
* copied from
* drop pretraining tp
* drop pretraining tp
* drop pretraining tp
* drop unused import
* drop code path
* change name
* softmax scale
* head dim
* drop legacy cache
* rename params
* cleanup
* fix copies
* comments
* add back legacy cache
* multipliers
* multipliers
* multipliers
* text fix
* fix copies
* merge
* multipliers
* attention multiplier
* drop unused imports
* add granitemoe
* add decoration
* remove moe from sequenceclassification
* fix test
* fix
* fix
* fix
* move rope?
* merge
* drop bias
* drop bias
* Update src/transformers/models/granite/configuration_granite.py

Co-authored-by: Arthur <[email protected]>

* fix
* Update src/transformers/models/granite/modeling_granite.py

Co-authored-by: Arthur <[email protected]>

* fix
* fix
* fix
* fix
* drop
* drop
* fix
* fix
* cleanup
* cleanup
* fix
* fix granite tests
* fp32 test
* fix
* drop jitter
* fix
* rename
* rename
* fix config
* add gen test

---------

Co-authored-by: Yikang Shen <[email protected]>
Co-authored-by: Arthur <[email protected]>