
ESM2 Infer partial batches using predict method #304

Closed
wants to merge 21 commits

Conversation

farhadrgh
Copy link
Collaborator

@farhadrgh farhadrgh commented Oct 11, 2024

This depends on changes in NVIDIA/NeMo#10837 that expose drop_last in MegatronDataSampler and allow inference on partial batches.

Note:

NVIDIA/NeMo#10934 has overridden the changes that exposed drop_last. We should now wrap dataloaders with nemo.lightning.data.WrappedDataLoader, which can store a mode attribute when the dataloader is created in the datamodules. The sampler then sets drop_last=False when the dataloader is in test or predict mode.
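A minimal sketch of the pattern described above, in plain Python rather than the actual NeMo code (the wrapper class and `resolve_drop_last` helper here are illustrative stand-ins, not NeMo APIs): a dataloader wrapper records which stage ("mode") it was built for, and the sampler uses that to decide whether to drop partial batches.

```python
class WrappedDataLoader:
    """Hypothetical stand-in for nemo.lightning.data.WrappedDataLoader:
    wraps a dataloader and remembers the stage it was created for."""

    def __init__(self, dataloader, mode="train"):
        self.dataloader = dataloader
        self.mode = mode  # e.g. "train", "validation", "test", "predict"


def resolve_drop_last(loader: WrappedDataLoader) -> bool:
    # Partial batches must be kept at inference time so every input
    # receives a prediction; during training, dropping them keeps
    # batch shapes uniform across ranks.
    return loader.mode not in ("test", "predict")


train_loader = WrappedDataLoader(range(10), mode="train")
predict_loader = WrappedDataLoader(range(10), mode="predict")
```

With this shape, the datamodule only has to tag each dataloader with its mode at creation time; no drop_last argument needs to be threaded through the sampler's public interface.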

Copy link
Collaborator

@jstjohn jstjohn left a comment


Yes! Doesn't this also depend on the latest nemo though? Should you bump nemo to top of tree?

@jstjohn jstjohn mentioned this pull request Oct 11, 2024
@jstjohn
Copy link
Collaborator

jstjohn commented Oct 11, 2024

Also see #302, which bumps the NeMo version. Is it new enough for your needs?

@yzhang123
Copy link
Collaborator

/build-ci

@farhadrgh
Copy link
Collaborator Author

NVIDIA/NeMo#10934 has overridden the changes that exposed drop_last. We should now wrap dataloaders with nemo.lightning.data.WrappedDataLoader, which can store a mode attribute when the dataloader is created in the datamodules. The sampler then sets drop_last=False when the dataloader is in test or predict mode.

CC @jstjohn

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh farhadrgh changed the title Infer using exposed drop_last in MegatronDataSampler ESM2 Infer partial batches using predict method Oct 30, 2024
Copy link
Collaborator

@pstjohn pstjohn left a comment


LGTM, but where does this use the new drop_last argument?

@farhadrgh
Copy link
Collaborator Author

LGTM, but where does this use the new drop_last argument?

drop_last is no longer exposed; the Megatron sampler sets it to False when the dataloader mode is test/predict: https://github.com/NVIDIA/NeMo/pull/10934/files#diff-c423cc7981d5621754bb3e5e509fceaa9adb8537b269b7def042995cfa21d529R81
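A quick illustration of why drop_last=False matters for predict mode (plain Python, not the Megatron sampler; `make_batches` is a hypothetical helper for this sketch): with 10 samples and a micro-batch size of 4, drop_last=True silently discards the trailing 2 samples, which would mean some inputs never receive a prediction.

```python
def make_batches(samples, batch_size, drop_last):
    # Slice the sample list into consecutive micro-batches.
    batches = [samples[i:i + batch_size]
               for i in range(0, len(samples), batch_size)]
    # drop_last=True removes a trailing partial batch, if any.
    if drop_last and batches and len(batches[-1]) < batch_size:
        batches.pop()
    return batches


samples = list(range(10))
train_batches = make_batches(samples, 4, drop_last=True)     # two full batches
predict_batches = make_batches(samples, 4, drop_last=False)  # three batches, last one partial
```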

@farhadrgh farhadrgh self-assigned this Oct 30, 2024
@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh
Copy link
Collaborator Author

/build-ci

@farhadrgh farhadrgh mentioned this pull request Nov 19, 2024
@farhadrgh farhadrgh closed this Nov 19, 2024
auto-merge was automatically disabled November 19, 2024 17:33


@pstjohn pstjohn deleted the farhadr/infer_partial_batch branch January 17, 2025 18:59