Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[bug] smdistributed is not included in pytorch-training:2.4.0-gpu-py311 #4395

Open
4 of 6 tasks
gui-miotto opened this issue Oct 29, 2024 · 4 comments
Open
4 of 6 tasks

Comments

@gui-miotto
Copy link

Checklist

Concise Description:

Running

import smdistributed.dataparallel.torch.torch_smddp

Results in

ModuleNotFoundError: No module named 'smdistributed'

DLC image/dockerfile:

763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:2.4.0-gpu-py311-cu124-ubuntu22.04-sagemaker

@sirutBuasai
Copy link
Contributor

Hi, SMDDP releases are not coupled with DLC which is why it is not currently present in PyTorch 2.4 Training DLC. However, we are working on releasing a new SMDDP binary compatible with PyTorch 2.4 in the coming weeks.

I will follow up once we have a PR up for adding the SMDDP binary on PyTorch 2.4 DLC.

@sirutBuasai
Copy link
Contributor

Hi, to follow up, we have a PR open for adding SMDDP to PT 2.4 Training SM DLC #4403

@gui-miotto
Copy link
Author

Hey @sirutBuasai , thanks for the kind reply.
I'm happy that you guys shipped an PyTorch 2.5.1 DLC!
Do you know when the SMDDP for PyTorch 2.5 will be released? I'm talking about the .whl itself, not necessarily a DLC containing it.
Checking the release notes, it seems to me that cadence is so slow. :-(

@sirutBuasai
Copy link
Contributor

@gui-miotto, unfortunately we don't have direct insight into the SMDDP release schedule.
For PT 2.4 SM DLC, we have a RC build with SMDDP and we're planning on releasing a new version on 12/9/24

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants