CUDA 11/A100 Support #866

afiaka87 · 2021-03-17T18:42:54Z

I saw in another thread that there are plans to target the A100 next. This sounds very useful to me as I'm trying to use sparse attention in another project and I've had luck getting access to A100's recently.

CUDA 11 is quickly becoming the de facto standard on a lot of cloud servers. I'm aware that I could roll back to 10.2, but I'd love to get the improvements of both.

Anyway, keep me posted on this feature if you can.

Thanks

awan-10 · 2021-04-14T17:45:47Z

Hi @afiaka87, can you please elaborate the problem a bit more?

Did you try to build from source on a CUDA11 machine and it failed?

We do support CUDA11 and A100.

afiaka87 · 2021-04-22T10:39:43Z

We're having a lot of trouble getting Sparse attention specifically to work. The issue remains even after the ZeRO Infinity update. I'll follow up tomorrow with an error message

@awan-10

helena-balabin · 2021-05-26T12:29:46Z

Do you have any advice on how to configure deepspeed for CUDA11 and an A100? I'm currently trying to set it up, but I'm constantly running into incompatibility issues.

loadams · 2023-08-18T20:27:02Z

Hi @afiaka87 and @helena-balabin - I'm closing this issue as stale given the age of it. Cuda/Torch for A100s should be more well supported now there and in DeepSpeed. If you are still having any issues, please open a new issue and link this one and I'd be happy to take a look at any issues.

Thanks!

afiaka87 changed the title ~~A100 Support~~ CUDA 11/A100 Support Mar 27, 2021

afiaka87 mentioned this issue Mar 29, 2021

Reproducing DALL-E using DeepSpeed lucidrains/DALLE-pytorch#137

Open

awan-10 self-assigned this Apr 14, 2021

loadams closed this as completed Aug 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA 11/A100 Support #866

CUDA 11/A100 Support #866

afiaka87 commented Mar 17, 2021 •

edited

Loading

awan-10 commented Apr 14, 2021

afiaka87 commented Apr 22, 2021 •

edited

Loading

helena-balabin commented May 26, 2021 •

edited

Loading

loadams commented Aug 18, 2023

CUDA 11/A100 Support #866

CUDA 11/A100 Support #866

Comments

afiaka87 commented Mar 17, 2021 • edited Loading

awan-10 commented Apr 14, 2021

afiaka87 commented Apr 22, 2021 • edited Loading

helena-balabin commented May 26, 2021 • edited Loading

loadams commented Aug 18, 2023

afiaka87 commented Mar 17, 2021 •

edited

Loading

afiaka87 commented Apr 22, 2021 •

edited

Loading

helena-balabin commented May 26, 2021 •

edited

Loading