updt setup.py; fix tokens_per_expert casting #9

vchiley · 2023-07-26T20:03:37Z

PR does a few little things:

if setup.py imports torch therefore torch should already be installed; it shouldn't be part of the script install_requires.
- ~~torch similarly removed from stanford-stk @ git+https://github.com/vchiley/stk.git@setup_deps~~
use torch.cuda.get_device_capability() to set --generate-code compile flag for nvcc
autocast before all2all if autocast enabled
fix casting issue in def load_balancing_loss(*args)
create megablocks/backend/__init__.py

detect device and set `--generate-code` automatically

Btk setup deps

vchiley and others added 7 commits July 10, 2023 11:21

Update setup.py

87cd325

Update README.md

e9738de

Update moe.py

331b431

Merge branch 'stanford-futuredata:main' into setup_deps

0032d97

Update setup.py

7c9a26d

detect device and set `--generate-code` automatically

Update moe.py

bc47c6d

set all2all dtype using amp precision

876841e

vchiley force-pushed the setup_deps branch from 52b3991 to 876841e Compare July 27, 2023 16:58

vchiley and others added 5 commits July 27, 2023 17:51

merge setup-deps

5ac27d0

Create __init__.py

d9e243a

Merge branch 'main' into btk-setup-deps

1b540db

Merge pull request #1 from vchiley/btk-setup-deps

96dbc98

Btk setup deps

Update setup.py

e2f3587

vchiley marked this pull request as ready for review August 4, 2023 17:41

tgale96 merged commit d090eb4 into databricks:main Aug 10, 2023

Provide feedback