Fail to install deepspeed through ./install.sh : ModuleNotFoundError: No module named 'deepspeed' #472

visionscaper · 2020-10-14T21:59:15Z

Hello,

When on the master branch of Deepspeed, installing Deepspeed using ./install.sh in a virtual python environment fails.
I already tried to use the code at tag v0.3.0, but that failed too in another way, as reported here.

The result is :

...
Successfully installed Jinja2-2.11.2 MarkupSafe-1.1.1 Pygments-2.7.1 alabaster-0.7.12 appdirs-1.4.4 attrs-20.2.0 babel-2.8.0 certifi-2020.6.20 cfgv-3.2.0 chardet-3.0.4 clang-format-9.0.0 commonmark-0.9.1 distlib-0.3.1 docutils-0.16 filelock-3.0.12 identify-1.5.6 idna-2.10 imagesize-1.2.0 importlib-metadata-2.0.0 iniconfig-1.1.1 nodeenv-1.5.0 packaging-20.4 pluggy-0.13.1 pre-commit-2.7.1 py-1.9.0 pyparsing-2.4.7 pytest-6.1.1 pytest-forked-1.3.0 pytz-2020.1 pyyaml-5.3.1 recommonmark-0.6.0 requests-2.24.0 snowballstemmer-2.0.0 sphinx-3.2.1 sphinx-rtd-theme-0.5.0 sphinxcontrib-applehelp-1.0.2 sphinxcontrib-devhelp-1.0.2 sphinxcontrib-htmlhelp-1.0.3 sphinxcontrib-jsmath-1.0.1 sphinxcontrib-qthelp-1.0.3 sphinxcontrib-serializinghtml-1.1.4 toml-0.10.1 urllib3-1.25.10 virtualenv-20.0.34 zipp-3.3.0
Removed build tracker: '/tmp/pip-req-tracker-4_ygrx7k'
Traceback (most recent call last):
  File "basic_install_test.py", line 19, in <module>
    raise err
  File "basic_install_test.py", line 16, in <module>
    import deepspeed
ModuleNotFoundError: No module named 'deepspeed'
Error on line 204
Fail to install deepspeed

My machine runs Ubuntu:

$ lsb_release -a
No LSB modules are available.
Distributor ID:	Ubuntu
Description:	Ubuntu 18.04.4 LTS
Release:	18.04
Codename:	bionic

This is how I set up the virtual env.

$ which python3.7
/usr/bin/python3.7

$ virtualenv -p /usr/bin/python3.7 ~/.virtualenvs/deepspeed

$ source ~/.virtualenvs/deepspeed/bin/activate

Cloned Deepspeed:

$ git clone [email protected]:microsoft/DeepSpeed.git
$ cd DeepSpeed/

I'm at this commit:

$ git log HEAD^..HEAD
commit 7ddfda85263890615791b63955b9f5ed28ea2eff (HEAD -> master, origin/master, origin/HEAD)
Author: Jeff Rasley <[email protected]>
Date:   Wed Oct 14 10:44:16 2020 -0700

    Add support for p100 in transformer kernels (#470)
    
    add compute cap of 6.0, support p100

How can I resolve this?

The text was updated successfully, but these errors were encountered:

tjruwase · 2020-11-24T06:58:49Z

@visionscaper, we have recently improved our installation. Can you please retry to see if this issue is now fixed? Thanks.

* Merge chatgpt v2 to v3 - finalized (#484) * [squash] staging chatgpt v1 (#463) Co-authored-by: Reza Yazdani <[email protected]> Co-authored-by: yaozhewei <[email protected]> Co-authored-by: Tunji Ruwase <[email protected]> * [partial] formatting fixes * quantizer fixes * fix for bert tests * formatting fixes * re-enable _param_slice_mappings in z2 * Enable the QKV requires_grad when in training mode (#466) Co-authored-by: Jeff Rasley <[email protected]> * fixes for attention enable_training flag * commit to trigger CI * fix for distil-bert param * fixes for training context errors * remove reza's qkv-optimization (#469) Co-authored-by: Jeff Rasley <[email protected]> * Chatgpt - Fuse lora params at HybridEngine (#472) Co-authored-by: Jeff Rasley <[email protected]> * add option to enable non-pin mode (#473) * Chatgpt - fuse lora non pinned case (#474) * Fix fuse/unfuse lora for Z3 and non-pinned parameter * unfuse_lora_weight for non-pinned case * fix the multiple issue for lora parameters * formatting * fuse lora only when available --------- Co-authored-by: Jeff Rasley <[email protected]> * Chatgpt/release inference cache (#475) * Fix fuse/unfuse lora for Z3 and non-pinned parameter * unfuse_lora_weight for non-pinned case * release/retake the inference cache after/before generate * remove duplicated _fuse_lora function * fix formatting * fix hybrid-engine config issue * update formatting * Chatgpt - fuse qkv v2 (#478) Co-authored-by: Jeff Rasley <[email protected]> * ChatGPT: Refactor Hybrid Engine Config (#477) Co-authored-by: Lok Chand Koppaka <[email protected]> * Inference Workspace Tweaks (#481) * Safety checks around inference workspace allocation, extra flushing * Formatting fixes * Merge fix * Chatgpt/inference tp (#480) * Update the merged-QKV weights only if there is difference with the model parameter * remove the hard-coded size * always reset qkv params to updated ones after running step * Add the infernce-tp group and tensor sharding to run inference in model-parallel mode * optimize the gather/mp-sharding part * Add hybrid_engine changes * fix config issue * Formatting fixes. Reset_qkv duplicate removal. * fix bloom container. * fix format. --------- Co-authored-by: Ammar Ahmad Awan <[email protected]> Co-authored-by: Lok Chand Koppaka <[email protected]> * fix formatting * more clean-up --------- Co-authored-by: Jeff Rasley <[email protected]> Co-authored-by: yaozhewei <[email protected]> Co-authored-by: Tunji Ruwase <[email protected]> Co-authored-by: Masahiro Tanaka <[email protected]> Co-authored-by: Michael Wyatt <[email protected]> Co-authored-by: Lok Chand Koppaka <[email protected]> Co-authored-by: Connor Holmes <[email protected]> Co-authored-by: Ammar Ahmad Awan <[email protected]> * fix a bug on lora-fusion (#487) * Cholmes/v3 workspace bugfixes (#488) * Miscellaneous workspace fixes, new config param * Fix typo --------- Co-authored-by: Reza Yazdani <[email protected]> Co-authored-by: Jeff Rasley <[email protected]> Co-authored-by: yaozhewei <[email protected]> Co-authored-by: Tunji Ruwase <[email protected]> Co-authored-by: Masahiro Tanaka <[email protected]> Co-authored-by: Michael Wyatt <[email protected]> Co-authored-by: Lok Chand Koppaka <[email protected]> Co-authored-by: Connor Holmes <[email protected]>

mrwyattii closed this as completed Aug 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fail to install deepspeed through ./install.sh : ModuleNotFoundError: No module named 'deepspeed' #472

Fail to install deepspeed through ./install.sh : ModuleNotFoundError: No module named 'deepspeed' #472

visionscaper commented Oct 14, 2020 •

edited

Loading

tjruwase commented Nov 24, 2020

Fail to install deepspeed through ./install.sh : ModuleNotFoundError: No module named 'deepspeed' #472

Fail to install deepspeed through ./install.sh : ModuleNotFoundError: No module named 'deepspeed' #472

Comments

visionscaper commented Oct 14, 2020 • edited Loading

tjruwase commented Nov 24, 2020

visionscaper commented Oct 14, 2020 •

edited

Loading