Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Model/Framework] What is the problem?RUN pip install lamb_amp_opt/ #1446

Open
saliedev051 opened this issue Jan 24, 2025 · 0 comments
Open
Labels
bug Something isn't working

Comments

@saliedev051
Copy link

saliedev051 commented Jan 24, 2025

Related to Model/Pytorch
(e.g. GNMT/PyTorch or FasterTransformer/All)

Describe the bug
when i when i run - bash scripts/docker/build.sh , at the last step of the build, i am getting below error

=> ERROR [11/11] RUN pip install lamb_amp_opt/ 1.5s

[11/11] RUN pip install lamb_amp_opt/:
0.490 Looking in indexes: https://pypi.org/simple, https://pypi.ngc.nvidia.com
0.493 Processing ./lamb_amp_opt
0.493 DEPRECATION: A future pip version will change local packages to be built in-place without first copying to a temporary directory. We recommend you use --use-feature=in-tree-build to test your packages with this new behavior before it becomes the default.
0.493 pip 21.3 will remove support for this functionality. You can find discussion regarding this at pypa/pip#7555.
0.672 ERROR: Command errored out with exit status 1:
0.672 command: /opt/conda/bin/python3.8 -c 'import io, os, sys, setuptools, tokenize; sys.argv[0] = '"'"'/tmp/pip-req-build-e2j59bku/setup.py'"'"'; file='"'"'/tmp/pip-req-build-e2j59bku/setup.py'"'"';f = getattr(tokenize, '"'"'open'"'"', open)(file) if os.path.exists(file) else io.StringIO('"'"'from setuptools import setup; setup()'"'"');code = f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' egg_info --egg-base /tmp/pip-pip-egg-info-iyrisslh
0.672 cwd: /tmp/pip-req-build-e2j59bku/
0.672 Complete output (13 lines):
0.672 Traceback (most recent call last):
0.672 File "", line 1, in
0.672 File "/opt/conda/lib/python3.8/site-packages/setuptools/init.py", line 27, in
0.672 from .dist import Distribution
0.672 File "/opt/conda/lib/python3.8/site-packages/setuptools/dist.py", line 30, in
0.672 from . import (
0.672 File "/opt/conda/lib/python3.8/site-packages/setuptools/_entry_points.py", line 6, in
0.672 from jaraco.text import yield_lines
0.672 File "/opt/conda/lib/python3.8/site-packages/setuptools/_vendor/jaraco/text/init.py", line 12, in
0.672 from jaraco.context import ExceptionTrap
0.672 File "/opt/conda/lib/python3.8/site-packages/setuptools/_vendor/jaraco/context.py", line 17, in
0.672 from backports import tarfile
0.672 ImportError: cannot import name 'tarfile' from 'backports' (/opt/conda/lib/python3.8/site-packages/backports/init.py)
0.672 ----------------------------------------
0.673 WARNING: Discarding file:///workspace/bert/lamb_amp_opt. Command errored out with exit status 1: python setup.py egg_info Check the logs for full command output.
0.673 ERROR: Command errored out with exit status 1: python setup.py egg_info Check the logs for full command output.


Dockerfile:35

33 | RUN python -m nltk.downloader punkt
34 |
35 | >>> RUN pip install lamb_amp_opt/
36 |

ERROR: failed to solve: process "/bin/sh -c pip install lamb_amp_opt/" did not complete successfully: exit code: 1

To Reproduce
Steps to reproduce the behavior:

  1. Install '...'Clone BERT mode, install requirements,
  2. Set "..."
  3. Launch '.bash scripts/docker/build.sh'

Expected behavior
A clear and concise description of what you expected to happen.

Environment
Please provide at least:
python: 3.6

  • GPUs in the system: (e.g. 8x Tesla V100-SXM2-16GB): 2 x H100 NVL
  • CUDA driver version (e.g. 418.67):12.7
@saliedev051 saliedev051 added the bug Something isn't working label Jan 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant