Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error running after environment is configured on the server #12

Open
WanLang0 opened this issue Jan 29, 2023 · 1 comment
Open

Error running after environment is configured on the server #12

WanLang0 opened this issue Jan 29, 2023 · 1 comment

Comments

@WanLang0
Copy link

WanLang0 commented Jan 29, 2023

Hi,I use the Ubuntu 20.04 system, and the server in the Python 3.8/cuda11.6 environment runs this command
python3 main.py --config ./configs/poisson/cifar10_ddpmpp.py --mode train --workdir poisson_ddpmpp
The following problems have occurred. Would you please check them? Thank you

WARNING:tensorflow:From /root/miniconda3/envs/myconda/lib/python3.8/site-packages/tensorflow_gan/python/estimator/tpu_gan_estimator.py:42: The name tf.estimator.tpu.TPUEstimator is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimator instead.

WARNING:tensorflow:From /root/miniconda3/envs/myconda/lib/python3.8/site-packages/tensorflow_gan/python/estimator/tpu_gan_estimator.py:42: The name tf.estimator.tpu.TPUEstimator is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimator instead.

1 Physical GPUs, 1 Logical GPUs
I0129 10:36:33.654870 140394431960256 xla_bridge.py:356] Unable to initialize backend 'tpu_driver': NOT_FOUND: Unable to find driver in registry given worker: 
I0129 10:36:33.660157 140394431960256 xla_bridge.py:356] Unable to initialize backend 'rocm': NOT_FOUND: Could not find registered platform with name: "rocm". Available platform names are: Interpreter Host CUDA
I0129 10:36:33.661039 140394431960256 xla_bridge.py:356] Unable to initialize backend 'tpu': module 'jaxlib.xla_extension' has no attribute 'get_tpu_client'
I0129 10:36:33.661505 140394431960256 xla_bridge.py:356] Unable to initialize backend 'plugin': xla_extension has no attributes named get_plugin_device_client. Compile TensorFlow with //tensorflow/compiler/xla/python:enable_plugin_device set to true (defaults to false) to enable this.
2023-01-29 10:36:33.680217: W tensorflow/core/platform/cloud/google_auth_provider.cc:184] All attempts to get a Google authentication bearer token failed, returning an empty token. Retrieving token from files failed with "NOT_FOUND: Could not locate the credentials file.". Retrieving token from GCE failed with "FAILED_PRECONDITION: Error executing an HTTP request: libcurl code 6 meaning 'Couldn't resolve host 
@Newbeeer
Copy link
Owner

Newbeeer commented Feb 9, 2023

Hi,

It seems that the snippet only contains partial error message. Could you please provide full one? Thanks.

In addition, we recommend using our latest model PFGM++ for research projects (https://github.com/Newbeeer/pfgmpp)

Best,
Yilun

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants