Cannot run remote nn classification workflow #181
Comments
You have to start it with --addr 0.0.0.0 |
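Binding the server to 0.0.0.0 makes it listen on all network interfaces, so a client on another machine can reach it. A quick way to confirm this from the client side is a plain TCP connection test. The sketch below is only an illustration: the function name `can_connect` is made up here, and the host and port in the usage comment are placeholders, not values from this thread.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host and opens a TCP socket;
        # the context manager closes it again right away.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


# Hypothetical usage (placeholder host and port, replace with your own):
# print(can_connect("gpu-server.example.org", 5567))
```

If this returns False while the server is running, the server is probably bound to the loopback interface only, or a firewall is blocking the port.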
It may be an issue with the CUDA versions. In this case, the device list doesn't show
I added some expanded CUDA logging to the server startup sequence to simplify debugging |
@m-novikov you are right, the problem is that CUDA is not available:
and this is because my env has a CPU-only PyTorch build installed:
Note that I followed the tiktorch installation instructions: https://github.com/ilastik/tiktorch#installation. So for some reason this env pulls in a CPU-only PyTorch build. |
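To tell a CPU-only PyTorch build from a CUDA build inside a given environment, one can query PyTorch directly. This is a minimal sketch, not the logging that was added to the server: the helper name `describe_torch_cuda` is invented for illustration, and it returns early if `torch` is not importable.

```python
import importlib.util


def describe_torch_cuda() -> dict:
    """Summarize whether the installed PyTorch can use CUDA."""
    if importlib.util.find_spec("torch") is None:
        return {"installed": False}

    import torch

    return {
        "installed": True,
        "torch_version": torch.__version__,
        # None for CPU-only builds, otherwise the CUDA version PyTorch was built against.
        "built_with_cuda": torch.version.cuda,
        # False if the build is CPU-only or if no usable driver/GPU is found.
        "cuda_available": torch.cuda.is_available(),
        "device_count": torch.cuda.device_count(),
    }


print(describe_torch_cuda())
```

A `built_with_cuda` value of None points to a CPU-only package, whereas a CUDA version combined with `cuda_available` being False points to a driver or toolkit mismatch on the machine.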
I think this is an issue with |
hey @constantinpape, could you try installing the conda package instead of creating the devenv?
conda create -n tiktorch-server-env -c ilastik-forge -c conda-forge tiktorch cudatoolkit=YOURPREFERREDVERSION
I recently updated the conda recipe, while the |
@k-dominik yes, that works; I had to add
|
sorry, of course, I forgot the |
The default installation command on Linux installs the CPU-only build:
results in
Seems like this is because of strict channel priority: if I disable strict channel priority or rearrange the command to have
Maybe we should consider specifying a build-string constraint for pytorch, e.g. |
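The build string of the installed pytorch package (the third column of `conda list pytorch`) usually shows which variant the solver picked. The helper below is a rough sketch that assumes GPU builds carry "cuda" and CPU builds carry "cpu" somewhere in the build string. That naming convention is an assumption and may differ between channels and versions, and the function name `classify_build` is made up for illustration.

```python
def classify_build(build_string: str) -> str:
    """Guess the package variant from a conda build string.

    Assumes the (unverified) convention that GPU builds contain "cuda"
    and CPU builds contain "cpu" in their build string.
    """
    s = build_string.lower()
    if "cuda" in s:
        return "cuda"
    if "cpu" in s:
        return "cpu"
    return "unknown"


# Illustrative build strings, not taken from this thread:
for example in ("py3.7_cuda10.1.243_cudnn7.6.3_0", "cpu_py37h0000000_0"):
    print(example, "->", classify_build(example))
```

Running this check right after creating the environment shows whether a CPU variant was picked before the server is started.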
I tried to run the remote nn classification workflow (client: my laptop, server: embl gpu 7), but ran into some issues:
See screenshots (server was still running when I tried to connect):