-
Notifications
You must be signed in to change notification settings - Fork 312
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
An unexpected error occurred: cannot import name util in jobwatcher and sqswatcher #1236
Comments
attached both watcher log files |
#1142 this issue might contain relevant discussion. Do GNOME Desktop or VNC Server have the paramiko package as a dependency/subdependency? If it is indeed another issue with python package conflicts, it looks from the release notes like updating to pcluster 2.4.1 might solve your issue due to the switch to isolated virtualenvs for the node daemons. |
Hi, indeed that's exactly what happened. The custom packages broke the system python deps used by ParallelCluster, hence the scaling daemons are failing. Upgrading to 2.4.1 will solve the issue since the pcluster daemons will run into isolated virtualenvs. |
hi I upgraded parallelcluster to 2.4.1 and ran "pcluster create" with same config file. But I got "Unexpected error of type ValueError: too many values to unpack" like following. $ pcluster create lustre-dev -c lustre-dev |
Could you share the config with us? |
here is my config file |
I also tried #1241 (comment) and here is the traceback Traceback (most recent call last): |
Thanks for sharing additional debugging info. The issue you are facing is the same reported here: #1241. Fix is merged but will be available in the next version of ParallelCluster. you can disable sanity_check as a workaround |
I disabled sanity_check and added "autoScaling:SetInstanceHealth" into InstanceRole. And then my issue was fixed. Thank you all! |
Environment:
Bug description and how to reproduce:
I installed GNOME Desktop and VNC Server on the master node. And then I rebooted the master node.
After that, when I submit jobs to SGE, the cluster never scale out. And I found "An unexpected error occurred: cannot import name util" in jobwatcher and sqswatcher
How can I fix this?
The text was updated successfully, but these errors were encountered: