-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
zombie processes from dvc pull failures #3744
Comments
Hi @ychou85 ! Would be interesting to see more details or a reproduction script. But as is I would suspect the analytics worker, which is spawned by dvc on each CLI command. You can totally see it in the process list by |
@ychou85 Not sure what |
@ychou85 Also, I'm not sure how you are launching those dvc processes in the first place. Are you doing that directly from the dockerfile? Or from your own wrapper? If the latter, please double check you are properly collecting (akak |
@ychou85 could you please try to opt-out from the analytics as described here - https://dvc.org/doc/user-guide/analytics ? @efiop can it be updater daemon also? |
@ychou85 could please run |
Closing as stale |
Please provide information about your setup
DVC version(i.e.
dvc --version
), Platform and method of installation (pip, homebrew, pkg Mac, exe (Windows), DEB(Linux), RPM(Linux))DVC version 0.93.0
Platform: DGX2 kubernetes cluster, installation with pip,
The base image we use.
FROM nvcr.io/nvidia/pytorch:19.10-py3
When we call dvc pull, if the pull fails for any reason (lack of right credentials for one, or if we ctr-C), we see a lot of zombie processes start spawning on our pod. This occurs to the point that it ties up all free resources on the cluster and grinds things to a halt. Can someone look into this please? It's affecting a pilot team promoting DVC usage at a major healthcare corporation.
The text was updated successfully, but these errors were encountered: