-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Turning off graphics card hangs indefinitely #105
Comments
@OctarineSorcerer can you reproduce this issue if you use the nvidia-xrun script from my fork? |
Nope, using your fork it seems to stop just fine - after those lines it shows
|
That's the expected behavior, as `tee` will echo auto to the terminal in
addition to writing it on the file. If everything worked fine you should
see a decrease in the power drain in powertop after the bus PM is set to
`auto`.
As I suspected there might be something wrong with the logic the in
upstream script, but so far I didn't manage to get in touch with @Witko to
discuss.
|
That makes sense - give me a couple minutes, and I'll test out the powertop results EDIT: @michelesr Power drain in powertop went down from ~18W to ~10-12W, and I also couldn't see it anymore on powertop's "Device stats" tab. I'll second that the power management in your fork seems to work well, without the issues here. |
Good, just to clarify: Linux PM should automatically put the bus (and the device) in power saving mode when it's not utilized, as long the value of the PM control is set to Removing the device from the tree is a (dirty) workaround to prevent those program to load the nvidia kernel module so that Linux can put the bus in the power saving mode. |
I see, seems like a sensible workaround, and for now I'm using your fork. |
My script instead disables the card EDIT: actually the reason for the hang is another, read my next comment. |
can reproduce. the @michelesr fork works. |
The reason for the hang is that the kernel modules aren't really unloaded, as specified in #95 and #101. If you unload the modules properly (via If you don't use the nvidia card at all you can simple uninstall the |
I am using nvidia-xrun-pm fork. According to the dmesg, seems like Nvidia module crashed because the device was used:
And this causes full system hung, I am not able even restart PC:
|
I tried to enable and disable scripts from this wiki. I found there are some differences between |
@OctarineSorcerer the code from my fork has been merged here, can you please see if it works for you so that this issues can be closed? |
@michelesr Starting |
This issue is not resolved. If nvidia_drm module cant be unloaded for whatever reason (and consequently all modules fail to unload), computer will go rogue, and reboot is required. There should be some check after unload_modules to see if nvidia modules unloaded successfully. |
@wioo have you tried loading |
Thanks for reply @rihardsk |
Any idea the state of this issue upstream?
Was thinking to try @michelesr's fork, but noticed it's been a few years since an update... |
Probably not a good idea, my fork was merged upstream a long time ago, so there's no benefit in using that and it's probably outdated |
Noted, thank you kindly. Seems I was a bit more behind in my updates than I thought. Now to fix a |
Not sure if it's a viable solution for you, but you might want to try PRIME offload instead |
Might try it out perhaps. Originally used Anyway, getting off topic now I guess |
Starting
nvidia-xrun
seems to work fine, but when I exit from Xorg, it hangs on:Opening htop on another tty,
tee /sys/bus/pci/devices/0000:00:01.0/remove
is still running. If I try to terminate this, whether with a SIGTERM or SIGKILL, I get 100% CPU usage on one core, and it doesn't seem to end.I think similar symptoms have been talked about by @michelesr as part of issue #94
The text was updated successfully, but these errors were encountered: