-
-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Xen 4.14: X crashes/hangs #6097
Comments
Seems I spoke too soon. xfce also hangs, it just takes longer, and is fully usable for a few hours. |
It also freezes with external monitor throuth HDMI. |
I'm having significant KDE startup issues w certain desktop components starting sometimes / sometimes not. In particular, the desktop seems to fail to start or crash, leaving a black area with no contents other than app windows. BTW installing the amdgpu driver hasn't helped this (although seems to help in other areas). A related problem is that the desktop wallpaper and desktop icon settings are forgotten. The problem is aggravated by the |
Also, with the Switching to my delayed user script seems to increase the sys-usb success rate a lot. |
do you think this relates to this, @fepitre ?: https://xen.markmail.org/search/?q=xen%204.14%20xorg#query:xen%204.14%20xorg%20order%3Adate-backward+page:2+mid:mgtqx544txdmfqjr+state:results |
@0spinboson to confirm that would be the same, a wordaround for the issue I have is to add |
This is eerily reminiscent of a bug I had running kernel-latest on R4.0. I will try to reproduce. |
hm. I tried the Xen 4.14.0-9 build, and i'm now also seeing the lockups. |
hm. looks like Xen 4.14.1 contains no relevant fixes, unless I'm missing something? Shame. |
Try adding |
You can also try dropping |
yeah that isn't too much of a worry for me, since I'm using an AMD cpu. but will try, thank you. |
it at least is better than it was before, haven't seen any CPU hangs yet with about 4h uptime; which is an improvement. Though I understood from fepitre's xen-devel mails that for him it only delayed the inevitable, and I don't see any sched-related fixes for x86? But will let you know how long it lasts. Kernel still 5.8.16, will try 5.10.5 once this proves stable for 30h or so. |
I made it through 24h without any strange issues, which is unheard of compared to before. kernel 5.10 and 5.9 both are unstable as hell though, but idk what to do with that since the crashes don't leave any logs. |
Must say the xen documentation isn't great. https://wiki.xenproject.org/wiki/Xen_Project_4.12_Feature_List says credit2 is the 'new default scheduler', and so does https://xenbits.xen.org/docs/4.14-testing/features/sched_credit2.html, yet https://xenbits.xen.org/docs/4.14-testing/SUPPORT.html#credit-scheduler says credit is the default. I guess the latter isn't update, but why does this page even exist if it's not used any more? Ugh. |
5.10.7 brings been quite a few fixes for common regressions in 5.9 and earlier 5.10 (at least on Intel HW). It's in current-testing already, can you give it a try? |
okay. it seems to have something to do with my dual monitor setup. if I only have 1 monitor enabled during login, no problem. If I turn the second monitor on and then reboot the VM, it even 'gets' the correct display dimensions so I can click anywhere no matter where I put the VM window. But as soon as I login with 2 4k displays enabled, instant reboot. |
was stable once I got past login until I opened "too many windows" at the same time, at which point I got another hard reboot. Not sure which component / interaction causes this problem, but it's an issue that doesn't occur at all with kernel 5.8.16, and it seems to have to do with drawing windows. |
tested this weekend using xen 4.14.1-2, kernel 5.4.98. Kernel 5.9 and up crash as soon as I open too many windows at the same time, but are stable so long as I'm in dom0 with no VMs spun up. Kernel 5.8.16-1 works fine. I decided to test kernel 5.4 over the weekend, and I've noticed something similar happening there, although this kernel series does result in a system that's usable/stable for at least a day or two. I've attached a log file with more info, but this is part of what happens: Feb 20 22:59:46 dom0 kernel: Xorg: page allocation failure: order:6, mode:0x40dc0(GFP_KERNEL|__GFP_COMP|__GFP_ZERO), nodemask=(null),cpuset=/,mems_allowed=0 near-full log: log.txt (I should probably rename this issue as it seems to be a memory allocation or grant page issue.) |
This looks like this mapping is using actual dom0 memory, not only mapping VM's memory into its own address space. This sounds similar to the situation described in https://xenbits.xen.org/xsa/advisory-300.html. |
I changed the buffer size in the manner and to the size suggested by brendan, but I still only get two such headers. Is that normal, or should I increase it further? |
Allow to not use otherwise usable RAM pages to map foreign pages (including grant mappings). This is especially useful for dom0 (or GUI domain) that maps a lot of foreign pages. QubesOS/qubes-issues#6097
Allow to not use otherwise usable RAM pages to map foreign pages (including grant mappings). This is especially useful for dom0 (or GUI domain) that maps a lot of foreign pages. QubesOS/qubes-issues#6097 (cherry picked from commit 3e71ce9)
I'm still trying to capture the debug-keys output, but it looks like kernel 5.11(.4-1) is much more stable again, not tried 5.10.21 yet. I've never made it past 4hrs uptime since 5.9.x, and currently i'm at 21h. |
Allow to not use otherwise usable RAM pages to map foreign pages (including grant mappings). This is especially useful for dom0 (or GUI domain) that maps a lot of foreign pages. QubesOS/qubes-issues#6097 (cherry picked from commit 3e71ce9ed6f0c57129e767f42d16fce5a3c7b1e9)
Qubes OS version
Qubes 4.1 up to date with current-testing
rx vega56, amdgpu driver, xorg-x11-driver-amdgpu installed
kernel 5.8.10
Affected component(s) or functionality
kde plasma 5.18
Brief summary
when I try to log in to kde plasma desktop, it either hangs for about a minute before crashing back to login screen (after which I sometimes can no longer connect to running VMs, or start new ones), or it hangs or reboots.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
desktop works as it should
Actual behavior
hangs, reboots, x crash
starting kde applications (including the settings manager) while using xfce can also lead to an unresponsive system.
Additional context
Xorg.log.txt
xsessold.txt
Solutions you've tried
xfce works fine, reverting to xen 4.13 also works.
The text was updated successfully, but these errors were encountered: