"unable to handle kernel paging request" at kmem_cache_alloc, and hung processes #7987

vthriller · 2018-10-04T21:40:09Z

System information

Type	Version/Name
Distribution Name	Gentoo
Distribution Version	—
Linux Kernel	gentoo-sources-4.9.95
Architecture	x86_64
ZFS Version	0.7.9-r0-gentoo
SPL Version	0.7.9-r0-gentoo

Describe the problem you're observing

- Spotted unusual CPU usage statistics (lots of iowait).
- Ran dmesg and found a bunch of oopses.
- Ran random commands like ls on one of the ZFS mountpoints, only to see them hang in D state indefinitely.
- Ran htop and found the following in D state:
  - txg_sync
  - khugepaged
  - dbuf_evict
  - a couple of the aforementioned userspace processes (nothing that I hadn't started myself while poking around)
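
For anyone wanting to capture the same list without htop, here is a minimal sketch that scans /proc for processes in uninterruptible sleep (state D). The function names `parse_stat` and `blocked_tasks` are made up for illustration; the only thing assumed about the system is the standard Linux /proc/<pid>/stat layout (pid, then the command name in parentheses, then the state letter).

```python
import os


def parse_stat(line):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or ')', so split on the LAST closing parenthesis.
    lparen = line.index("(")
    rparen = line.rindex(")")
    pid = int(line[:lparen])
    comm = line[lparen + 1:rparen]
    state = line[rparen + 1:].split()[0]
    return pid, comm, state


def blocked_tasks(proc_root="/proc"):
    """List (pid, comm) for every process currently in state D."""
    found = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # not a process directory
        path = os.path.join(proc_root, entry, "stat")
        try:
            with open(path, errors="replace") as f:
                pid, comm, state = parse_stat(f.read())
        except OSError:
            continue  # process exited while we were scanning
        if state == "D":
            found.append((pid, comm))
    return found


if __name__ == "__main__":
    for pid, comm in blocked_tasks():
        print(pid, comm)
```

This only looks at top-level process entries, which is enough to see kernel threads such as txg_sync; per-thread state of userspace programs would need /proc/<pid>/task/ as well.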

Describe how to reproduce the problem

I have no idea what triggered this. I was away when CPU usage jumped (according to the monitoring system), and no cron jobs were scheduled around that time either.

I have a slight suspicion that it might have something to do with zram-backed swap, so I'm currently swapping it off to a disk-backed swap, although I doubt that it might affect anything at this point.

Include any warning/errors/backtraces from the system logs

Again, this is what dmesg shows at the moment.

vthriller · 2018-10-04T23:48:05Z

> I have a slight suspicion that it might have something to do with zram-backed swap, so I'm currently swapping it off to a disk-backed swap, although I doubt that it might affect anything at this point.

Well, the swapoff processes stalled relatively quickly and are not killable, and swapon --show has been showing the exact same values for well over an hour now. No new kernel log messages though.
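
A rough way to confirm that swapoff is making no progress is to compare two snapshots of /proc/swaps, which carries the same per-device numbers that swapon --show reports. The sketch below assumes the usual column order of that file (Filename, Type, Size, Used, Priority); `parse_swaps` and `swap_progress` are hypothetical helper names, not part of any tool.

```python
def parse_swaps(text):
    """Map each swap device to its 'Used' value from /proc/swaps text."""
    used = {}
    for line in text.splitlines()[1:]:  # first line is the column header
        fields = line.split()
        if len(fields) < 5:
            continue  # blank or malformed line
        # Columns: Filename Type Size Used Priority
        used[fields[0]] = int(fields[3])
    return used


def swap_progress(before, after):
    """Return how much 'Used' dropped per device between two snapshots.

    A device missing from the second snapshot is treated as fully
    swapped off. A result of 0 means no progress was made.
    """
    return {dev: before[dev] - after.get(dev, 0) for dev in before}
```

Reading /proc/swaps twice a few minutes apart and passing both results through `swap_progress` should show all zeros in the situation described above.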

Unfortunately this kernel has CONFIG_CRASH_DUMP unset, so I'm going to leave this issue as it is and force-reboot the system after 136 days of uptime.

At last, here are the traces for all blocked processes (sysrq-w).
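
For reference, the sysrq-w dump can also be requested without a keyboard by writing the letter `w` to /proc/sysrq-trigger as root; the traces then show up in the kernel log (dmesg). A minimal sketch, with `dump_blocked_tasks` being a made-up name and the path parameter existing only so the function can be tried against an ordinary file:

```python
def dump_blocked_tasks(trigger_path="/proc/sysrq-trigger"):
    """Ask the kernel to log stack traces of all blocked (D state) tasks."""
    # Writing 'w' here has the same effect as the sysrq-w key combination.
    # Requires root when pointed at the real trigger file.
    with open(trigger_path, "w") as f:
        f.write("w")
```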

(#4319 and #6880 are the closest issues I was able to find, but I'm not sure whether they are really relevant.)

vthriller · 2019-05-30T17:21:22Z

> I have a slight suspicion that it might have something to do with zram-backed swap

Well, 156 days of uptime later I got the same thing without zram block devices.

vthriller · 2019-05-30T17:53:23Z

Well, backtraces didn't change that much from the last time, except for missing ARC functions in the middle of the stack.

> try reproducing it with a new version

Thanks, I'm already planning an upgrade for both kernel and ZoL.

stale · 2020-08-24T19:53:50Z

This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions.

vthriller changed the title ~~"unable to handle kernel paging request" at kmem_cache_alloc and hung processes~~ "unable to handle kernel paging request" at kmem_cache_alloc, and hung processes Oct 4, 2018

vthriller mentioned this issue Jun 9, 2019

ARC- and zrlock-related panics during import #8876

Closed

stale bot added the Status: Stale No recent activity for issue label Aug 24, 2020

stale bot closed this as completed Nov 25, 2020
