sys-net occasionally dies on resume from suspend #4658

Closed
andrewdavidwong opened this issue Dec 25, 2018 · 44 comments
Labels
  • affects-4.1 (this issue affects Qubes OS 4.1)
  • bounty (this issue has a public bounty associated with it)
  • C: core
  • eol-4.1 (closed because Qubes 4.1 has reached end-of-life (EOL))
  • P: major (priority: major; between "default" and "critical" in severity)
  • T: bug (type: bug report; a problem or defect resulting in unintended behavior in something that exists)

Comments

@andrewdavidwong
Member

andrewdavidwong commented Dec 25, 2018

Qubes OS version:

R3.2

Affected component(s):

sys-net


Steps to reproduce the behavior:

  1. Close lid to suspend.
  2. Wait for machine to suspend.
  3. Open lid.
  4. Machine resumes from suspend.
  5. Unlock screen locker.

Expected behavior:

sys-net stays on and continues to provide network access normally.

Actual behavior:

Occasionally:

  • NetworkManager in sys-net looks like it's trying to connect for a second, then the entire sys-net just dies (shows powered off in Qubes Manager).
  • sys-firewall and AppVMs using sys-firewall for network access are still running normally, but of course they don't have network access.
  • Restarting just sys-net does not restore network access to these other AppVMs.
  • Instead, they must all be restarted, beginning with sys-net. (Maybe restarting just sys-net and sys-firewall would be enough, but in practice it's easier for me just to shut them all down and restart them all.)

General notes:

This has been going on for at least a few months. When reading #4657, I saw that it mentioned this problem with sys-net in passing. I thought we already had an issue on this (see below) but couldn't find one, so I'm filing this now.


Related issues:

I could have sworn we already had an issue about this, but after searching, I can't find one. These all look different:

#2964 was about losing network access when sys-net stays on
#3008/#3030 was about failing to connect to a network after resume when sys-net stays on
#3151 was about NetworkManager not running in sys-net after resume when sys-net stays on
#3738 was about the entire computer not resuming correctly from suspend.

Ah, maybe I was thinking of #4042, which is a similar report about sys-usb.

@andrewdavidwong andrewdavidwong added T: bug Type: bug report. A problem or defect resulting in unintended behavior in something that exists. C: core P: major Priority: major. Between "default" and "critical" in severity. labels Dec 25, 2018
@andrewdavidwong andrewdavidwong added this to the Release 3.2 updates milestone Dec 25, 2018
@brycepg

brycepg commented Dec 29, 2018

@unman
Member

unman commented Dec 30, 2018

@andrewdavidwong @brycepg I feel this must be hardware-specific, since I don't see it on any of the machines that I still have access to running 3.2.1.
It might be helpful if you could post details of the hardware you are using, and specifically the NIC.
I'm assuming that you see this using both Fedora and Debian templates? If you haven't checked please do so.

@brycepg

brycepg commented Dec 30, 2018

@unman

Laptop model: Lenovo Thinkpad T530
Wireless adapter: Intel Centrino Advanced-N 6205 [Taylor Peak] (rev 34)
I've only tested this issue on my fedora-28 template. Will try with debian-9
CPU: i5-3210m (no IOMMU)

EDIT: Strangely I cannot repro right now with a quick suspend. Will try out with longer suspend times.

@andrewdavidwong
Member Author

I should have clarified that this only happens occasionally, so it would be difficult to reproduce without suspending/resuming many times. I'll update the issue to reflect this.

Lenovo Thinkpad T450s:

Xen:		4.6.6
Kernel:		4.14.74-1

RAM:		20194 Mb

CPU:
  Intel(R) Core(TM) i7-5600U CPU @ 2.60GHz
Chipset:
  Intel Corporation Broadwell-U Host Bridge -OPI [8086:1604] (rev 09)
VGA:
  Intel Corporation HD Graphics 5500 [8086:1616] (rev 09) (prog-if 00 [VGA controller])

Net:
  Intel Corporation Ethernet Connection (3) I218-LM (rev 03)
  Intel Corporation Wireless 7265 (rev 59)

@andrewdavidwong andrewdavidwong changed the title sys-net dies on resume from suspend sys-net occasionally dies on resume from suspend Jan 1, 2019
@4vrel

4vrel commented Jan 25, 2019

Same issue on a ThinkPad X220 with Qubes 4.0.

Maybe a script that shuts down and restarts all the net VMs could be a temporary fix? (It's pretty long and painful to do manually.)

@tasket

tasket commented Feb 6, 2019

This happened to me recently after re-doing some templates and VMs. What I discovered was that my wifi module suspend settings were gone.... I forgot to re-add them.

I agree with unman this is a hardware-specific issue. OTOH it would be nice if Qubes had some way of automatically populating this module information.

Here is what I use in /rw/config/suspend-module-blacklist for an Intel "Ultimate-N" card:

iwldvm
iwlwifi
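
For anyone who, like me, forgets to re-add these entries after rebuilding a qube, here is a minimal sketch of a helper that appends module names to the blacklist file without creating duplicates. As I understand it, Qubes unloads the modules listed in this file before suspend and reloads them on resume, but please treat that as an assumption. The function name `add_to_suspend_blacklist` is hypothetical; only the file path and the module names come from the comment above.

```shell
#!/bin/sh
# Append kernel module names to a suspend blacklist file, skipping any
# name that is already listed. The function name is hypothetical.
#   $1      path to the blacklist file
#   $2...   module names to add
add_to_suspend_blacklist() {
    file="$1"
    shift
    for module in "$@"; do
        # -x matches whole lines only, -F treats the name as a literal string.
        # A missing file makes grep fail, so the name is appended and the
        # file is created.
        if ! grep -qxF -- "$module" "$file" 2>/dev/null; then
            printf '%s\n' "$module" >> "$file"
        fi
    done
}

# Example (as root inside sys-net):
#   add_to_suspend_blacklist /rw/config/suspend-module-blacklist iwldvm iwlwifi
```

Which modules to list depends on the wireless card; `lsmod` inside sys-net should show what the driver in use is called.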

@aslfv

aslfv commented Mar 7, 2019

In my case, sys-net will always die on resume from suspend, if another VM is in a transient state (i.e. qube manager displaying the VM with a yellow dot). This is in particular true for the following two cases which others may be able to reproduce as well:

  1. Another VM is starting while the machine is suspending.
  2. A Windows VM without QWT (Qubes Windows Tools) is running while the machine is suspending.

The result in my case is the following output in sys-net:
[ 372.065513] ath10k_pci 0000:00:06.0: failed to wake target for read32 at 0x0003a028: -110
[ 372.657769] clocksource: timekeeping watchdog on CPU0: Marking clocksource 'tsc' as unstable because the skew is too large:
[ 372.657817] clocksource: 'xen' wd_now: 27c2a46c622 wd_last: 27c0c3c4a2a mask: ffffffffffffffff
[ 372.657856] clocksource: 'tsc' cs_now: ffffff9953059778 cs_last: ffffffecec397004 mask: ffffffffffffffff
[ 372.657902] tsc: Marking TSC unstable due to clocksource watchdog
Please also note that this may occasionally occur even for VMs that are not connected to a NetVM.

In the case of sys-usb the machine may even entirely fail to resume, and instead reboots.

I hope that this is relevant for this issue. If not, please indicate how I can proceed. For the record, the machine (Dell XPS 13 9360) is running R4.0.

@marmarek
Member

marmarek commented Mar 7, 2019

@aslfv your case looks like #3489

@quantumpacket

quantumpacket commented Oct 15, 2019

I'm experiencing this issue with a Librem 13 laptop and my easiest solution is to kill the sys-net qube and then start it up again. If I do a simple restart I am forced to restart all dependent qubes, which is really inconvenient. So I opt for the kill method, which has worked for me every time. Is there anything I can provide to help with debugging this?
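
The kill-then-start workaround described above can be sketched as a small dom0 script. This is only a sketch under assumptions: `qvm-kill` and `qvm-start` are the standard dom0 tools, while the function names `run_cmd` and `restart_netvm` and the `DRY_RUN` switch are hypothetical and exist only so the commands can be previewed without touching any qube. Whether dependent qubes regain networking afterwards may vary by setup.

```shell
#!/bin/sh
# Print the command when DRY_RUN=1, otherwise execute it.
# (Hypothetical helper, used only by this sketch.)
run_cmd() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Kill a net qube and start it again.
#   $1  qube name (defaults to sys-net)
restart_netvm() {
    vm="${1:-sys-net}"
    # Ignore any error from qvm-kill, for example when the qube has
    # already powered itself off after resume.
    run_cmd qvm-kill "$vm" || true
    run_cmd qvm-start "$vm"
}

# Example (in dom0):
#   DRY_RUN=1 restart_netvm sys-net   # preview the commands
#   restart_netvm sys-net             # actually run them
```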

@mfc mfc added the bounty This issue has a public bounty associated with it. label Oct 16, 2019
@lrvick

lrvick commented Dec 7, 2019

Several peers and I all have Librem 15 and Librem 13 devices, and sys-net often dies on resume, exactly as @quantumpacket describes.

This is a serious PITA we would love help with.

@tetrahedras

As per the discussion on qubes-users ("[qubes-users] sys-net keeps dying") here is the dmesg from a netvm (formerly fedora-29, upgraded to fedora-30) as it begins to exhibit the described behaviour:

[ 9266.512872] IPv6: ADDRCONF(NETDEV_UP): ens7: link is not ready
[ 9266.770001] IPv6: ADDRCONF(NETDEV_UP): ens7: link is not ready
[ 9266.792354] IPv6: ADDRCONF(NETDEV_UP): wls6: link is not ready
[ 9268.821814] iwlwifi 0000:00:06.0: Error sending REPLY_SCAN_ABORT_CMD: time out after 2000ms.
[ 9268.821849] iwlwifi 0000:00:06.0: Current CMD queue read_ptr 29 write_ptr 30
[ 9268.821922] iwlwifi 0000:00:06.0: Loaded firmware version: 18.168.6.1
[ 9268.822451] iwlwifi 0000:00:06.0: 0x00000000 | OK
[ 9268.822477] iwlwifi 0000:00:06.0: 0x00000000 | uPc
[ 9268.822494] iwlwifi 0000:00:06.0: 0x00000000 | branchlink1
[ 9268.822510] iwlwifi 0000:00:06.0: 0x00000000 | branchlink2
[ 9268.822568] iwlwifi 0000:00:06.0: 0x00000000 | interruptlink1
[ 9268.822590] iwlwifi 0000:00:06.0: 0x00000000 | interruptlink2
[ 9268.822611] iwlwifi 0000:00:06.0: 0x00000000 | data1
[ 9268.822629] iwlwifi 0000:00:06.0: 0x00000000 | data2
[ 9268.822655] iwlwifi 0000:00:06.0: 0x00000000 | line
[ 9268.822672] iwlwifi 0000:00:06.0: 0x00000000 | beacon time
[ 9268.822690] iwlwifi 0000:00:06.0: 0x00000000 | tsf low
[ 9268.822716] iwlwifi 0000:00:06.0: 0x00000000 | tsf hi
[ 9268.822734] iwlwifi 0000:00:06.0: 0x00000000 | time gp1
[ 9268.822760] iwlwifi 0000:00:06.0: 0x00000000 | time gp2
[ 9268.822777] iwlwifi 0000:00:06.0: 0x00000000 | time gp3
[ 9268.822795] iwlwifi 0000:00:06.0: 0x00000000 | uCode version
[ 9268.822825] iwlwifi 0000:00:06.0: 0x00000000 | hw version
[ 9268.822852] iwlwifi 0000:00:06.0: 0x00000000 | board version
[ 9268.822873] iwlwifi 0000:00:06.0: 0x00000000 | hcmd
[ 9268.822899] iwlwifi 0000:00:06.0: 0x00000000 | isr0
[ 9268.822916] iwlwifi 0000:00:06.0: 0x00000000 | isr1
[ 9268.822942] iwlwifi 0000:00:06.0: 0x00000000 | isr2
[ 9268.822960] iwlwifi 0000:00:06.0: 0x00000000 | isr3
[ 9268.822986] iwlwifi 0000:00:06.0: 0x00000000 | isr4
[ 9268.823004] iwlwifi 0000:00:06.0: 0x00000000 | isr_pref
[ 9268.823030] iwlwifi 0000:00:06.0: 0x00000000 | wait_event
[ 9268.823048] iwlwifi 0000:00:06.0: 0x00000000 | l2p_control
[ 9268.823075] iwlwifi 0000:00:06.0: 0x00000000 | l2p_duration
[ 9268.823093] iwlwifi 0000:00:06.0: 0x00000000 | l2p_mhvalid
[ 9268.823119] iwlwifi 0000:00:06.0: 0x00000000 | l2p_addr_match
[ 9268.823149] iwlwifi 0000:00:06.0: 0x00000000 | lmpm_pmg_sel
[ 9268.823167] iwlwifi 0000:00:06.0: 0x00000000 | timestamp
[ 9268.823184] iwlwifi 0000:00:06.0: 0x00000000 | flow_handler
[ 9268.823413] iwlwifi 0000:00:06.0: Start IWL Event Log Dump: nothing in log
[ 9268.823453] iwlwifi 0000:00:06.0: Command REPLY_RXON failed: FW Error
[ 9268.823485] iwlwifi 0000:00:06.0: Error clearing ASSOC_MSK on BSS (-5)
[ 9268.835622] ieee80211 phy0: Hardware restart was requested
[ 9268.849978] iwlwifi 0000:00:06.0: Radio type=0x1-0x2-0x0
[ 9269.152998] iwlwifi 0000:00:06.0: Radio type=0x1-0x2-0x0
[ 9269.240518] IPv6: ADDRCONF(NETDEV_UP): wls6: link is not ready
[ 9269.256202] iwlwifi 0000:00:06.0: Radio type=0x1-0x2-0x0
[ 9270.288593] audit: type=1131 audit(1575954363.530:135): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-rfkill comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 9274.645527] iwlwifi 0000:00:06.0: Failed to load firmware chunk!
[ 9274.645558] iwlwifi 0000:00:06.0: iwlwifi transaction failed, dumping registers
[ 9274.645584] iwlwifi 0000:00:06.0: iwlwifi device config registers:
[ 9274.669420] iwlwifi 0000:00:06.0: 00000000: 00858086 00100406 02800034 00000000 f2044004 00000000 00000000 00000000
[ 9274.669467] iwlwifi 0000:00:06.0: 00000020: 00000000 00000000 00000000 13118086 00000000 000000c8 00000000 0000010b
[ 9274.669505] iwlwifi 0000:00:06.0: iwlwifi device memory mapped registers:
[ 9274.669608] iwlwifi 0000:00:06.0: 00000000: 00488700 00000040 08000000 00000000 00000001 00000000 00000030 00000000
[ 9274.669652] iwlwifi 0000:00:06.0: 00000020: 00000001 080403c5 000000b0 00000000 90000001 00030001 80008040 00080044
[ 9274.669694] iwlwifi 0000:00:06.0: Could not load the [0] uCode section
[ 9274.684532] iwlwifi 0000:00:06.0: Failed to run INIT ucode: -110
[ 9274.684563] iwlwifi 0000:00:06.0: Fw not loaded - dropping CMD: 81
[ 9274.684628] iwlwifi 0000:00:06.0: Unable to initialize device.
[ 9274.684650] ------------[ cut here ]------------
[ 9274.684667] Hardware became unavailable during restart.
[ 9274.684732] WARNING: CPU: 1 PID: 1761 at /home/user/rpmbuild/BUILD/kernel-4.19.84/linux-4.19.84/net/mac80211/util.c:1936 ieee80211_reconfig+0x236/0x1140 [mac80211]
[ 9274.684774] Modules linked in: iwldvm iwlwifi mac80211 cfg80211 ehci_pci ehci_hcd xt_nat ccm fuse nft_reject_ipv4 nft_reject nft_ct nf_tables nfnetlink ip6table_raw iptable_raw xen_netback xt_REDIRECT ip6table_filter ip6_tables xt_conntrack ipt_MASQUERADE iptable_nat nf_nat_ipv4 nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c joydev arc4 intel_rapl crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel intel_rapl_perf ttm pcspkr serio_raw e1000e drm_kms_helper ata_generic pata_acpi rfkill drm i2c_piix4 floppy u2mfn(O) xen_gntdev xen_gntalloc xen_blkback xenfs xen_evtchn xen_privcmd overlay xen_blkfront [last unloaded: cfg80211]
[ 9274.684980] CPU: 1 PID: 1761 Comm: kworker/1:1 Tainted: G           O      4.19.84-1.pvops.qubes.x86_64 #1
[ 9274.685008] Hardware name: Xen HVM domU, BIOS 4.8.5-12.fc25 11/13/2019
[ 9274.685042] Workqueue: events_freezable ieee80211_restart_work [mac80211]
[ 9274.685081] RIP: 0010:ieee80211_reconfig+0x236/0x1140 [mac80211]
[ 9274.685104] Code: 44 24 07 00 c6 83 a4 04 00 00 00 48 89 df e8 41 af fc ff 85 c0 41 89 c5 0f 84 6e 01 00 00 48 c7 c7 40 9a 71 c0 e8 ea f1 9d f3 <0f> 0b e9 46 fe ff ff 48 89 ef e8 fb f7 01 00 e9 12 ff ff ff c6 83
[ 9274.685157] RSP: 0018:ffffb536410dfe08 EFLAGS: 00010282
[ 9274.685175] RAX: 0000000000000000 RBX: ffff8ab884358760 RCX: 0000000000000006
[ 9274.685199] RDX: 0000000000000007 RSI: 0000000000000086 RDI: ffff8ab896f168b0
[ 9274.685223] RBP: ffff8ab884358f98 R08: ffffb53640000000 R09: 00000000000002c1
[ 9274.685246] R10: ffff8ab8928f8900 R11: ffffffffb59efe4d R12: ffff8ab8843593d0
[ 9274.687477] R13: 00000000ffffff92 R14: ffff8ab895aded80 R15: ffff8ab8843593d8
[ 9274.687503] FS:  0000000000000000(0000) GS:ffff8ab896f00000(0000) knlGS:0000000000000000
[ 9274.687527] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 9274.687548] CR2: 00005be8941db000 CR3: 000000000b20a003 CR4: 00000000001606e0
[ 9274.687574] Call Trace:
[ 9274.687614]  ieee80211_restart_work+0xbb/0xe0 [mac80211]
[ 9274.687637]  process_one_work+0x191/0x370
[ 9274.687715]  worker_thread+0x4f/0x3b0
[ 9274.687730]  kthread+0xf8/0x130
[ 9274.687745]  ? rescuer_thread+0x340/0x340
[ 9274.687758]  ? kthread_create_worker_on_cpu+0x70/0x70
[ 9274.687777]  ret_from_fork+0x35/0x40
[ 9274.687793] ---[ end trace 3adece76f5f16d5c ]---
[ 9274.689538] ------------[ cut here ]------------
[ 9274.689560] wls6:  Failed check-sdata-in-driver check, flags: 0x0
[ 9274.689625] WARNING: CPU: 1 PID: 1761 at /home/user/rpmbuild/BUILD/kernel-4.19.84/linux-4.19.84/net/mac80211/driver-ops.h:19 drv_remove_interface+0xf3/0x100 [mac80211]
[ 9274.689667] Modules linked in: iwldvm iwlwifi mac80211 cfg80211 ehci_pci ehci_hcd xt_nat ccm fuse nft_reject_ipv4 nft_reject nft_ct nf_tables nfnetlink ip6table_raw iptable_raw xen_netback xt_REDIRECT ip6table_filter ip6_tables xt_conntrack ipt_MASQUERADE iptable_nat nf_nat_ipv4 nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c joydev arc4 intel_rapl crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel intel_rapl_perf ttm pcspkr serio_raw e1000e drm_kms_helper ata_generic pata_acpi rfkill drm i2c_piix4 floppy u2mfn(O) xen_gntdev xen_gntalloc xen_blkback xenfs xen_evtchn xen_privcmd overlay xen_blkfront [last unloaded: cfg80211]
[ 9274.689868] CPU: 1 PID: 1761 Comm: kworker/1:1 Tainted: G        W  O      4.19.84-1.pvops.qubes.x86_64 #1
[ 9274.689896] Hardware name: Xen HVM domU, BIOS 4.8.5-12.fc25 11/13/2019
[ 9274.689931] Workqueue: events_freezable ieee80211_restart_work [mac80211]
[ 9274.689966] RIP: 0010:drv_remove_interface+0xf3/0x100 [mac80211]
[ 9274.692140] Code: 85 c0 75 e8 5b 5d 41 5c c3 48 8b b5 08 04 00 00 48 81 c5 28 04 00 00 48 c7 c7 20 77 71 c0 48 85 f6 48 0f 44 f5 e8 4d 3d a1 f3 <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 41 57 41 56
[ 9274.692198] RSP: 0000:ffffb536410dfc98 EFLAGS: 00010282
[ 9274.692217] RAX: 0000000000000000 RBX: ffff8ab893ba48c0 RCX: 0000000000000006
[ 9274.692242] RDX: 0000000000000007 RSI: 0000000000000086 RDI: ffff8ab896f168b0
[ 9274.692266] RBP: ffff8ab893ba4ce8 R08: ffffb53640000000 R09: 00000000000002dc
[ 9274.692290] R10: ffffb53640363d60 R11: ffffffffb59efe4d R12: ffff8ab884358760
[ 9274.692315] R13: ffff8ab884358760 R14: ffff8ab884358ef0 R15: ffff8ab893ba53a0
[ 9274.692340] FS:  0000000000000000(0000) GS:ffff8ab896f00000(0000) knlGS:0000000000000000
[ 9274.692364] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 9274.692384] CR2: 00007f1e53349f90 CR3: 000000000b20a003 CR4: 00000000001606e0
[ 9274.692409] Call Trace:
[ 9274.692447]  ieee80211_do_stop+0x4f9/0x860 [mac80211]
[ 9274.692482]  ieee80211_stop+0x16/0x20 [mac80211]
[ 9274.692503]  __dev_close_many+0xa1/0x110
[ 9274.692517]  dev_close_many+0x9f/0x160
[ 9274.692531]  dev_close.part.99+0x64/0xa0
[ 9274.692563]  cfg80211_shutdown_all_interfaces+0x43/0xd0 [cfg80211]
[ 9274.692601]  ieee80211_reconfig+0x8b/0x1140 [mac80211]
[ 9274.692631]  ieee80211_restart_work+0xbb/0xe0 [mac80211]
[ 9274.692653]  process_one_work+0x191/0x370
[ 9274.692670]  worker_thread+0x4f/0x3b0
[ 9274.692686]  kthread+0xf8/0x130
[ 9274.692702]  ? rescuer_thread+0x340/0x340
[ 9274.692717]  ? kthread_create_worker_on_cpu+0x70/0x70
[ 9274.692743]  ret_from_fork+0x35/0x40
[ 9274.692759] ---[ end trace 3adece76f5f16d5d ]---
[ 9274.693263] ------------[ cut here ]------------
[ 9274.693314] WARNING: CPU: 1 PID: 1761 at /home/user/rpmbuild/BUILD/kernel-4.19.84/linux-4.19.84/net/mac80211/driver-ops.c:39 drv_stop+0xff/0x110 [mac80211]
[ 9274.693355] Modules linked in: iwldvm iwlwifi mac80211 cfg80211 ehci_pci ehci_hcd xt_nat ccm fuse nft_reject_ipv4 nft_reject nft_ct nf_tables nfnetlink ip6table_raw iptable_raw xen_netback xt_REDIRECT ip6table_filter ip6_tables xt_conntrack ipt_MASQUERADE iptable_nat nf_nat_ipv4 nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c joydev arc4 intel_rapl crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel intel_rapl_perf ttm pcspkr serio_raw e1000e drm_kms_helper ata_generic pata_acpi rfkill drm i2c_piix4 floppy u2mfn(O) xen_gntdev xen_gntalloc xen_blkback xenfs xen_evtchn xen_privcmd overlay xen_blkfront [last unloaded: cfg80211]
[ 9274.698012] CPU: 1 PID: 1761 Comm: kworker/1:1 Tainted: G        W  O      4.19.84-1.pvops.qubes.x86_64 #1
[ 9274.698044] Hardware name: Xen HVM domU, BIOS 4.8.5-12.fc25 11/13/2019
[ 9274.698088] Workqueue: events_freezable ieee80211_restart_work [mac80211]
[ 9274.698126] RIP: 0010:drv_stop+0xff/0x110 [mac80211]
[ 9274.698147] Code: 48 8b 7d 08 48 83 c5 18 48 89 de e8 5b 16 56 f4 48 8b 45 00 48 85 c0 75 e7 e9 46 ff ff ff 48 c7 c7 a0 76 71 c0 e8 af 2b a7 f3 <0f> 0b 5b 5d c3 66 90 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00
[ 9274.698203] RSP: 0000:ffffb536410dfca0 EFLAGS: 00010286
[ 9274.698221] RAX: 0000000000000024 RBX: ffff8ab884358760 RCX: 0000000000000000
[ 9274.698245] RDX: 0000000000000000 RSI: ffff8ab896f168b8 RDI: ffff8ab896f168b8
[ 9274.698269] RBP: ffff8ab884358ff8 R08: ffffb53640000000 R09: 00000000000002fd
[ 9274.698293] R10: ffffb536410dfca8 R11: ffffffffb59efe4d R12: ffff8ab884358b90
[ 9274.698317] R13: ffff8ab884358760 R14: ffff8ab884358ef0 R15: ffff8ab893ba53a0
[ 9274.698342] FS:  0000000000000000(0000) GS:ffff8ab896f00000(0000) knlGS:0000000000000000
[ 9274.698367] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 9274.698389] CR2: 00007f1e51571548 CR3: 000000000b20a003 CR4: 00000000001606e0
[ 9274.698415] Call Trace:
[ 9274.698451]  ieee80211_do_stop+0x4e2/0x860 [mac80211]
[ 9274.698486]  ieee80211_stop+0x16/0x20 [mac80211]
[ 9274.698509]  __dev_close_many+0xa1/0x110
[ 9274.698526]  dev_close_many+0x9f/0x160
[ 9274.698542]  dev_close.part.99+0x64/0xa0
[ 9274.700409]  cfg80211_shutdown_all_interfaces+0x43/0xd0 [cfg80211]
[ 9274.700458]  ieee80211_reconfig+0x8b/0x1140 [mac80211]
[ 9274.700491]  ieee80211_restart_work+0xbb/0xe0 [mac80211]
[ 9274.700515]  process_one_work+0x191/0x370
[ 9274.700530]  worker_thread+0x4f/0x3b0
[ 9274.700544]  kthread+0xf8/0x130
[ 9274.700560]  ? rescuer_thread+0x340/0x340
[ 9274.700574]  ? kthread_create_worker_on_cpu+0x70/0x70
[ 9274.700593]  ret_from_fork+0x35/0x40
[ 9274.700609] ---[ end trace 3adece76f5f16d5e ]---
[ 9274.759827] IPv6: ADDRCONF(NETDEV_UP): wls6: link is not ready
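
To check whether another machine shows the same failure, a saved kernel log can be searched for a few of the messages from the trace above. This is a rough sketch: the function name `has_iwlwifi_restart_failure` is hypothetical, and the choice of which lines count as the signature is my own guess, not a confirmed diagnosis.

```shell
#!/bin/sh
# Succeed if a saved kernel log contains any of the iwlwifi failure
# messages seen in the trace above. The function name is hypothetical and
# the pattern list is an assumption about which lines matter.
#   $1  path to a file holding dmesg output
has_iwlwifi_restart_failure() {
    grep -Eq \
        -e 'iwlwifi .*Failed to load firmware chunk' \
        -e 'iwlwifi .*Failed to run INIT ucode' \
        -e 'Hardware became unavailable during restart' \
        -- "$1"
}

# Example (inside the net qube):
#   dmesg > /tmp/dmesg.txt
#   has_iwlwifi_restart_failure /tmp/dmesg.txt && echo "iwlwifi restart failure found"
```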

@tetrahedras

Creating a new sys-net does not appear to have fixed the issue, crashes still occur.

@w1k1n9cc

I sometimes face this issue too, and with sys-usb as well. I also have a T450. I will post the output of

sudo journalctl -u qubes-suspend

the next time it occurs.

@brycepg

brycepg commented Oct 1, 2020

@andrewdavidwong Could you claim the bounty I made please? Apparently BountySource will take the bounty after 2 years which is 2 months away, and I want it to go to a Qubes member.


I've pretty much never had this issue after getting a new (used) T530 which supports HVM so I could install Qubes 4.0

@andrewdavidwong
Member Author

andrewdavidwong commented Oct 1, 2020

> @andrewdavidwong Could you claim the bounty I made please? Apparently BountySource will take the bounty after 2 years which is 2 months away, and I want it to go to a Qubes member.

Thanks for letting us know, @brycepg. I think the bounty should be turned into a donation to the Qubes OS Project, if possible. @mfc, @MiCh, do you know how to do that?

@andrewdavidwong
Member Author

andrewdavidwong commented Oct 1, 2020

> I've pretty much never had this issue after getting a new (used) T530 which supports HVM so I could install Qubes 4.0

I also have not experienced this bug in a very long time, even on the same hardware as when I filed this report. It looks like the last report was from @w1k1n9cc on May 24. @w1k1n9cc, are you still experiencing this?

@w1k1n9cc

w1k1n9cc commented Oct 1, 2020

My Qubes PC is not very active at the moment. I will try it by Tuesday. Maybe I will have some spare time at the weekend.

@andrewdavidwong andrewdavidwong added R: duplicate Resolution: Another issue exists that is very similar to or subsumes this one. and removed needs diagnosis Requires technical diagnosis from developer. Replace with "diagnosed" or remove if otherwise closed. labels May 10, 2022
@quantumpacket

@andrewdavidwong I think the reason everyone came here is that this issue was still open, and the other was closed. Now they are both closed. But the issue is clearly not fixed.

@DemiMarie DemiMarie reopened this May 10, 2022
@DemiMarie DemiMarie added needs diagnosis Requires technical diagnosis from developer. Replace with "diagnosed" or remove if otherwise closed. and removed R: duplicate Resolution: Another issue exists that is very similar to or subsumes this one. labels May 10, 2022
@andrewdavidwong
Member Author

andrewdavidwong commented May 10, 2022

@DemiMarie, you're the one who asked whether this is a duplicate of #4042. Do you no longer have reason to think it is? If so, what are those reasons?

And please don't say "because the other one is closed." That would be a reason to reopen the other issue, not this one!

@DemiMarie

Whoops!

@DemiMarie DemiMarie added R: duplicate Resolution: Another issue exists that is very similar to or subsumes this one. and removed needs diagnosis Requires technical diagnosis from developer. Replace with "diagnosed" or remove if otherwise closed. labels May 10, 2022
@DemiMarie DemiMarie reopened this May 10, 2022
@DemiMarie DemiMarie removed the R: duplicate Resolution: Another issue exists that is very similar to or subsumes this one. label May 10, 2022
@DemiMarie

@andrewdavidwong #4042 affects sys-usb while this one affects sys-net.

@tetrahedras

> Try disabling the /usr/lib/systemd/system/systemd-udevd.service watchdog in your sys-net template and please report back if it helps.

I tried disabling this service via systemctl stop systemd-udevd && systemctl disable systemd-udevd, and it didn't fix the issue. Disabling the service via Qubes VM settings (adding an entry for systemd-udevd and unchecking the box) also did not help.

@DemiMarie

> > Try disabling the /usr/lib/systemd/system/systemd-udevd.service watchdog in your sys-net template and please report back if it helps.
>
> I tried disabling this service via systemctl stop systemd-udevd && systemctl disable systemd-udevd, and it didn't fix the issue. Disabling the service via Qubes VM settings (adding an entry for systemd-udevd and unchecking the box) also did not help.

Disabling systemd-udevd will break a lot of stuff.
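
The original suggestion was to disable the service's watchdog, not the service itself. One way to try that, sketched here under assumptions, is a systemd drop-in that sets `WatchdogSec=0` (a value of 0 turns the watchdog off) while leaving udevd running. The function name `write_udevd_watchdog_override` and the drop-in file name `30_no-watchdog.conf` are hypothetical, and I have not confirmed that this helps with the sys-net crash.

```shell
#!/bin/sh
# Write a systemd drop-in that sets WatchdogSec=0 for systemd-udevd, which
# turns off the service watchdog without stopping udevd itself.
# The function name and the drop-in file name are hypothetical.
#   $1  systemd unit directory (defaults to /etc/systemd/system)
write_udevd_watchdog_override() {
    dir="${1:-/etc/systemd/system}/systemd-udevd.service.d"
    mkdir -p "$dir" &&
        printf '[Service]\nWatchdogSec=0\n' > "$dir/30_no-watchdog.conf"
}

# Example (as root in the sys-net template):
#   write_udevd_watchdog_override
#   systemctl daemon-reload
#   systemctl restart systemd-udevd
```

A drop-in is used instead of editing /usr/lib/systemd/system/systemd-udevd.service directly so that package updates do not overwrite the change.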

@3hhh

3hhh commented May 11, 2022 via email

@tetrahedras

tetrahedras commented May 12, 2022 via email

@3hhh

3hhh commented May 12, 2022 via email

@tetrahedras

Still experiencing this problem. In my case, it ONLY affects WiFi; Ethernet connections continue normally. See also #5508.

@brycepg

brycepg commented Jan 10, 2023

I haven't had this happen to me for a while, and I haven't had it happen at all since upgrading to 4.1 (Lenovo T530). My sys-net is on fedora-35.

@DemiMarie

> I haven't had this happen to me for a while, and I haven't had it happen at all since upgrading to 4.1 (Lenovo T530). My sys-net is on fedora-35.

Fedora 35 is EOL, FYI

@andrewdavidwong andrewdavidwong added the affects-4.1 This issue affects Qubes OS 4.1. label Aug 8, 2023
@andrewdavidwong andrewdavidwong removed this from the Release 4.1 updates milestone Aug 13, 2023
@andrewdavidwong andrewdavidwong added the eol-4.1 Closed because Qubes 4.1 has reached end-of-life (EOL) label Dec 7, 2024

github-actions bot commented Dec 7, 2024

This issue is being closed because:

If anyone believes that this issue should be reopened, please leave a comment saying so.
(For example, if a bug still affects Qubes OS 4.2, then the comment "Affects 4.2" will suffice.)

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Dec 7, 2024