
Standard fsck isn't run on VM startup #979

Closed
nrgaway opened this issue Apr 29, 2015 · 7 comments
Labels
affects-4.1 (This issue affects Qubes OS 4.1.) · C: core · eol-4.1 (Closed because Qubes 4.1 has reached end-of-life (EOL).) · P: minor (Priority: minor. The lowest priority, below "default.") · T: bug (Type: bug report. A problem or defect resulting in unintended behavior in something that exists.)

Comments


nrgaway commented Apr 29, 2015

/rw is not mounted in the standard way from /etc/fstab, because some VMs (for example DispVMs) do not have /rw mounted at all. This is also the reason why the standard fsck isn't run on VM startup.

Possible solutions:
Add a feature to qubes-manager to:

  1. Show if there is a disk error
  2. Check the disk
  3. Repair the disk
@marmarek marmarek added T: enhancement Type: enhancement. A new feature that does not yet exist or improvement of existing functionality. C: core P: minor Priority: minor. The lowest priority, below "default." labels May 12, 2015
@marmarek marmarek added this to the Release 3.1 milestone May 12, 2015

marmarek commented Aug 5, 2015

Currently, if one wants to run fsck on /rw, it requires the following steps (a scripted sketch follows the list):

  1. Adding the single parameter to the kernel command line (qvm-prefs -s VMNAME kernelopts single)
  2. Starting the VM - it will time out on the qrexec connection, but that's OK
  3. Accessing the VM console using sudo xl console VMNAME
  4. Getting shell access (just press Enter when prompted for the root password)
  5. Running fsck on /dev/xvdb (/rw): fsck -f /dev/xvdb
  6. Shutting down the VM - poweroff from that shell
  7. Restoring the default kernel command line: qvm-prefs -s VMNAME kernelopts default
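
A minimal dom0 sketch of the same procedure (VMNAME is passed as an argument; steps 3-6 remain interactive on the VM console):

#!/bin/sh
# Sketch of the manual fsck procedure above; run in dom0.
set -e
VMNAME="$1"

# 1. Boot the VM into single-user mode so /rw is not mounted.
qvm-prefs -s "$VMNAME" kernelopts single

# 2. Start the VM; the qrexec connection timeout can be ignored.
qvm-start "$VMNAME" || true

# 3.-6. Interactive part: in the console, press Enter at the root
#       password prompt, run `fsck -f /dev/xvdb`, then `poweroff`.
sudo xl console "$VMNAME"

# 7. Restore the default kernel command line once the VM is down.
qvm-prefs -s "$VMNAME" kernelopts default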

@marmarek (Member)

We should go with running the standard fsck on VM startup. If it isn't desirable for some VM (one which doesn't have /rw at all, such as a DispVM), this could be addressed in the .mount unit file with a condition.
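
A sketch of what such a condition could look like in a systemd mount unit (rw.mount is illustrative; Qubes currently mounts /rw from a script, so this only shows the condition mechanism):

[Unit]
Description=Mount /rw private volume
# Skip this unit (and any fsck ordered before it) when the private
# device does not exist, e.g. in a DispVM without /rw.
ConditionPathExists=/dev/xvdb

[Mount]
What=/dev/xvdb
Where=/rw
Type=ext4
Options=discard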


cfcs commented Aug 27, 2015

Alternatively, you can run fsck on private.img from dom0 (if you trust fsck not to have security flaws, which is admittedly a risky assumption). Either way, it's a required step for shrinking/compacting AppVM filesystems to free up space.
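
For example, a dom0 sketch using the R3.x default path (VMNAME is a placeholder; the VM must be shut down so the image is not in use):

# Run only while the VM is powered off.
qvm-shutdown --wait VMNAME
fsck.ext4 -f /var/lib/qubes/appvms/VMNAME/private.img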


Rudd-O commented Oct 23, 2016

The agent really should detect whether there is an error in the kernel ring buffer or the journald log, and submit that error to dom0 so it can be displayed in qubes-manager or as a notification.

But this will not cover the case of a system failing to boot entirely. In that case, monitoring the console log in dom0 could provide a mechanism that lets the user know (via qubes-manager or a notification) that the VM is not booting properly, much faster than just waiting for a timeout and then dying.
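
A rough in-VM sketch of the detection part (qubes.NotifyDiskError is a hypothetical qrexec service name, used only for illustration):

#!/bin/sh
# Scan this boot's kernel log for ext4 errors/warnings and, if any are
# found, forward them to dom0 over a (hypothetical) qrexec service so
# dom0 can show a notification or surface them in qubes-manager.
errors=$(journalctl -k -b | grep -iE 'EXT4-fs (error|warning)')
if [ -n "$errors" ]; then
    echo "$errors" | qrexec-client-vm dom0 qubes.NotifyDiskError
fi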


v6ak commented Jul 30, 2019

Hello,
I am not sure whether we should reopen this issue or create a new one. This issue does not seem to be fully fixed. Here is the interesting part of the journalctl output:

Jul 30 07:48:53 localhost mount-dirs.sh[294]: Private device management: checking /dev/xvdb
Jul 30 07:48:53 localhost mount-dirs.sh[294]: Private device management: checking /dev/xvdb
Jul 30 07:48:53 localhost systemd[1]: Started udev Kernel Device Manager.
Jul 30 07:48:53 localhost mount-dirs.sh[294]: Private device management: fsck.ext4 /dev/xvdb failed:
Jul 30 07:48:53 localhost mount-dirs.sh[294]: /dev/xvdb contains a file system with errors, check forced.
Jul 30 07:48:53 localhost mount-dirs.sh[294]: /dev/xvdb: Inodes that were part of a corrupted orphan linked list found.
Jul 30 07:48:53 localhost mount-dirs.sh[294]: /dev/xvdb: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.
Jul 30 07:48:53 localhost mount-dirs.sh[294]:         (i.e., without -a or -p options)
Jul 30 07:48:53 localhost resize-rootfs-if-needed.sh[293]: dumpe2fs 1.44.5 (15-Dec-2018)
Jul 30 07:48:53 localhost kernel: EXT4-fs (xvdb): warning: mounting fs with errors, running e2fsck is recommended
Jul 30 07:48:53 localhost kernel: EXT4-fs (xvdb): mounted filesystem with ordered data mode. Opts: discard
Jul 30 07:48:53 localhost resize-rootfs-if-needed.sh[293]: root filesystem already at 31041496 blocks
Jul 30 07:48:53 localhost systemd[1]: Started Adjust root filesystem size.
Jul 30 07:48:53 localhost mount-dirs.sh[294]: Checking /rw
Jul 30 07:48:53 localhost mount-dirs.sh[294]: Private device size management: enlarging /dev/xvdb
Jul 30 07:48:53 localhost mount-dirs.sh[294]: Private device size management: resize2fs /dev/xvdb failed:
Jul 30 07:48:53 localhost mount-dirs.sh[294]: resize2fs 1.44.5 (15-Dec-2018)
Jul 30 07:48:53 localhost mount-dirs.sh[294]: resize2fs: Permission denied to resize filesystem
Jul 30 07:48:53 localhost mount-dirs.sh[294]: Filesystem at /dev/xvdb is mounted on /rw; on-line resizing required
Jul 30 07:48:53 localhost mount-dirs.sh[294]: old_desc_blocks = 2, new_desc_blocks = 2
Jul 30 07:48:53 localhost kernel: EXT4-fs warning (device xvdb): ext4_resize_begin:46: There are errors in the filesystem, so online resizing is not allowed

So, the automatic check has failed. I am not notified (unless I read journalctl after every boot), and I am not sure how to run it manually:

$ qvm-block attach disp2095 dom0:/dev/qubes_dom0/vm-XXXXX-private
qvm-block: error: backend vm 'dom0' doesn't expose device '/dev/qubes_dom0/vm-XXXXX-private'

How did this happen? I guess it is related to having had a full LVM thin-provisioned pool in the past.
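
For what it's worth, one possible manual route from dom0 (assuming the default qubes_dom0 thin pool and that the VM is shut down; the volume name keeps the placeholder from above):

# Activate the thin volume if needed, check it, then deactivate it again.
sudo lvchange -ay qubes_dom0/vm-XXXXX-private
sudo fsck.ext4 -f /dev/qubes_dom0/vm-XXXXX-private
sudo lvchange -an qubes_dom0/vm-XXXXX-private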

@andrewdavidwong andrewdavidwong modified the milestones: Release 4.0 updates, Release TBD Apr 7, 2023
@andrewdavidwong andrewdavidwong added T: bug Type: bug report. A problem or defect resulting in unintended behavior in something that exists. and removed T: enhancement Type: enhancement. A new feature that does not yet exist or improvement of existing functionality. labels Apr 7, 2023
@andrewdavidwong andrewdavidwong modified the milestones: Release TBD, Release 4.1 updates Apr 7, 2023
@andrewdavidwong (Member)

@v6ak, @nrgaway: Is this still a problem on 4.1?

@andrewdavidwong andrewdavidwong added the affects-4.1 This issue affects Qubes OS 4.1. label Aug 8, 2023
@andrewdavidwong andrewdavidwong removed this from the Release 4.1 updates milestone Aug 13, 2023
@andrewdavidwong andrewdavidwong added the eol-4.1 Closed because Qubes 4.1 has reached end-of-life (EOL) label Dec 7, 2024

github-actions bot commented Dec 7, 2024

This issue is being closed because Qubes OS 4.1 has reached end-of-life (EOL).

If anyone believes that this issue should be reopened, please leave a comment saying so.
(For example, if a bug still affects Qubes OS 4.2, then the comment "Affects 4.2" will suffice.)

@github-actions github-actions bot closed this as not planned Dec 7, 2024