Temporarily exclude known-failing tests #12849
Conversation
When restructuring the zvol_open() logic for the Linux 5.13 kernel a lock inversion was accidentally introduced. In the updated code the spa_namespace_lock is now taken before the zv_suspend_lock, allowing the following scenario to occur:

down_read                <=== waiting for zv_suspend_lock
zvol_open                <=== holds spa_namespace_lock
__blkdev_get
blkdev_get_by_dev
blkdev_open
...

mutex_lock               <=== waiting for spa_namespace_lock
spa_open_common
spa_open
dsl_pool_hold
dmu_objset_hold_flags
dmu_objset_hold
dsl_prop_get
dsl_prop_get_integer
zvol_create_minor
dmu_recv_end
zfs_ioc_recv_impl        <=== holds zv_suspend_lock via zvol_suspend()
zfs_ioc_recv
...

This commit resolves the issue by moving the acquisition of the spa_namespace_lock back to after the zv_suspend_lock, which restores the original ordering. Additionally, as part of this change the error exit paths were simplified where possible.

Signed-off-by: Brian Behlendorf <[email protected]>
Issue openzfs#12849
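For illustration only, here is a minimal Python sketch of the lock-ordering rule the commit message describes. The names spa_namespace_lock and zv_suspend_lock are borrowed from the trace above, but the real code is C inside the kernel module; this is just a sketch of why giving every path the same acquisition order (zv_suspend_lock before spa_namespace_lock) removes the inversion.

import threading

# Stand-ins for the two kernel locks from the trace above (illustrative only).
spa_namespace_lock = threading.Lock()
zv_suspend_lock = threading.Lock()

def zvol_open_fixed():
    # Fixed ordering: zv_suspend_lock first, then spa_namespace_lock,
    # matching the original code and the receive path below.
    with zv_suspend_lock:
        with spa_namespace_lock:
            pass  # ... first-open work that may need the spa ...

def zfs_recv_path():
    # zvol_suspend() takes zv_suspend_lock; the receive then reaches
    # spa_open(), which needs spa_namespace_lock.
    with zv_suspend_lock:
        with spa_namespace_lock:
            pass  # ... property lookups / minor creation work ...

# Both threads acquire the locks in the same order, so neither can end up
# holding one lock while waiting forever on the other.
t1 = threading.Thread(target=zvol_open_fixed)
t2 = threading.Thread(target=zfs_recv_path)
t1.start(); t2.start()
t1.join(); t2.join()
print("no deadlock: both paths honor zv_suspend_lock -> spa_namespace_lock")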
Force-pushed from 1071fc6 to 1dd2993.
Well, the CI being marked X for every run isn't that useful, so let's mark these as known failures until someone gets a chance to look at them...

Signed-off-by: Rich Ercolani <[email protected]>
Force-pushed from 1dd2993 to 149a16a.
@@ -294,8 +298,6 @@ elif sys.platform.startswith('linux'):
     'mmp/mmp_active_import': ['FAIL', known_reason],
     'mmp/mmp_exported_import': ['FAIL', known_reason],
     'mmp/mmp_inactive_import': ['FAIL', known_reason],
-    'refreserv/refreserv_raidz': ['FAIL', known_reason],
-    'snapshot/rollback_003_pos': ['FAIL', known_reason],
Both of these exceptions were recently removed from master (#12897, #12898), so you can drop this hunk with a rebase. In fact, looking more carefully I see all three of the remaining tests are already included in the exceptions. So it seems like we could just close out this PR as no longer needed.
That said, I seem to be able to reliably reproduce the enospc_002_pos failure, so I'm going to have a look at that.
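For context, here is a hedged sketch of how a known-failures table like the one in the hunk above is typically consulted when summarizing test results. The dict entries mirror the style of the hunk, but the summarize() helper, its results input, and the known_reason value are illustrative assumptions, not the actual zts-report.py code.

import sys

known_reason = 'Known issue'  # placeholder reason string (assumption)

# Entries mirror the style of the hunk above; adding a test here means a
# FAIL from it is treated as expected instead of failing the whole run.
known = {
    'mmp/mmp_active_import':   ['FAIL', known_reason],
    'mmp/mmp_exported_import': ['FAIL', known_reason],
    'mmp/mmp_inactive_import': ['FAIL', known_reason],
}

def summarize(results):
    """Return a nonzero exit code only for unexpected failures.

    `results` maps test name -> outcome string, e.g.
    {'mmp/mmp_active_import': 'FAIL'} (hypothetical input format).
    """
    unexpected = 0
    for test, outcome in results.items():
        if outcome != 'PASS' and known.get(test, [None])[0] != outcome:
            print(f'UNEXPECTED {outcome}: {test}')
            unexpected += 1
    return 1 if unexpected else 0

if __name__ == '__main__':
    sys.exit(summarize({'mmp/mmp_active_import': 'FAIL',
                        'some/other_test': 'PASS'}))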
Motivation and Context
Make the 20.04 runner not report failure every time until #12848 is resolved.
Description
"Yes, we know these might fail, don't scream about it."
How Has This Been Tested?
Not even slightly.
Types of changes
Checklist:
All commit messages are properly formatted and contain Signed-off-by.