
Issue 2094 #2109

Closed
wants to merge 2 commits into from

Conversation

ryao
Contributor

@ryao ryao commented Feb 5, 2014

This builds on previous work by @dweeezil in #2097 and should be ready to be merged.

ryao added 2 commits February 5, 2014 17:20
Commit 1421c89 added a field to
zbookmark_t that unintentionally caused a disk format change. This
negatively affected backward compatibility and platform portability.
Therefore, this field is being removed.

The functionality that field provided is left unimplemented until a
later patch reimplements it in a way that does not affect the disk
format.

Signed-off-by: Richard Yao <[email protected]>
ZoL commit 1421c89 unintentionally
changed the disk format in a forward-compatible, but not backward
compatible way. This was accomplished by adding an entry to zbookmark_t,
which is included in a couple of on-disk structures. That led to the
creation of pools with incorrect dsl_scan_phys_t objects that could only
be imported by versions of ZFSOnLinux containing that commit. Such pools
cannot be imported by other versions of ZFS. That includes past and
future versions of ZoL. The additional field has been removed by a prior
patch. This patch permits affected pools to be imported and repaired by
implementing a workaround for the erratum. When this workaround is
triggered, a kernel alert describing this will be printed to dmesg
asking the system administrator to run a scrub. A subsequent scrub will
replace the damaged dsl_scan_phys_t object, repairing the pool.
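
A minimal sketch of how such a detection could work at import time (hypothetical, in the style of module/zfs/dsl_scan.c; zap_length() and the pool directory names are real ZFS interfaces, but this function and its alert text are invented for illustration):

/*
 * Hypothetical: probe the stored size of the on-disk scan object. An
 * undamaged dsl_scan_phys_t occupies SCAN_PHYS_NUMINTS (24) 64-bit
 * words; pools written by commit 1421c89 stored one extra word.
 */
static boolean_t
dsl_scan_object_damaged(dsl_pool_t *dp)
{
        uint64_t intsz, nints;

        if (zap_length(dp->dp_meta_objset, DMU_POOL_DIRECTORY_OBJECT,
            DMU_POOL_SCAN, &intsz, &nints) == 0 &&
            nints == SCAN_PHYS_NUMINTS + 1) {
                cmn_err(CE_WARN, "Pool '%s' is affected by a disk "
                    "format erratum; please run a scrub to repair it.",
                    spa_name(dp->dp_spa));
                return (B_TRUE);
        }
        return (B_FALSE);
}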

Pools affected by the damaged dsl_scan_phys_t can be detected prior to
an upgrade by running the following command as root:

zdb -dddd poolname 1 | grep -P '^\t\tscan = ' | sed -e 's;scan = ;;' | wc -w

Note that `poolname` must be replaced with the name of the pool you wish
to check. A value of 25 indicates the dsl_scan_phys_t has been damaged.
A value of 24 indicates that the dsl_scan_phys_t is normal. A value of 0
indicates that there has never been a scrub run on the pool.
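
For reference, the expected count of 24 falls directly out of the structure size; this is how dsl_scan.c sizes the on-disk object (the extra zbookmark_t field from commit 1421c89 is what pushed it to 25):

/*
 * dsl_scan_phys_t embeds one zbookmark_t (normally four uint64_t
 * fields), for 24 words in total; the field added by commit 1421c89
 * enlarged zbookmark_t and thus this count to 25.
 */
#define SCAN_PHYS_NUMINTS (sizeof (dsl_scan_phys_t) / sizeof (uint64_t))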

The regression caused by the change to zbookmark_t never made it into a
tagged release or any Gentoo backports. Only those using HEAD were
affected. However, this patch has a few limitations. A damaged
dsl_scan_phys_t object cannot be detected when the damage occurred on
32-bit systems, where integer rounding wrote incorrect information but
avoided the overflow. Correcting such issues requires triggering a
scrub.

In addition, bptree_entry_phys_t objects are also affected. These
objects only exist during an asynchronous destroy and automating repair
of damaged bptree_entry_phys_t objects is non-trivial. Any pools that
have been imported by an affected version of ZoL must have all
asynchronous destroy operations finish before export and subsequent
import by a version containing this commit.  Failure to do that will
prevent pool import. The presence of any background destroys on any
imported pools can be checked by running `zpool get freeing` as root.
This will display a non-zero value for any pool with an active
asynchronous destroy.

Lastly, it is expected that no user data has been lost as a result of
this erratum.

Closes openzfs#2094

Original-patch-by: Tim Chase <[email protected]>
Signed-off-by: Richard Yao <[email protected]>
@dweeezil
Contributor

dweeezil commented Feb 5, 2014

Other than the note I just sent, looks good to me. These instructions should be in the release notes for 0.6.3 (we do have release notes, don't we :)

@behlendorf
Contributor

These instructions should be in the release notes for 0.6.3

I'm conflicted about this point. This will never be an issue for people who have only run the tagged releases and I've only written up release notes for the tags.

(we do have release notes, don't we :)

I usually write up something when I tag. Although I/we could do a lot better and commit them to the tree.

@behlendorf behlendorf added this to the 0.6.3 milestone Feb 6, 2014
@dweeezil
Contributor

dweeezil commented Feb 6, 2014

What about adding a flag to the spa struct that would get scooped up here:

--- a/module/zfs/spa_misc.c
+++ b/module/zfs/spa_misc.c
@@ -1796,6 +1796,9 @@ spa_scan_get_stats(spa_t *spa, pool_scan_stat_t *ps)
        ps->pss_pass_start = spa->spa_scan_pass_start;
        ps->pss_pass_exam = spa->spa_scan_pass_exam;

+       /* zol scan errata flag */
+       ps->pss_zol_errata = spa->spa_scan_zol_errata;
+
        return (0);
 }

and then generate a message here:

--- a/cmd/zpool/zpool_main.c
+++ b/cmd/zpool/zpool_main.c
@@ -3909,6 +3909,13 @@ print_scan_status(pool_scan_stat_t *ps)

        assert(ps->pss_func == POOL_SCAN_SCRUB ||
            ps->pss_func == POOL_SCAN_RESILVER);
+
+       /*
+        * zol errata
+        */
+       if (ps->pss_zol_errata)
+               printf("... something about zol errata ...\n");
+
        /*
         * Scan is finished or canceled.
         */

@behlendorf
Contributor

@dweeezil I think that's a nice solution. It alleviates my concerns about an end user never knowing they need to do a scrub, and hopefully @ryao's concerns about doing it automatically. Perhaps something like this (untested):

diff --git a/cmd/zpool/zpool_main.c b/cmd/zpool/zpool_main.c
index a856fd4..9b4b904 100644
--- a/cmd/zpool/zpool_main.c
+++ b/cmd/zpool/zpool_main.c
@@ -3903,6 +3903,19 @@ print_scan_status(pool_scan_stat_t *ps)
                return;
        }

+       /*
+        * Scan required, https://github.com/zfsonlinux/zfs/issues/2094.
+        */
+       if ((ps->pss_errata & POOL_ERRATA_2094) &&
+           (ps->pss_state == DSS_FINISHED || ps->pss_state == DSS_CANCELED)) {
+               (void) printf(gettext("detected pool compatibility issue\n"));
+               (void) printf(gettext(
+                   "   see: https://github.com/zfsonlinux/zfs/issues/2094\n"));
+               (void) printf(gettext(
+                   "action: To correct the issue run 'zpool scrub'.\n"));
+               return;
+       }
+
        start = ps->pss_start_time;
        end = ps->pss_end_time;
        zfs_nicenum(ps->pss_processed, processed_buf, sizeof (processed_buf));

This should show the end user a message like the following when they run zpool status. It uses the same format as all the other see/action messages end users may get.

  pool: rpool
 state: ONLINE
  scan: detected pool compatibility issue
   see: https://github.com/zfsonlinux/zfs/issues/2094
action: To correct the issue run 'zpool scrub'.
config:
   ...

@dweeezil
Contributor

dweeezil commented Feb 6, 2014

@behlendorf Yes, that's exactly what I had in mind.

What are your thoughts on the need to handle the hopefully very rare situation where an async destroy is in progress? It seems we'd have to fail the import in that case. We could look at the new spa flag following the dsl_scan_init() call at the bottom of dsl_pool_open and bail if the earlier async destroy stuff had been triggered. Presumably just about any error we return will be propagated back as some sort of un-importable pool failure.
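
A rough sketch of that bail-out (hypothetical; spa_scan_zol_errata is the flag proposed above, and dp_bptree_obj is only non-zero while an async destroy is in progress):

/*
 * Hypothetical: at the bottom of dsl_pool_open(), refuse the open when
 * the errata flag was set and an async destroy is still pending, so
 * the import fails cleanly instead of misparsing bptree entries.
 */
err = dsl_scan_init(dp, txg);
if (err == 0 && dp->dp_spa->spa_scan_zol_errata != 0 &&
    dp->dp_bptree_obj != 0)
        err = SET_ERROR(ENOTSUP);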

@behlendorf
Contributor

@dweeezil I think we shouldn't try to handle the async destroy case, which I expect will be very rare. We just want to ensure the import fails cleanly in that case.

The few (if any) users who are impacted can continue to run the version of the code they have until the async destroy completes. Once it is done, they should be able to update their code and then run a scrub.

I'd prefer to avoid inventing more machinery for this than we absolutely need to.

@dweeezil
Contributor

dweeezil commented Feb 6, 2014

@behlendorf I agree.

@ryao
Contributor Author

ryao commented Feb 6, 2014

@behlendorf @dweeezil We can partially handle the async_destroy case by checking if the feature is activated when we detect the corrupt object in dsl_scan_init(). There is already a check there, so we simply need to hook into it. This works because the feature is deactivated whenever an async_destroy finishes.
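
In code, that check might look roughly like this (a sketch of the idea, not the final patch; the error value is chosen arbitrarily):

/*
 * Sketch: in dsl_scan_init(), after detecting the oversized scan
 * object, fail the import only while async_destroy is active; the
 * feature is deactivated once an async destroy completes, so idle
 * pools can still import and be repaired by a scrub.
 */
if (spa_feature_is_active(dp->dp_spa, SPA_FEATURE_ASYNC_DESTROY))
        return (SET_ERROR(EOVERFLOW));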

@behlendorf
Contributor

@ryao Do you have an update for this? I'd like to push this over the finish line this week; I can propose the patch if you're swamped.

@behlendorf behlendorf mentioned this pull request Feb 11, 2014
@behlendorf
Contributor

Closed in favor of #2112

@behlendorf behlendorf closed this Feb 11, 2014