Possibility of implementing RAID mode using erasure codes #558
Comments
Without investigating this too carefully, I suspect it would be possible for ZFS to implement this. The various redundancy layouts used by ZFS (mirrors, raidz) are very modular, and adding another one for erasure codes or, say, distributed parity should be doable; it's just a matter of implementing those policies.
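To make "implementing those policies" a bit more concrete, here is a purely hypothetical sketch, in C, of the sort of hook table a new redundancy layout would have to supply: mapping a logical I/O onto child disks, generating parity/shares on the write path, and reconstructing data on the read path when children are missing. The names and fields are invented for illustration and are not ZFS's actual vdev_ops_t interface.

```c
/*
 * Hypothetical illustration only -- NOT ZFS's real vdev_ops_t.  A new
 * layout (erasure coding, distributed parity, ...) would boil down to
 * supplying a set of policy callbacks along these lines.
 */
#include <stdint.h>
#include <stddef.h>

struct child_io {
	int	 ci_child;	/* index of the child disk */
	uint64_t ci_offset;	/* byte offset on that child */
	void	*ci_buf;	/* data or parity buffer */
	size_t	 ci_len;	/* length of the buffer */
};

typedef struct redundancy_policy {
	const char *rp_name;	/* "mirror", "raidz", "ec", ... */
	int	    rp_nfail;	/* child failures the layout tolerates */

	/* split a logical I/O into per-child I/Os; returns how many */
	int  (*rp_map)(uint64_t lba, size_t len,
	         struct child_io *ios, int max_ios);

	/* fill parity/share buffers before the columns are written */
	void (*rp_generate)(struct child_io *ios, int nios);

	/* rebuild missing columns after a degraded read; 0 on success */
	int  (*rp_reconstruct)(struct child_io *ios, int nios,
	         const int *missing, int nmissing);
} redundancy_policy_t;
```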
This was accidentally closed. However, I'm going to leave it closed because a version of this kind of functionality is being implemented in #3497.
The raidz vdev driver already uses a form of Reed-Solomon code; see the comments under the license header in https://github.com/zfsonlinux/zfs/blob/master/module/zfs/vdev_raidz.c. But the raidz vdev driver only supports up to triple parity, which isn't a limit of Reed-Solomon itself. The draid vdev driver reuses the raidz parity code, so it can't go beyond triple parity either. A HW-accelerated EC library would be key to performance. There are plenty of userspace libraries, e.g. Intel ISA-L, but I'm not sure whether any exist in the kernel.
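For reference, here is a minimal, self-contained sketch (not ZFS code) of what generic multi-parity generation looks like over GF(2^8) with the polynomial 0x11d, the field commonly used for RAID-6-style codes. Parity row 0 reduces to plain XOR (P) and row 1 to the RAID-6 Q syndrome; note that the naive Vandermonde coefficients used here are not guaranteed to remain MDS for arbitrarily many parity rows, so a production implementation would want a provably MDS construction (e.g. a Cauchy matrix) and a SIMD-accelerated library such as Intel ISA-L for throughput.

```c
#include <stdint.h>
#include <stdio.h>
#include <string.h>

static uint8_t gf_exp[512];
static uint8_t gf_log[256];

static void
gf_init(void)
{
	int x = 1;
	for (int i = 0; i < 255; i++) {
		gf_exp[i] = (uint8_t)x;
		gf_log[x] = (uint8_t)i;
		x <<= 1;
		if (x & 0x100)
			x ^= 0x11d;		/* reduce modulo the field polynomial */
	}
	for (int i = 255; i < 512; i++)		/* duplicate so gf_mul needs no mod */
		gf_exp[i] = gf_exp[i - 255];
}

static uint8_t
gf_mul(uint8_t a, uint8_t b)
{
	if (a == 0 || b == 0)
		return (0);
	return (gf_exp[gf_log[a] + gf_log[b]]);
}

/*
 * Compute m parity blocks over k data blocks of len bytes each.  The
 * coefficient for parity row j and data column i is (2^i)^j, i.e. the
 * straightforward extension of the usual P/Q/R pattern.
 */
static void
ec_encode(int k, int m, size_t len, const uint8_t **data, uint8_t **parity)
{
	for (int j = 0; j < m; j++) {
		memset(parity[j], 0, len);
		for (int i = 0; i < k; i++) {
			uint8_t c = gf_exp[(i * j) % 255];	/* (2^i)^j */
			for (size_t b = 0; b < len; b++)
				parity[j][b] ^= gf_mul(c, data[i][b]);
		}
	}
}

int
main(void)
{
	enum { K = 8, M = 5, LEN = 16 };	/* 8 data + 5 parity, beyond raidz3 */
	uint8_t d[K][LEN], p[M][LEN];
	const uint8_t *dp[K];
	uint8_t *pp[M];

	gf_init();
	for (int i = 0; i < K; i++) {
		for (int b = 0; b < LEN; b++)
			d[i][b] = (uint8_t)(i * 31 + b);	/* toy data */
		dp[i] = d[i];
	}
	for (int j = 0; j < M; j++)
		pp[j] = p[j];

	ec_encode(K, M, LEN, dp, pp);
	for (int j = 0; j < M; j++)
		printf("parity %d, byte 0: %02x\n", j, (unsigned)p[j][0]);
	return (0);
}
```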
If @behlendorf doesn't mind, I'll reopen this feature request. As @thegreatgazoo stated, draid doesn't deliver the main part of this request: surviving the failure of more than any 3 drives.
Indeed, erasure codes would allow arbitrary RAID-Zx for any (sane) value of x. From a practical standpoint, though, very wide RAID-Z arrays tend to perform poorly, so users are discouraged from making them too large even without the CPU penalty of generic erasure codes. I'm worried about the practical implications of allowing arbitrary arrays: 30 disks with 6 parity drives is not going to perform well for many workloads.
And isn't their implementation of erasure codes one of the reasons that Ceph arrays (even single-node ones) with erasure coding enabled are awkwardly slow, including rebuilds? How would this work out in combination with draid (which is actually focused on fixing some current performance bottlenecks)? Maybe it would be worthwhile for someone to paint a professional use case where this feature is a must and describe the consequences of it not currently being an option. It looks to me like it's sort of a solution looking for a (very niche) problem...
What would be the possibility of doing this within ZFS?
http://www.networkcomputing.com/deduplication/229500204?pgno=2
If the data is erasure-coded into N shares distributed across at least H distinct storage units, then the data can be recovered from any K of these units -- therefore only the failure of H-K+1 units can make the data unavailable.
This means, for example, that a storage server using the erasure-code equivalent of RAID1 across twelve disks can survive ANY SIX disks dying. In conventional RAID1, the loss of one leg (only two disks) is enough to offline the array.
Does ZFS's layering get in the way, or is it possible, at least in principle, to implement it?
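Plugging the twelve-disk example into the formula quoted above: with H = 12 units and any K = 6 sufficient for recovery, the array tolerates H - K = 6 failures, and only the seventh failure (H - K + 1 = 7) can make data unavailable. A trivial check of that arithmetic:

```c
#include <stdio.h>

/* Fault-tolerance arithmetic from the quoted paragraph: shares spread
 * over H distinct units, any K of which suffice to rebuild the data. */
int
main(void)
{
	int H = 12;	/* storage units, the twelve-disk example above */
	int K = 6;	/* units needed for recovery (RAID1-equivalent rate) */

	printf("tolerates any %d failed units; %d failures can lose data\n",
	    H - K, H - K + 1);	/* prints 6 and 7 */
	return (0);
}
```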