PAX: size overflow detected in function zil_itx_create #2505

mrobbetts · 2014-07-17T16:46:34Z

Since updating to 0.6.3 (also kernel 3.14.5-hardened on Gentoo) I am seeing some of these in my dmesg:

Jul 17 09:39:15 copper kernel: [126911.127543] PAX: size overflow detected in function zil_itx_create /var/tmp/portage/sys-fs/zfs-kmod-0.6.3/work/zfs-zfs-0.6.3/module/zfs/../../module/zfs/zil.c:1179 cicus.137_21 min, count: 2
Jul 17 09:39:15 copper kernel: [126911.127553] CPU: 0 PID: 11210 Comm: rsync Tainted: P           O 3.14.5-hardened-r2 #4
Jul 17 09:39:15 copper kernel: [126911.127557] Hardware name: HP ProLiant MicroServer, BIOS O41     07/29/2011
Jul 17 09:39:15 copper kernel: [126911.127560]  ffff88025f7c1180 ffffc90085a97c18 ffffffff817c065e ffff88041fc0db20
Jul 17 09:39:15 copper kernel: [126911.127565]  0000000000000048 ffffc90085a97c28 ffffffff8111e8b3 ffffc90085a97c48
Jul 17 09:39:15 copper kernel: [126911.127570]  ffffffffa031fdc6 ffff8802da6cf078 ffff88025f7c1180 ffffc90085a97c98
Jul 17 09:39:15 copper kernel: [126911.127574] Call Trace:
Jul 17 09:39:15 copper kernel: [126911.127585]  [<ffffffff817c065e>] dump_stack+0x46/0x5e
Jul 17 09:39:15 copper kernel: [126911.127592]  [<ffffffff8111e8b3>] report_size_overflow+0x24/0x2e
Jul 17 09:39:15 copper kernel: [126911.127617]  [<ffffffffa031fdc6>] zil_itx_create+0x48/0x9d [zfs]
Jul 17 09:39:15 copper kernel: [126911.127639]  [<ffffffffa030c2b7>] zfs_log_remove+0x80/0xca [zfs]
Jul 17 09:39:15 copper kernel: [126911.127658]  [<ffffffffa0315ffe>] zfs_remove+0x311/0x3b5 [zfs]
Jul 17 09:39:15 copper kernel: [126911.127663]  [<ffffffff817c7b2a>] ? _raw_spin_lock+0x9/0x11
Jul 17 09:39:15 copper kernel: [126911.127680]  [<ffffffffa0329192>] zpl_fallocate_common+0x485/0x5f9 [zfs]
Jul 17 09:39:15 copper kernel: [126911.127685]  [<ffffffff81126db4>] vfs_unlink+0x92/0xeb
Jul 17 09:39:15 copper kernel: [126911.127689]  [<ffffffff81126f5e>] do_unlinkat+0x151/0x265
Jul 17 09:39:15 copper kernel: [126911.127695]  [<ffffffff811292c7>] SyS_unlink+0x11/0x19
Jul 17 09:39:15 copper kernel: [126911.127699]  [<ffffffff817ce89e>] system_call_fastpath+0x16/0x1b

This is obviously a 'hardened' (64-bit) kernel with PaX enabled. I saw it most recently when I tried to eix-sync my Gentoo installation (with /usr/portage/ stored on a zfs file system). That errored out with:

...
receiving incremental file list
rsync: writefd_unbuffered failed to write 8 bytes to message fd [receiver]: Broken pipe (32)
rsync error: error in rsync protocol data stream (code 12) at io.c(1532) [receiver=3.0.9]

Is this a problem with ZoL, or my configuration?

The text was updated successfully, but these errors were encountered:

behlendorf · 2014-07-18T00:02:00Z

@mrobbetts PAX may be reporting a legitimate issue here. However, the code it's complaining about looks reasonable to me and I don't immediately see an overflow. Perhaps @ryao can better interpret the PAX warning.

chrisrd · 2014-07-18T00:54:06Z

This looks like a reasonable explanation of the PAX stuff:

https://forums.grsecurity.net/viewtopic.php?f=7&t=3043

I also don't see how it could overflow, everything is a constant (sizeof() or offsetof()) apart from the strlen(name), and even if name somehow isn't null terminated there's no way there won't be a null byte somewhere within size_t bytes:

void
zfs_log_remove(zilog_t *zilog, dmu_tx_t *tx, uint64_t txtype,
        znode_t *dzp, char *name, uint64_t foid)
{
   size_t namesize = strlen(name) + 1;
   itx = zil_itx_create(txtype, sizeof (*lr) + namesize);
}

itx_t *
zil_itx_create(uint64_t txtype, size_t lrsize)
{       
   lrsize = P2ROUNDUP_TYPED(lrsize, sizeof (uint64_t), size_t);
   itx = kmem_alloc(offsetof(itx_t, itx_lr) + lrsize,
      KM_PUSHPAGE | KM_NODEBUG);
}

Perhaps the P2ROUNDUP_TYPED macro is confusing the overflow detector due to sign extensions or something?

#define P2ROUNDUP_TYPED(x, align, type) \
        (-(-(type)(x) & -(type)(align)))

ryao · 2014-07-18T01:45:19Z

@behlendorf I would need to reproduce it locally. I will not have time to do that until the weekend at the earliest.

chrisrd · 2014-07-18T09:27:59Z

Looks like a PAX and/or gcc (4.7.2) optimisation bug.

A test program exhibits the PAX error when compiled w/ -O, and has no error without -O:

$ gcc -fplugin=linux/tools/gcc/size_overflow_plugin/size_overflow_plugin.so test.c -o test -O 2> /dev/null && ./test
SIZE_OVERFLOW: size overflow detected in function main test.c:25 cicus.6_14 min, count: 2

$ gcc -fplugin=linux/tools/gcc/size_overflow_plugin/size_overflow_plugin.so test.c -o test 2> /dev/null && ./test
exit ok

The test program, based on the examples in https://forums.grsecurity.net/viewtopic.php?f=7&t=3043 :

#include <stdint.h>
#include <stdio.h>

extern void *malloc(size_t size) __attribute__((size_overflow(1)));

void * __attribute__((size_overflow(1))) coolmalloc(size_t size)
{
        return malloc(size);
}

void report_size_overflow(const char *file, unsigned int line, const char *func, const char *ssa_name)
{
        printf("SIZE_OVERFLOW: size overflow detected in function %s %s:%u %s", func, file, line, ssa_name);
        fflush(stdout);
        _exit(1);
}

#define P2ROUNDUP_TYPED(x, align, type) \
        (-(-(type)(x) & -(type)(align)))

int main(int argc, char *argv[])
{
  size_t namesize = strlen(argv[0]);
  namesize = P2ROUNDUP_TYPED(namesize, sizeof (uint64_t), size_t);
  coolmalloc(20 + namesize);
}

Briefly, to build the size_overflow_plugin.so gcc plugin:

git checkout v3.14.12
git checkout -b 3.14.12-pax
wget http://grsecurity.net/stable/grsecurity-3.0-3.14.12-201407170638.patch
patch -p1 < grsecurity-3.0-3.14.12-201407170638.patch
make oldconfig  # enable the PAX stuff, in particular CONFIG_PAX_SIZE_OVERFLOW
make  # can kill it once it's done with tools/gcc/size_overflow_plugin/

chrisrd · 2014-07-21T23:08:14Z

FYI, reported to the PAX/grsecurity development forum:

User, code, gcc or grsecurity error?
https://forums.grsecurity.net/viewtopic.php?f=1&t=4016

...no response as yet.

chrisrd · 2014-07-28T07:26:54Z

The response from "PaX Team":

this is a sort of false positive vs. 'bad' code as it relies on integer overflow to produce the desired results (e.g., the unary minus operator makes no sense on an unsigned type since the resulting negative value cannot be represented in the unsigned type and it's only the C integer conversion rules that make this work but that doesn't make the code any more readable/nice/etc). in situations like this we recommend rewriting the 'smart' code to a form that doesn't rely on overflow.

I.e. it's a limitation (or maybe a deliberate design choice) of the PAX overflow detector rather than a problem in the ZoL code (unless you consider poor code clarity a bug).

mrobbetts · 2014-07-30T16:31:28Z

@chrisrd Thanks for looking into this for me.

Would you recommend simply disabling PaX for use with ZoL, then? I've done that in the mean time, and everything appears to be working correctly.

chrisrd · 2014-07-31T00:00:13Z

@mrobbetts I guess it depends on how strongly you feel about using a PaX hardened kernel. Disabling PaX would be easiest. Selectively turning off PaX for the ZoL compile, if possible, could be an option. @ryao likely has a better idea than myself.

Overall, I'm not sure what the best long term solution would be. As pointed out by "PaX Team" in the grsecurity link above, there's an argument the P2ROUNDUP_TYPED and related macros should be modified to not do "tricky stuff" (or at least not do tricky stuff that trips up grsecurity). On the other hand, the "tricky stuff" is using standards-defined C behaviour so there's an argument this is a bug in grsecurity's overflow detector and should be fixed there. On the third hand, "Pax Team" has provided an alternative implementation of P2ROUNDUP_TYPED (similar work would probably need to be done for the related macros) which likely won't affect performance to any measurable degree so the issue can be addressed in ZoL.

In the end it's going to boil down to who cares the most: personally I don't use a PaX hardened kernel so it's not an issue for me. Someone on the PaX side using ZoL might be encouraged to look into fixing it in PaX, or someone on the ZoL side who's interested in using PaX might fix it here (@ryao, hint, hint :-) ).

rennhak · 2014-11-28T20:54:40Z

What is the best way to solve this in the meantime? Fallback to 0.6.2?

rennhak · 2014-11-30T01:30:52Z

A build with 0.6.2 and Linux 3.16.5 and PaX/Grsec fails with

Makefile:973: recipe for target 'fs' failed
make[2]: *** [fs] Error 2

rennhak · 2014-12-01T18:23:24Z

Information for anyone stumbling over this. 0.6.2 does not work on 3.16.5 or later. For that you have to use 0.6.3 or later.

gcs-github · 2014-12-09T14:17:18Z

As a sysadmin, I'm of the opinion that we should be able to use PaX with ZFS. I administer machines using ZFS with machines using PaX, and I think it's unfortunate that we'd have to choose between the added reliability of ZFS and the added security of PaX.

How can I help?

chrisrd · 2014-12-09T23:16:23Z

@gcs-github If you have C experience, I'd recommend looking at implementing the alternative P2ROUNDUP_TYPED and associated macros as recommended by PaX Team upthread.

Or, possibly even better, see if the linux source has any "round up" macros/functions that can be used instead - I'd be pretty surprised if it doesn't!

Of some related interest, the issue of detecting integer overflows is being discussed on LKML:

http://thread.gmane.org/gmane.linux.kernel/1838832

gcs-github · 2014-12-10T11:34:01Z

@chrisrd Sure, I'll look into it! :)

That'll be the first time for me getting inside the code of ZFS (or even kernel-related software in general). Do you have any pointers to set up a proper dev environment and the proper test suite for it?

chrisrd · 2014-12-10T23:43:25Z

@gcs-github Sorry, not really. Prior to actually testing inside the kernel (which would be better done in a virtual machine) you should use ztest: see cmd/ztest in the ZFS repository. I'd be looking for existing stuff in the linux source before writing anything, but if it's necessary to actually write code, I'd be writing an external test harness to exercise the rounding and interaction with PaX before dropping it into ZFS.

N8Fear · 2014-12-11T08:13:01Z

Just as a pointer: there is the #define round_up(x, y) (and round_down) macro in include/linux/kernel.h. This macro is used in the grsecurity patch (just greped for it: it actually was there before, but wasn't replaced by something different), so I guess it's implementation should be pax/grsec-compatible.

alexxy · 2014-12-11T14:54:35Z

I get same issue

zfs is 0.6.3 with patches (gentoo version by ryao)

uname -a

Linux mdstor 3.17.6-hardened #2 SMP Thu Dec 11 14:15:29 MSK 2014 x86_64 Intel(R) Xeon(R) CPU E5-2660 v2 @ 2.20GHz GenuineIntel GNU/Linux

LANG=C gcc -v

Using built-in specs.
COLLECT_GCC=/usr/x86_64-pc-linux-gnu/gcc-bin/4.8.3/gcc
COLLECT_LTO_WRAPPER=/usr/libexec/gcc/x86_64-pc-linux-gnu/4.8.3/lto-wrapper
Target: x86_64-pc-linux-gnu
Configured with: /var/tmp/portage/sys-devel/gcc-4.8.3/work/gcc-4.8.3/configure --host=x86_64-pc-linux-gnu --build=x86_64-pc-linux-gnu --prefix=/usr --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/4.8.3 --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/4.8.3/include --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.3 --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.3/man --infodir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.3/info --with-gxx-include-dir=/usr/lib/gcc/x86_64-pc-linux-gnu/4.8.3/include/g++-v4 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/4.8.3/python --enable-languages=c,c++ --enable-obsolete --enable-secureplt --disable-werror --with-system-zlib --enable-nls --without-included-gettext --enable-checking=release --with-bugurl=https://bugs.gentoo.org/ --with-pkgversion='Gentoo Hardened 4.8.3 p1.1, pie-0.5.9' --enable-esp --enable-libstdcxx-time --enable-shared --enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu --disable-multilib --with-multilib-list=m64 --disable-altivec --disable-fixed-point --enable-targets=all --disable-libgcj --enable-libgomp --disable-libmudflap --disable-libssp --disable-libquadmath --enable-lto --without-cloog --disable-libsanitizer
Thread model: posix
gcc version 4.8.3 (Gentoo Hardened 4.8.3 p1.1, pie-0.5.9)

[ 59.975090] PAX: size overflow detected in function zil_itx_create /var/tmp/portage/sys-fs/zfs-kmod-0.6.3-r1/work/zfs-zfs-0.6.3/module/zfs/../../module/zfs/zil.c:1187 cicus.150_20 min, count: 2
[ 59.975095] CPU: 12 PID: 3252 Comm: touch Tainted: P O 3.17.6-hardened #2
[ 59.975096] Hardware name: Supermicro X9DRW/X9DRW, BIOS 3.0c 03/24/2014
[ 59.975098] ffff88105002f900 ffffc90031ebbac0 ffffffffa5ae1c35 0000000000000070
[ 59.975101] ffffc90031ebbad0 ffffffffa521d424 ffffc90031ebbaf0 ffffffffc08fe435
[ 59.975104] 0000000000000000 ffff88105002f900 ffffc90031ebbb60 ffffffffc08e68d8
[ 59.975106] Call Trace:
[ 59.975117] [] dump_stack+0x45/0x5c
[ 59.975123] [] report_size_overflow+0x24/0x30
[ 59.975136] [] zil_itx_create+0xa5/0xb0 [zfs]
[ 59.975147] [] zfs_log_create+0xd8/0x3c0 [zfs]
[ 59.975157] [] zfs_create+0x61a/0x6f0 [zfs]
[ 59.975164] [] zpl_vap_init+0x578/0x930 [zfs]
[ 59.975170] [] ? security_inode_permission+0x24/0x40
[ 59.975174] [] vfs_create+0xa2/0xd0
[ 59.975177] [] do_last.isra.37+0x1041/0x1200
[ 59.975180] [] ? security_file_alloc+0x1e/0x30
[ 59.975182] [] path_openat+0xb4/0x640
[ 59.975186] [] ? handle_mm_fault+0x74e/0xef0
[ 59.975189] [] ? mmap_region+0x193/0x6b0
[ 59.975192] [] do_filp_open+0x35/0x90
[ 59.975195] [] ? __alloc_fd+0x7b/0x130
[ 59.975198] [] do_sys_open+0x124/0x290
[ 59.975200] [] SyS_open+0x19/0x30
[ 59.975203] [] tracesys+0xcc/0xd1

shizmob · 2014-12-21T22:15:18Z

In case someone would like to test, I have an (untested!) potential fix in https://github.com/Shizmob/spl/commit/32f4ceca6f977121b639fb2f89111792177746a3. Any input would be appreciated.

chrisrd · 2014-12-22T01:09:52Z

HI @shizmob , your P2ROUNDUP_TYPED() ignores the type argument: why is this safe (if it's safe, why do they have P2ROUNDUP_TYPED() vs P2ROUNDUP() in the first place)?

shizmob · 2014-12-22T01:16:42Z

I believe it's safe because, looking at the Linux implementation of round_up, it uses GCC's __typeof__ extension to receive the type to cast to: http://lxr.free-electrons.com/source/include/linux/kernel.h?v=3.2#L54. Presumably Solaris doesn't want to use this nonstandard extension, so they defined different APIs.

chrisrd · 2014-12-22T01:33:53Z

However your P2ROUNDUP_TYPED() will always use the type of the first argument, x, whereas the original uses an arbitrary(?) type passed in as the third argument: that's an interface change. Are you able to confirm the new version produces precisely the same result as the old usage in all relevant circumstances (even for unusual architectures and compilers)?

shizmob · 2014-12-22T01:58:43Z

I checked all usages of P2ROUNDUP_TYPED() in zfs and spl, and the type argument seems to match x's type in all cases except for one (modules/zfs/zil.c:1058), and I'm fairly sure the type difference does not matter there (4096 fits just fine in an int, the C specification mandates int to be at least two bytes wide).

As far as unusual compilers go, I'm not quite sure what you mean seeing as Linux only compiles with gcc and icc in the first place, to my knowledge. However, as noted above, a compiler/architecture(?) would have to be violating the C specification for this to break.

Also of note is that the change I made to spl has to be duplicated in libspl in the zfs repository.

perfinion · 2015-10-07T05:20:04Z

I went through the source and found all the places that use the P2ROUNDUP macro to see what kind of issues it might run into.

Original macros:

#define P2ROUNDUP_TYPED(x, align, type) (-(-(type)(x) & -(type)(align)))

All usages are completely unsigned, so the -x doesnt entirely make sense

module/zfs/zil.c:1061: wsz = P2ROUNDUP_TYPED(lwb->lwb_nused, ZIL_MIN_BLKSZ, uint64_t);
    uint64_t = int, 4096ULL, uint64_t
module/zfs/zil.c:1105: dlen = P2ROUNDUP_TYPED(lrw->lr_length, sizeof(uint64_t), uint64_t);
    uint64_t = uint64_t, size_t, uint64_t
module/zfs/zil.c:1194: lrsize = P2ROUNDUP_TYPED(lrsize, sizeof(uint64_t), size_t);
    size_t = size_t, size_t, size_t
module/zfs/sa.c:503: blocksize = P2ROUNDUP_TYPED(size, SPA_MINBLOCKSIZE, uint32_t);
    uint32_t = uint32_t, ULL, uint32_t
module/zfs/zio_checksum.c:169: size = P2ROUNDUP_TYPED(zilc->zc_nused, ZIL_MIN_BLKSZ, uint64_t)
    uint64_t = uint64_t, 4096ULL, uint64_t
module/zfs/zio_checksum.c:225: size = P2ROUNDUP_TYPED(nused, ZIL_MIN_BLKSZ, uint64_t);
    uint64_t = uint64_t, 4096ULL, uint64_t

#define P2ROUNDUP(x, align) (-(-(x) & -(align)))

Lots of places use P2ROUNDUP() but a lot of these are using other #define
constants so im not sure they should really be un-typed. It seems safer if all
of them were just cast to use typeof(x)

Proposed new macro:
Requirement is that x is never 0. otherwise the x-1 will underflow which the
PaX plugin will again catch. Since these are all sizes to read or lengths
they shouldnt really be 0 so pretty sure if there is a 0 then it is a real
error. If x is allowed to be 0 then we need to add a little to the macro.

Linux has the following which takes care of the typing already:

#define __round_mask(x, y) ((__typeof__(x))((y)-1))
#define round_up(x, y) ((((x)-1) | __round_mask(x, y))+1)

Is there ever a time when we want to rely on the type of the mask? if not then
just merging both into the same macro seems like a reasonable option.

#define P2ROUNDUP(x, align)    ((((x) - 1) | ((align) - 1)) + 1)
#define P2ROUNDUP_TYPED(x, align, type)    ((type)((x)-1) | (type)((y)-1))+1)
If we want P2ROUNDUP to just use the original type, then we could do:
#define P2ROUNDUP(x, align)    P2ROUNDUP_TYPED(x, align, __typeof__(x))

I am going to look through the other macros too before I prepare a proper pull request since a couple of the others look like they also have similar issues. Thoughts so far?

Also, this isnt a build issue, the gcc plugin is used at buildtime but these issues trigger during runtime when built with a PaX kernel

chrisrd · 2015-10-08T02:00:50Z

@perfinion keep in mind my upthread comment about my standing in the ZoL community, but to my mind changing these macros to make them PaX compatible requires a very good understanding of the sign extension and type promotion rules according to the C standards. I don't have that understanding myself, but your comment "All usages are completely unsigned, so the -x doesnt entirely make sense" makes me think you should be looking into this much further. Likewise, regarding "Is there ever a time when we want to rely on the type of the mask?", my preference would be to be referencing C standards about why the type of the mask doesn't matter.

perfinion · 2015-10-09T04:34:16Z

@chrisrd so for P2ROUNDUP_TYPED, there are no uses of it that are signed, so sign extension does not matter. The others it might matter i'll double check.

And by "All usages are completely unsigned, so the -x doesnt entirely make sense" I meant that taking the negative of an unsigned number doesnt entirely make logical sense. In C it ends up the equivalent of (~x+1) but if thats what is wanted, then better to just do that since it "makes sense" for both signed and unsigned.

And about "Is there ever a time when we want to rely on the type of the mask?" I did not mean in regard to the macros. I meant are there places in the code that use it relying on the type of the mask? Because from what I could see, there were several places that used the untyped macro but I think they should instead be using the typed one. An integer constant without any qualifiers is signed in C, and is used in many places where the other type is unsigned which seems like an error. casting to the type of x seems more correct. If it specifically needs the type of the mask then it is much clearer to use the _TYPED version

The issue is that currently there are many instances eg P2ROUNDUP(size, 8), that is technically P2ROUNDUP(size, (signed int)8), but having a negative size is mostly likely a bug.

chrisrd · 2015-10-09T05:57:56Z

@perfinion It seems my usage of "sign extension" was an error, it should have been something like "negation of unsigned quantities" - thanks for pointing that out. My point was that the negation of an unsigned quantity is a well specified operation in C and I was concerned you hadn't reached that level of understanding. I now understand your comment was more that, in the real world, negating an unsigned quantity doesn't make a lot of sense - which I agree with. (On the other hand, we're not the real world here, we're in "C" world: here be monsters aplenty!)

Overall, my concern is that any alternate form we come up with for these macros should be guaranteed, by the C specification, to behave identically to the original macros (apart from not triggering the PaX overflow detection of course!), otherwise we risk having subtly different behaviour to the other ZFS implementations which could be exceedingly difficult to track down.

I'm more than willing to accept that the current macros, and/or their usages, may actually be "wrong" in some way (e.g. your example of P2ROUNDUP(size, 8)). But if we come up with a better way of doing things which isn't guaranteed to be equivalent to the existing implementation, we need to get buyin from "upstream" (currently Illumos, AFAIUI) to avoid getting bitten by a C monster.

perfinion · 2015-10-18T20:07:35Z

okay @chrisrd, I finally figured out how to prove they are equivalent. dump this in test.c, there are no includes or anything.

// Set the type everywhere, try signed/unsigned and int/long/long long
#define thetype unsigned int

thetype old(thetype x, thetype align)
{
    // The original SPL macro
    return -(-(x) & -(align));
}

thetype negate(thetype x, thetype align)
{
    // -x == twos complement negation which is equivalent to ~x+1
    return ~(-(x) & -(align)) + 1;
}

thetype demorgan(thetype x, thetype align)
{
    // Demorgans law: ~(A & B) ==> ~A | ~B
    return ((~(-x)) | (~(-align))) + 1;
}

thetype new(thetype x, thetype align)
{
    // The new macro that does not trigger the overflow
    return ((((x) - 1) | ((align) - 1)) + 1);
}

int main(void)
{
    old((thetype)1234, (thetype)8);
    negate((thetype)1234, (thetype)8);
    demorgan((thetype)1234, (thetype)8);
    new((thetype)1234, (thetype)8);
    return 0;
}

now compile and dump the assembly. the stack options are because my hardened compiler defaults to enabling them and the assembly is more annoying to read with them enabled. Also play around with -O[0-3] and -Os. They all work the same but give slightly different outputs.

$ gcc -O1 -ggdb -fstack-check=no -fno-stack-protector test.c -o test
$ gdb --batch -ex "disas old" -ex "disas negate" -ex "disas demorgan" -ex "disas new" test
Dump of assembler code for function old:
   0x0000000000000760 <+0>: mov    %edi,%eax
   0x0000000000000762 <+2>: neg    %eax
   0x0000000000000764 <+4>: neg    %esi
   0x0000000000000766 <+6>: and    %esi,%eax
   0x0000000000000768 <+8>: neg    %eax
   0x000000000000076a <+10>:    retq   
End of assembler dump.
Dump of assembler code for function negate:
   0x000000000000076b <+0>: mov    %edi,%eax
   0x000000000000076d <+2>: neg    %eax
   0x000000000000076f <+4>:     neg    %esi
   0x0000000000000771 <+6>: and    %esi,%eax
   0x0000000000000773 <+8>: neg    %eax
   0x0000000000000775 <+10>:    retq   
End of assembler dump.
Dump of assembler code for function demorgan:
   0x0000000000000776 <+0>: lea    -0x1(%rdi),%eax
   0x0000000000000779 <+3>: sub    $0x1,%esi
   0x000000000000077c <+6>: or     %esi,%eax
   0x000000000000077e <+8>: add    $0x1,%eax
   0x0000000000000781 <+11>:    retq   
End of assembler dump.
Dump of assembler code for function new:
   0x0000000000000782 <+0>: lea    -0x1(%rdi),%eax
   0x0000000000000785 <+3>: sub    $0x1,%esi
   0x0000000000000788 <+6>: or     %esi,%eax
   0x000000000000078a <+8>: add    $0x1,%eax
   0x000000000000078d <+11>:    retq   
End of assembler dump.

So from old->negate, the twos complement is absolutely no change to the generated asm. negate->demorgan is just demorgans law which is fine too. Then from demorgan->new there is no difference in the generated asm so they are also the same. I think this should suffice that the new and old are equivalent. do try it on other compilers and see if we get the same. I tried clang too and it is exactly the same.

chrisrd · 2015-10-19T22:59:42Z

@perfinion Looks good to me! I still have one question though: with the new implementation, for x = 0, is there a difference in behaviour compared to a non-PaX system?

perfinion · 2015-10-24T05:56:05Z

both the new and old are the same for x = 0; old: 0 & anything == 0; and new: 0 -1 | anything + 1= -1 + 1 so they both end up just returning zero no matter what the align arg is. This will still trigger a pax size overflow for the new one but I think that is good because zero is almost definitely wrong.

ryao · 2015-10-26T23:09:09Z

@chrisrd The two are equivalent for all inputs. I sketched out a mathematical proof of equivalence after hearing @perfinion's explanation in IRC earlier today:

Here are axioms:

a. (-(x)) === (~((x) - 1)) === (~(x) + 1) under two's complement
b. ~(x & y) === ((~(x)) | (~(y))) under De Morgan's law
c. ~(~x) === x under the law of the excluded middle

Here is a proof:

0. (-(-(x) & -(align)))
1. (~(-(x) & -(align)) + 1) by a
2. (((~(-(x))) | (~(-(align)))) + 1) by b
3. (((~(~((x) - 1))) | (~(~((align) - 1)))) + 1) by a
4. (((((x) - 1)) | (((align) - 1))) + 1) by c

Then if we just clean up the unnecessarily parenthesis, we arrive at @perfinion's code.

#3949 (comment)

chrisrd · 2015-10-27T00:49:43Z

@ryao Yes, I saw your #3949 (comment) and appreciate the formalisation of @perfinion's logic at #2505 (comment).

However the mathematical proof doesn't take into account C's type promotion rules: i.e. for the different sequence of operations (at the C compiler level), do the C type promotion rules guarantee the new macro ends up with the same result as the original?

On the other hand I've basically come around to the view that, whilst insisting on guarantees of equivalence per the C spec might be the technically correct thing to do, we're just as likely (which is not very likely at all) to see differences in behaviour in the original macro between platforms due to differences between compilers and/or optimisation settings etc. I.e. it's not worth worrying too much about the exquisite details of the type promotion rules as long as we're comfortably sure the behaviour of the two macros is the same for all cases we care about.

I also have a slight concern regarding triggering the PaX overflow for the x = 0 case which would be a change in behaviour from non-PaX ZoL, per my comment at #2505 (comment):

Are there any guarantees in the ZFS code that x won't be zero? Or, if it is zero, is that an indication of an error or filesystem corruption etc. that we want the overflow detector to catch?

...but, like @perfinion, I'm reasonably sure the x = 0 case is going to mean something has already gone terribly wrong and triggering the PaX overflow isn't going to make things worse and may well help avoid filesystem damage further down the track. If we really cared about the x = 0 behaviour differences then the grsecurity thread previously referenced has a solution that only adds a few CPU cycles.

I wonder what the chances are of getting the updated macro into OpenZFS?

ryao · 2015-10-27T15:14:16Z

@chrisrd Good point about the type coercion rules not being included in the proof. One way to formally decide whether the type coercion rules pose a problem would seem to be to look at each of the axioms in the proof and answer the question "does the axiom break under C's type coercion rules?". If the answer is no for all of them, then the proof holds for type coercion. If the answer is yes for any of them, then the proof breaks.

The only axiom where I suspect anything might be different is the one from two's complement where the answer depends on how the compiler does coercion with the constant. If the constant is automatically coerced to the type of the variable, we are fine because whatever coercion that follows would be the same as it would have been between x and align under the original. If the constant is an int, which is the default in C, then we have two cases. Either we coerce between an unsigned integral type and a signed integral type, and/or we coerce between types of different lengths. In the former case, the coercion should be unsigned. In the latter case, the coercion should go to the larger width. The former case should be fine while the latter case poses a problem if the macro is used in a context where it is being assigned to a smaller bit width where we would coerce from a smaller width to a larger width and generate a warning. This would become an error with -Werror, but assuming that does not occur, the final result would otherwise be the same.

It should be said that I am assuming that type coercion is the same regardless of how expr1 op expr2 occurs in the compiler's abstraction syntax tree. For all possible op, wehther we have expr1 be type A and expr2 be type B or expr1 b type B and expr2 be type A, I assume that the result under C's type coercion rules would be the same. I believe this is a fair assumption, although for true formalism, this needs to be checked. I should also admit that I am doing some handwaving here WRT to the other axioms. My reasoning is that the & and | operators are equivalent in terms of type coercion while the ~ operator should have no effect on type coercion. These are also things that need to be checked for a truly formal proof.

Those caveats aside, I believe that the correct answer on equivalence when type coercion is taken into consideration should be how coercion of the width of the constant is handled in a compiler. I am inclined to think that a compiler would adopt the width of the first coercion that occurs since the 1 can safely fit in any integral type because that is how I would see myself handling this if I wrote a compiler front end (I actually have, but it was not a C front end and it just a college project). This needs some investigation, but even if the C specification provides favorable behavior, it is still possible for there to be compiler bugs in implementing it.

As for getting the updated macro into OpenZFS, I think the chances are good following a discussion about upstreaming this that I had yesterday in #illumos on the freenode IRC network.

ryao · 2015-10-28T12:54:20Z

Given that the only risk of non-equivalence should be a compiler warning, we could just assume it is equivalent unless the compiler warns otherwise. I could try formalizing my informal reasoning on C type coercion. It would be an interesting exercise for learning how to apply things that I learned in school, but blocking the merge of the patch on complete formal verification is overkill, so it should be fine to merge without that.

@behlendorf Do you see anything else that we need to check?

chrisrd · 2015-10-28T20:45:13Z

@ryao In sort, I agree this is fine to go in.

However I'm not sure what you're thinking about there: the "non-equivalence" of what objects, and what would the compiler would be warning about?

I don't think looking at the type promotion (whether coerced or automatic) involved in each step of the mathematical proof is going to be much use to us: the C compiler won't be following the proof to transform from the first macro to the second as the proof does, the compiler will be following whatever wild and woolly path the compiler writer came up, almost certainly whilst 3 days sleepless and keeping going by sheer willpower and ingestion of copious mind altering substances (the end results are generally indistinguishable from those produced by a wizard of optimisation), to get to the end point of rounding up the value.

The best we can hope for, for either macro, is the end result is in accordance with how a sane mind would interpret the rules of C.

That's basically why I'm now more relaxed about demanding guarantees of equivalence between the macros (i.e. macro to end result, not macro to macro): it would be nice if someone were able to rigorously show what should happen per the C type promotion rules for all possible types (we already have the mathematical side covered), but that would still not guarantee the compiler will actually follow the rules itself. (Sure, if we found the compiler wasn't following the rules we could try get it fixed, but that doesn't help in the short to medium term.) I think the best we can do is ensure the new macro is sane and actually works as desired in the cases we care about now. The esoteric and currently non-existing cases I was previously expressing concern about (odd new types and/or strange new compilers being used) can be dealt with if and when they come about.

ryao · 2015-10-30T14:18:52Z

@chrisrd Is your concern the situation where GCC simultaneously miscompiles the new code and fails to issue a warning? I am not concerned about that possibility with the new macro anymore than I am concerned about that possibility with the old macro. While miscompilation is always a matter of concern, that deserves its own issue and a separate resolution in the form of a switch to a formally verified compiler such as the CompCert C compiler:

http://compcert.inria.fr/

That said, I consider the biggest risk to be that GCC will evaluate the expression formed by the macro to have a different integral type than the original in its Abstract Syntax Tree. If that occurs, the compiler will overzealously increase the bit width of the variable rather than coercing the bit width of the constant and/or will treat the constant as a signed type. Provided that the GCC follows the rules of C, the latter is not a problem while the former would only cause a problem should the width increase too much upon assignment. That should cause GCC to generate a compiler warning about an integral type being truncated upon assignment to an lval. This should occur in "semantic analysis" as shown in this diagram:

http://jcsites.juniata.edu/faculty/rhodes/lt/images/ccover4.gif

chrisrd · 2015-11-06T03:18:44Z

@ryao (apologies for the downtime...) my concern was that the new macro should produce the same end results as the old macro in all possible cases, according to the rules of C (including type promotion (TP) and expression evaluation ordering (EEO) etc.). What I was looking for was a proof along the lines of "ok, the original macro works like this, according to the rules of C (including TP and EEO), and the new macro works like this other, again according to the rules of C, and hence we see that the two produce the same results for all possible cases (assuming the compiler is strictly conforming)".

I'm somewhat more relaxed about wanting that rigorous approach now that I'm more familiar with the C type promotion rules (thanks to this issue) and the ways the macro is used in current code. I.e. I'm comfortable the new macro will work for current usage and reasonable future usage, and I think any future breakage (i.e. differences in output of the old and new macros) due to my hypothetical "odd new types and/or strange new compilers being used" are just as (un!)likely as any future breakage due to compiler bugs etc.

behlendorf · 2015-11-11T23:39:16Z

@chrisrd @perfinion @ryao thank you all for working on this. That patch looks good to me and you all agreed it's correctly I'll get it merged.

behlendorf · 2015-11-13T23:25:30Z

OK, after giving this a careful looking over I'm convinced it's correct too. I've merged it. Thank you everyone! Merged as:

openzfs/spl@8fc851b sysmacros: Make P2ROUNDUP not trigger int overflow

perfinion · 2015-11-16T05:02:12Z

@behlendorf Thanks for pulling this in!

you need to pull in the second pull request too. The header file from spl is in the zfs repo too: https://github.com/zfsonlinux/zfs/blob/master/lib/libspl/include/sys/sysmacros.h#L53-L81

This pull request does the exact same fix for that file too: #3949
It also removes the duplicated definition.

sysmacros: Make P2ROUNDUP not trigger int overflow The original P2ROUNDUP and P2ROUNDUP_TYPED macros contain -x which triggers PaX's integer overflow detection for unsigned integers. Replace the macros with an equivalent version that does not trigger the overflow. Axioms: A. (-(x)) === (~((x) - 1)) === (~(x) + 1) under two's complement. B. ~(x & y) === ((~(x)) | (~(y))) under De Morgan's law. C. ~(~x) === x under the law of excluded middle. Proof: 0. (-(-(x) & -(align))) original 1. (~(-(x) & -(align)) + 1) by A 2. (((~(-(x))) | (~(-(align)))) + 1) by B 3. (((~(~((x) - 1))) | (~(~((align) - 1)))) + 1) by A 4. (((((x) - 1)) | (((align) - 1))) + 1) by C Q.E.D. Signed-off-by: Jason Zaman <[email protected]> Reviewed-by: Chris Dunlop <[email protected]> Reviewed-by: Richard Yao <[email protected]> Signed-off-by: Brian Behlendorf <[email protected]> Closes openzfs/zfs#2505 Closes #488

behlendorf · 2015-11-17T00:17:15Z

@perfinion thanks for the reminder. I've just pulled it in.

The patches to fix this have been merged into master now. openzfs/zfs#2505 Package-Manager: portage-2.2.20.1

The original P2ROUNDUP and P2ROUNDUP_TYPED macros contain -x which triggers PaX's integer overflow detection for unsigned integers. Replace the macros with an equivalent version that does not trigger the overflow. Axioms: A. (-(x)) === (~((x) - 1)) === (~(x) + 1) under two's complement. B. ~(x & y) === ((~(x)) | (~(y))) under De Morgan's law. C. ~(~x) === x under the law of excluded middle. Proof: 0. (-(-(x) & -(align))) original 1. (~(-(x) & -(align)) + 1) by A 2. (((~(-(x))) | (~(-(align)))) + 1) by B 3. (((~(~((x) - 1))) | (~(~((align) - 1)))) + 1) by A 4. (((((x) - 1)) | (((align) - 1))) + 1) by C Q.E.D. Signed-off-by: Jason Zaman <[email protected]> Reviewed-by: Chris Dunlop <[email protected]> Reviewed-by: Richard Yao <[email protected]> Signed-off-by: Brian Behlendorf <[email protected]> Closes openzfs/zfs#2505 Closes openzfs#488

The original P2ROUNDUP and P2ROUNDUP_TYPED macros contain -x which triggers PaX's integer overflow detection for unsigned integers. Replace the macros with an equivalent version that does not trigger the overflow. Axioms: A. (-(x)) === (~((x) - 1)) === (~(x) + 1) under two's complement. B. ~(x & y) === ((~(x)) | (~(y))) under De Morgan's law. C. ~(~x) === x under the law of excluded middle. Proof: 0. (-(-(x) & -(align))) original 1. (~(-(x) & -(align)) + 1) by A 2. (((~(-(x))) | (~(-(align)))) + 1) by B 3. (((~(~((x) - 1))) | (~(~((align) - 1)))) + 1) by A 4. (((((x) - 1)) | (((align) - 1))) + 1) by C Q.E.D. Signed-off-by: Jason Zaman <[email protected]> Reviewed-by: Chris Dunlop <[email protected]> Reviewed-by: Richard Yao <[email protected]> Signed-off-by: Brian Behlendorf <[email protected]> Closes openzfs/zfs#2505 Closes #488

The patches to fix this have been merged into master now. openzfs/zfs#2505 Package-Manager: portage-2.2.20.1

behlendorf added this to the 0.7.0 milestone Jul 18, 2014

behlendorf added the Bug label Jul 18, 2014

rennhak mentioned this issue Nov 30, 2014

ZFS/SPL built inline of GNU/Linux 3.16.5 + Grsec + Pax + Xen fatal error #2921

Closed

behlendorf removed this from the 0.7.0 milestone Dec 19, 2014

behlendorf added Bug - Minor and removed Bug labels Dec 19, 2014

chrisrd mentioned this issue Dec 20, 2014

SPL requires CONFIG_PAX_SIZE_OVERFLOW and CONFIG_PAX_RANDKSTACK to be disabled openzfs/spl#422

Closed

This was referenced Oct 24, 2015

sys/sysmacros.h: Fix P2ROUNDUP_TYPED to not trigger integer overflow openzfs/spl#488

Closed

libspl sys/sysmacros.h: Fix P2ROUNDUP_TYPED to not trigger integer overflow #3949

Closed

behlendorf closed this as completed in openzfs/spl@8fc851b Nov 13, 2015

This was referenced Nov 18, 2015

Kernel Oops on 32-bit ARM #3475

Closed

BUG: unable to handle kernel paging request @ dmu_write (32-bit only?) #3447

Closed

perfinion added a commit to gentoo/gentoo that referenced this issue Nov 23, 2015

sys-kernel/spl: remove PAX_SIZE_OVERFLOW check

e5fce67

The patches to fix this have been merged into master now. openzfs/zfs#2505 Package-Manager: portage-2.2.20.1

gordan-bobic mentioned this issue Nov 26, 2015

Kernel Oops on 32-bit ARM #4048

Closed

sempervictus mentioned this issue Oct 4, 2017

ZFS Crypto: size overflow detected #6714

Closed

gentoo-repo-qa-bot pushed a commit to gentoo-mirror/linux-be that referenced this issue Oct 6, 2019

sys-kernel/spl: remove PAX_SIZE_OVERFLOW check

fa63c45

The patches to fix this have been merged into master now. openzfs/zfs#2505 Package-Manager: portage-2.2.20.1

PAX: size overflow detected in function zil_itx_create #2505

PAX: size overflow detected in function zil_itx_create #2505

Comments

mrobbetts commented Jul 17, 2014

behlendorf commented Jul 18, 2014

chrisrd commented Jul 18, 2014

ryao commented Jul 18, 2014

chrisrd commented Jul 18, 2014

chrisrd commented Jul 21, 2014

chrisrd commented Jul 28, 2014

mrobbetts commented Jul 30, 2014

chrisrd commented Jul 31, 2014

rennhak commented Nov 28, 2014

rennhak commented Nov 30, 2014

rennhak commented Dec 1, 2014

gcs-github commented Dec 9, 2014

chrisrd commented Dec 9, 2014

gcs-github commented Dec 10, 2014

chrisrd commented Dec 10, 2014

N8Fear commented Dec 11, 2014

alexxy commented Dec 11, 2014

uname -a

LANG=C gcc -v

shizmob commented Dec 21, 2014

chrisrd commented Dec 22, 2014

shizmob commented Dec 22, 2014

chrisrd commented Dec 22, 2014

shizmob commented Dec 22, 2014

perfinion commented Oct 7, 2015

chrisrd commented Oct 8, 2015

perfinion commented Oct 9, 2015

chrisrd commented Oct 9, 2015

perfinion commented Oct 18, 2015

chrisrd commented Oct 19, 2015

perfinion commented Oct 24, 2015

ryao commented Oct 26, 2015

chrisrd commented Oct 27, 2015

ryao commented Oct 27, 2015

ryao commented Oct 28, 2015

chrisrd commented Oct 28, 2015

ryao commented Oct 30, 2015

chrisrd commented Nov 6, 2015

behlendorf commented Nov 11, 2015

behlendorf commented Nov 13, 2015

perfinion commented Nov 16, 2015

behlendorf commented Nov 17, 2015