Add check for sched_yield in librt #31

worr · 2014-09-20T05:01:10Z

In Solaris, sched_yield lives in librt, rather than libc. This patch adds a
check which will link in librt if necessary.

In Solaris, sched_yield lives in librt, rather than libc. This patch adds a check which will link in librt if necessary.

cbsmith · 2014-09-21T23:46:21Z

This looks perfect. Small/compact/goes from "not work" to "work".

xfxyjwf · 2014-09-22T17:02:06Z

Thanks!

Add check for sched_yield in librt

Moved DynASM to third_party to comply with Google policy.

Loop body before: ``` .LBB0_2: add w8, w12, #1 cmp w8, w11 b.gt .LBB0_6 // Predictable branch, ends the loop .LBB0_3: add w12, w8, w11 add w12, w12, w12, lsr #31 asr w12, w12, #1 smaddl x0, w12, w10, x9 ldr w13, [x0] cmp w13, w1 b.lo .LBB0_2 // Unpredictable branch here! Will be hit 50/50 in prod b.ls .LBB0_7 // Predictable branch - ends the loop sub w11, w12, #1 cmp w8, w11 b.le .LBB0_3 // Predictable branch - continues the loop ``` Loop body after: ``` .LBB7_1: cmp w9, w11 b.hi .LBB7_4 // Predictable branch - ends the loop add w12, w9, w11 lsr w12, w12, #1 umaddl x0, w12, w8, x10 sub w14, w12, #1 ldr w13, [x0] cmp w13, w1 csel w11, w14, w11, hs csinc w9, w9, w12, hs b.ne .LBB7_1 // Predictable branch - continues the loop ``` PiperOrigin-RevId: 700864625

On a Cortex-A55 this resulted in a 28.30% reduction in CPU and wall time for the binary search path. Loop body before: ``` .LBB0_2: add w8, w12, #1 cmp w8, w11 b.gt .LBB0_6 // Predictable branch, ends the loop .LBB0_3: add w12, w8, w11 add w12, w12, w12, lsr #31 asr w12, w12, #1 smaddl x0, w12, w10, x9 ldr w13, [x0] cmp w13, w1 b.lo .LBB0_2 // Unpredictable branch here! Will be hit 50/50 in prod b.ls .LBB0_7 // Predictable branch - ends the loop sub w11, w12, #1 cmp w8, w11 b.le .LBB0_3 // Predictable branch - continues the loop ``` Loop body after: ``` .LBB7_1: cmp w9, w11 b.hi .LBB7_4 // Predictable branch - ends the loop add w12, w9, w11 lsr w12, w12, #1 umaddl x0, w12, w8, x10 sub w14, w12, #1 ldr w13, [x0] cmp w13, w1 csel w11, w14, w11, hs csinc w9, w9, w12, hs b.ne .LBB7_1 // Predictable branch - continues the loop ``` PiperOrigin-RevId: 700864625

On a Cortex-A55 this resulted in a 28.30% reduction in CPU and wall time for the binary search path. Loop body before: ``` .LBB0_2: add w8, w12, #1 cmp w8, w11 b.gt .LBB0_6 // Predictable branch, ends the loop .LBB0_3: add w12, w8, w11 add w12, w12, w12, lsr #31 asr w12, w12, #1 smaddl x0, w12, w10, x9 ldr w13, [x0] cmp w13, w1 b.lo .LBB0_2 // Unpredictable branch here! Will be hit 50/50 in prod b.ls .LBB0_7 // Predictable branch - ends the loop sub w11, w12, #1 cmp w8, w11 b.le .LBB0_3 // Predictable branch - continues the loop ``` Loop body after: ``` .LBB7_1: cmp w9, w11 b.hi .LBB7_4 // Predictable branch - ends the loop add w12, w9, w11 lsr w12, w12, #1 umaddl x0, w12, w8, x10 sub w14, w12, #1 ldr w13, [x0] cmp w13, w1 csel w11, w14, w11, hs csinc w9, w9, w12, hs b.ne .LBB7_1 // Predictable branch - continues the loop ``` PiperOrigin-RevId: 703213921

On a Cortex-A55 this resulted in a 28.30% reduction in CPU and wall time for the binary search path. Loop body before: ``` .LBB0_2: add w8, w12, #1 cmp w8, w11 b.gt .LBB0_6 // Predictable branch, ends the loop .LBB0_3: add w12, w8, w11 add w12, w12, w12, lsr #31 asr w12, w12, #1 smaddl x0, w12, w10, x9 ldr w13, [x0] cmp w13, w1 b.lo .LBB0_2 // Unpredictable branch here! Will be hit 50/50 in prod b.ls .LBB0_7 // Predictable branch - ends the loop sub w11, w12, #1 cmp w8, w11 b.le .LBB0_3 // Predictable branch - continues the loop ``` Loop body after: ``` .LBB7_1: cmp w9, w11 b.hi .LBB7_4 // Predictable branch - ends the loop add w12, w9, w11 lsr w12, w12, #1 umaddl x0, w12, w8, x10 sub w14, w12, #1 ldr w13, [x0] cmp w13, w1 csel w11, w14, w11, hs csinc w9, w9, w12, hs b.ne .LBB7_1 // Predictable branch - continues the loop ``` PiperOrigin-RevId: 703214356

Add check for sched_yield in librt

38b8494

In Solaris, sched_yield lives in librt, rather than libc. This patch adds a check which will link in librt if necessary.

xfxyjwf added a commit that referenced this pull request Sep 22, 2014

Merge pull request #31 from worr/bug/autoconf-sched-yield

a48c08a

Add check for sched_yield in librt

xfxyjwf merged commit a48c08a into protocolbuffers:master Sep 22, 2014

Harshasa mentioned this pull request Mar 12, 2015

v3.0.0-alpha-2 make check has 4 test failures on Windows/mingw #233

Closed

jakiechris mentioned this pull request Nov 20, 2017

[solved, pls close this issue] protobuf crashes on android 4.4.2 on Huawei p7 #3922

Closed

TeBoring pushed a commit to TeBoring/protobuf that referenced this pull request Jan 19, 2019

Merge pull request protocolbuffers#31 from haberman/third_party

36962f1

Moved DynASM to third_party to comply with Google policy.

arnow117 mentioned this pull request Apr 1, 2019

python: SIGSEGV when use PyImport_Import import symbol_database #5979

Closed

mrspirytus mentioned this pull request Dec 1, 2020

Linking error with on Ubuntu 18.04, Works on 20.04 #8107

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add check for sched_yield in librt #31

Add check for sched_yield in librt #31

worr commented Sep 20, 2014

cbsmith commented Sep 21, 2014

xfxyjwf commented Sep 22, 2014

Add check for sched_yield in librt #31

Add check for sched_yield in librt #31

Conversation

worr commented Sep 20, 2014

cbsmith commented Sep 21, 2014

xfxyjwf commented Sep 22, 2014