Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CI failures: 20180927-20181002 #18

Closed
joyeecheung opened this issue Oct 2, 2018 · 16 comments
Closed

CI failures: 20180927-20181002 #18

joyeecheung opened this issue Oct 2, 2018 · 16 comments

Comments

@joyeecheung
Copy link
Member

joyeecheung commented Oct 2, 2018

Failures in node-test-pull-request/17485 to node-test-pull-request/17581 that failed more than 2 PRs

Jenkins Failure

Build Failure

Reason Failed in Propagate Binaries phase (git-nodesource-update-reference)
Type BUILD_FAILURE
Failed PR 5 (nodejs/node#23112, nodejs/node#23083, nodejs/node#23077, nodejs/node#23075, nodejs/node#23115)
Appeared undefined
First CI https://ci.nodejs.org/job/node-test-pull-request/17485/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17489/
Example
Failed in Propagate Binaries phase (git-nodesource-update-reference)

Reason sh: line 42: pgrep: command not found
Type BUILD_FAILURE
Failed PR 2 (nodejs/node#22631, nodejs/node#23110)
Appeared test-osuosl-aix61-ppc64_be-1, test-osuosl-aix61-ppc64_be-2
First CI https://ci.nodejs.org/job/node-test-pull-request/17535/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17570/
Example
sh: line 42: pgrep: command not found

JSTest Failure

Reason sequential/test-gc-http-client-timeout
Type JS_TEST_FAILURE
Failed PR 12 (nodejs/node#23112, nodejs/node#23145, nodejs/node#23102, nodejs/node#23094, nodejs/node#23101, nodejs/node#23115, nodejs/node#23184, nodejs/node#23141, nodejs/node#23196, nodejs/node#23132, nodejs/node#23208, nodejs/node#23204)
Appeared test-rackspace-win2008r2-x64-3, test-rackspace-win2012r2-x64-5, test-macstadium-macos10.11-x64-1, test-digitalocean-alpine37_container-x64-1, test-rackspace-win2008r2-x64-6, test-digitalocean-ubuntu1804_container-x64-1, test-osuosl-aix61-ppc64_be-2, test-digitalocean-ubuntu1804_container-x64-2, test-digitalocean-ubuntu1604_sharedlibs_container-x64-6
First CI https://ci.nodejs.org/job/node-test-pull-request/17485/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17574/
Example
not ok 582 sequential/test-gc-http-client-timeout
  ---
  duration_ms: 120.84
  severity: fail
  exitcode: 1
  stack: |-
    timeout
    We should do 550 requests
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeou...

Reason parallel/test-net-connect-options-port
Type JS_TEST_FAILURE
Failed PR 11 (nodejs/node#23075, nodejs/node#23134, nodejs/node#23145, nodejs/node#23146, nodejs/node#23138, nodejs/node#23101, nodejs/node#23156, nodejs/node#23131, nodejs/node#23112, nodejs/node#23196, nodejs/node#23168)
Appeared test-rackspace-win2008r2-x64-5, test-packetnet-ubuntu1604-arm64-2, test-digitalocean-alpine37_container-x64-1, test-packetnet-ubuntu1604-arm64-1, test-softlayer-centos6-x64-1, test-digitalocean-alpine37_container-x64-2, test-rackspace-win2012r2-x64-3
First CI https://ci.nodejs.org/job/node-test-pull-request/17488/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17562/
Example
not ok 326 parallel/test-net-connect-options-port
  ---
  duration_ms: 120.87
  severity: fail
  exitcode: 1
  stack: |-
    timeout
  ...


Reason sequential/test-gc-http-client
Type JS_TEST_FAILURE
Failed PR 10 (nodejs/node#23112, nodejs/node#23146, nodejs/node#23131, nodejs/node#23189, nodejs/node#23151, nodejs/node#23164, nodejs/node#22381, nodejs/node#23156, nodejs/node#23204, nodejs/node#23206)
Appeared test-digitalocean-alpine37_container-x64-2, test-digitalocean-ubuntu1804_container-x64-2, test-rackspace-win2012r2-x64-3, test-rackspace-win2008r2-x64-4, test-digitalocean-alpine37_container-x64-1, test-rackspace-win2012r2-x64-8, test-joyent-ubuntu1604_sharedlibs_container-x64-3
First CI https://ci.nodejs.org/job/node-test-pull-request/17485/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17567/
Example
not ok 2307 sequential/test-gc-http-client
  ---
  duration_ms: 120.18
  severity: fail
  exitcode: -15
  stack: |-
    timeout
    We should do 300 requests
    Done: 0/300
    Collected: 0/45
    Done: 15/300
    Collected: 0/75
    Done: 30/300
    Collected: 0/90
    Done: 45/300
    Collected: 15/105
    Done: 60/300
    Collected: 30/120
    Done: 75/300
    Collected: 45/135
    Done: 90/300
    Collected: 60/150
    Done: 105/300
    Collected: 75/165
    Done: 120/300
    Collected: 90/180
    Done: 135/300
    Collected: 105/195
    Done: 150/300
    Collected: 120/210
    Done: 165/300
    Collected: 135/225
    Done: 180/300
    Collected: 150/240
    Done: 195/300
    Collected: 165/255
    Done: 210/300
    Collected: 180/270
    Done: 210/300
    Collected: 195/300
    Done: 210/300
    Collected: 210/300
    Done: 210/300
    Collected: 210/300
    Done: 210/300
    Collected: 210/300
    Done: 210/300
    Collected: 210/300
    Done: 210/300
    Collected: 210/300
    Done: 210/300
    Collec...

Reason sequential/test-gc-http-client-onerror
Type JS_TEST_FAILURE
Failed PR 10 (nodejs/node#23145, nodejs/node#23115, nodejs/node#23141, nodejs/node#23109, nodejs/node#23168, nodejs/node#23151, nodejs/node#23206, nodejs/node#23178, nodejs/node#23173, nodejs/ecmascript-modules#6)
Appeared test-rackspace-win2008r2-x64-6, test-rackspace-win2008r2-x64-3, test-digitalocean-alpine37_container-x64-1, test-rackspace-win2008r2-x64-5, test-rackspace-win2012r2-x64-2, test-digitalocean-alpine37_container-x64-2, test-rackspace-win2012r2-x64-7, test-rackspace-win2008r2-x64-4, test-rackspace-win2012r2-x64-8
First CI https://ci.nodejs.org/job/node-test-pull-request/17497/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17581/
Example
not ok 581 sequential/test-gc-http-client-onerror
  ---
  duration_ms: 120.130
  severity: fail
  exitcode: 1
  stack: |-
    timeout
    We should do 500 requests
    Done: 19/500
    Collected: 0/60
    Done: 80/500
    Collected: 0/120
    Done: 120/500
    Collected: 60/170
    Done: 180/500
    Collected: 119/230
    Done: 213/500
    Collected: 179/390
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Done: 213/500
    Collected: 213/500
    Don...

Reason sequential/test-gc-net-timeout
Type JS_TEST_FAILURE
Failed PR 10 (nodejs/node#23075, nodejs/node#23100, nodejs/node#23156, nodejs/node#23112, nodejs/node#23141, nodejs/node#23132, nodejs/node#23204, nodejs/node#22947, nodejs/node#23173, nodejs/ecmascript-modules#6)
Appeared test-rackspace-win2012r2-x64-4, test-rackspace-win2012r2-x64-1, test-digitalocean-fedora26-x64-1, test-softlayer-ubuntu1404-x64-1, test-digitalocean-ubuntu1604_sharedlibs_container-x64-4, test-digitalocean-alpine37_container-x64-1, test-digitalocean-ubuntu1804_container-x64-1, test-rackspace-win2008r2-x64-3, test-digitalocean-ubuntu1604_sharedlibs_container-x64-6, test-joyent-smartos17-x64-1
First CI https://ci.nodejs.org/job/node-test-pull-request/17488/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17581/
Example
not ok 582 sequential/test-gc-net-timeout
  ---
  duration_ms: 120.84
  severity: fail
  exitcode: 1
  stack: |-
    timeout
    We should do 500 requests
    Done: 110/500
    Collected: 0/143
    Done: 346/500
    Collected: 92/385
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collected: 92/500
    Done: 500/500
    Collecte...

Reason parallel/test-gc-http-client-connaborted
Type JS_TEST_FAILURE
Failed PR 8 (nodejs/node#23100, nodejs/node#23117, nodejs/node#23118, nodejs/node#23131, nodejs/node#23083, nodejs/node#23092, nodejs/node#23101, nodejs/node#23108)
Appeared test-rackspace-win2008r2-x64-6, test-rackspace-win2012r2-x64-5, test-softlayer-centos7-x64-1, test-rackspace-win2012r2-x64-7, test-digitalocean-alpine37_container-x64-2, test-digitalocean-ubuntu1604_sharedlibs_container-x64-1, test-digitalocean-alpine37_container-x64-1, test-macstadium-macos10.11-x64-2, test-rackspace-win2008r2-x64-4
First CI https://ci.nodejs.org/job/node-test-pull-request/17494/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17545/
Example
not ok 184 parallel/test-gc-http-client-connaborted
  ---
  duration_ms: 120.81
  severity: fail
  exitcode: 1
  stack: |-
    timeout
    We should do 500 requests
    Done: 43/500
    Collected: 0/88
    Done: 110/500
    Collected: 43/165
    Done: 197/500
    Collected: 110/242
    Done: 264/500
    Collected: 197/319
    Done: 294/500
    Collected: 264/495
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
    Done: 294/500
    Collected: 294/500
   ...

Reason sequential/test-gc-http-client-connaborted
Type JS_TEST_FAILURE
Failed PR 8 (nodejs/node#23168, nodejs/node#23102, nodejs/node#23156, nodejs/node#23184, nodejs/node#23189, nodejs/node#23193, nodejs/node#23196, nodejs/node#23164)
Appeared test-digitalocean-ubuntu1804_container-x64-2, test-digitalocean-alpine37_container-x64-2, test-rackspace-win2008r2-x64-6, test-digitalocean-ubuntu1604_sharedlibs_container-x64-10, test-rackspace-win2012r2-x64-1
First CI https://ci.nodejs.org/job/node-test-pull-request/17511/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17568/
Example
not ok 2308 sequential/test-gc-http-client-connaborted
  ---
  duration_ms: 120.63
  severity: fail
  exitcode: -15
  stack: |-
    timeout
    We should do 500 requests
    Done: 0/500
    Collected: 0/44
    Done: 60/500
    Collected: 0/110
    Done: 121/500
    Collected: 60/165
    Done: 209/500
    Collected: 121/253
    Done: 264/500
    Collected: 209/407
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
    Done: 264/500
    Collected: 264/500
  ...

Reason message/max_tick_depth
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#22631, nodejs/node#23110)
Appeared test-joyent-smartos17-x64-2, undefined
First CI https://ci.nodejs.org/job/node-test-pull-request/17535/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17570/
Example
not ok 2283 message/max_tick_depth
  ---
  duration_ms: 0.344
  severity: crashed
  exitcode: -11
  stack: |-
  ...


Reason parallel/test-crypto-pbkdf2
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#23118, nodejs/node#23083)
Appeared test-packetnet-ubuntu1604-arm64-2, test-packetnet-centos7-arm64-1
First CI https://ci.nodejs.org/job/node-test-pull-request/17499/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17522/
Example
not ok 303 parallel/test-crypto-pbkdf2
  ---
  duration_ms: 0.899
  severity: fail
  exitcode: 1
  stack: |-
    (node:50836) [DEP0009] DeprecationWarning: Calling pbkdf2 or pbkdf2Sync with "digest" set to null is deprecated.
    assert.js:84
      throw new AssertionError(obj);
      ^
    
    AssertionError [ERR_ASSERTION]: Expected inputs to be strictly equal:
    + actual - expected
    
    + ')\u001cª:\u0015ä*î\u0012ÁÂ\u0014\u0006tÖ\u001dú�74GäB�;'
    - '4��ÛËÓ+/2Ø\u0014¸\u0011n�Ï+\u00174~¼\u0018\u0000\u0018\u001c'
        at AsyncWrap.crypto.pbkdf2 (/home/iojs/build/workspace/node-test-commit-arm/nodes/ubuntu1604-arm64/test/parallel/test-crypto-pbkdf2.js:23:12)
        at AsyncWrap.wrap.ondone (internal/crypto/pbkdf2.js:38:48)
  ...


Reason parallel/test-crypto-scrypt
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#23112, nodejs/node#22381)
Appeared test-packetnet-centos7-arm64-2, test-packetnet-ubuntu1604-arm64-1
First CI https://ci.nodejs.org/job/node-test-pull-request/17526/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17563/
Example
not ok 398 parallel/test-crypto-scrypt
  ---
  duration_ms: 1.542
  severity: fail
  exitcode: 1
  stack: |-
    (node:64956) internal/test/binding: These APIs are exposed only for testing and are not tracked by any versioning system or deprecation process.
    assert.js:84
      throw new AssertionError(obj);
      ^
    
    AssertionError [ERR_ASSERTION]: Expected inputs to be strictly equal:
    + actual - expected
    
    + 'cbe5de2cb18da95f65924505963af32d120351da6df7be8bb0a58027457ed3bd99e0a6effa3dccd5e92d01e89528cf9025a49160c904fbe8fd363b98083ab280'
    - '7023bdcb3afd7348461c06cd81fd38ebfda8fbba904f8e3ea9b543f6545da1f2d5432955613f0fcf62d49705242a9af9e61e85dc0d651e40dfcf017b45575887'
        at AsyncWrap.crypto.scrypt.common.mustCall (/home/iojs/build/workspace/node-test-commit-arm/nodes/centos7-arm64-gcc6/test/parallel/test-crypto-scrypt.js:154:12)
        at AsyncWrap.<anonymous> (/home/iojs/build/workspace/node-test-commit-arm/nodes/centos7-arm64-gcc6/test/common/index.js:349:15)
        at AsyncW...

Reason parallel/test-gc-http-client
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#23083, nodejs/node#23118)
Appeared test-digitalocean-alpine37_container-x64-2, test-macstadium-macos10.11-x64-2
First CI https://ci.nodejs.org/job/node-test-pull-request/17486/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17499/
Example
not ok 1155 parallel/test-gc-http-client
  ---
  duration_ms: 120.39
  severity: fail
  exitcode: -15
  stack: |-
    timeout
    We should do 300 requests
    Done: 0/300
    Collected: 0/60
    Done: 14/300
    Collected: 0/75
    Done: 30/300
    Collected: 0/90
    Done: 45/300
    Collected: 14/105
    Done: 60/300
    Collected: 30/120
    Done: 75/300
    Collected: 45/135
    Done: 90/300
    Collected: 60/150
    Done: 105/300
    Collected: 75/165
    Done: 120/300
    Collected: 90/180
    Done: 135/300
    Collected: 105/195
    Done: 149/300
    Collected: 120/210
    Done: 149/300
    Collected: 135/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collected: 149/300
    Done: 149/300
    Collecte...

Reason parallel/test-gc-http-client-onerror
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#23118, nodejs/node#23134)
Appeared test-joyent-alpine37_container-x64-1, test-digitalocean-alpine37_container-x64-1, test-digitalocean-alpine37_container-x64-2
First CI https://ci.nodejs.org/job/node-test-pull-request/17529/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17576/
Example
not ok 1290 parallel/test-gc-http-client-onerror
  ---
  duration_ms: 120.63
  severity: fail
  exitcode: -15
  stack: |-
    timeout
    We should do 500 requests
    Done: 0/500
    Collected: 0/40
    Done: 40/500
    Collected: 0/80
    Done: 70/500
    Collected: 30/110
    Done: 90/500
    Collected: 50/130
    Done: 130/500
    Collected: 80/170
    Done: 140/500
    Collected: 113/300
    Done: 140/500
    Collected: 140/440
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140/500
    Collected: 140/500
    Done: 140...

Reason parallel/test-gc-http-client-timeout
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#23134, nodejs/node#23092)
Appeared test-macstadium-macos10.11-x64-1, test-rackspace-ubuntu1604-x64-2
First CI https://ci.nodejs.org/job/node-test-pull-request/17495/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17577/
Example
not ok 797 parallel/test-gc-http-client-timeout
  ---
  duration_ms: 120.40
  severity: fail
  exitcode: -15
  stack: |-
    timeout
    We should do 550 requests
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expected)
    Done: 0/550
    Collected: 0/66
    timeout (expected)
    timeout (expected)
    timeout (expected)
    timeout (expecte...

Reason parallel/test-gc-net-timeout
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#23051, nodejs/node#23134)
Appeared test-joyent-smartos16-x64-1, test-digitalocean-alpine37_container-x64-1, test-digitalocean-alpine37_container-x64-2, test-softlayer-centos6-x64-1
First CI https://ci.nodejs.org/job/node-test-pull-request/17520/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17576/
Example
not ok 768 parallel/test-gc-net-timeout
  ---
  duration_ms: 120.356
  severity: fail
  exitcode: -6
  stack: |-
    timeout
    We should do 500 requests
    Done: 44/500
    Collected: 1/55
    Done: 89/500
    Collected: 44/110
    Done: 121/500
    Collected: 77/132
    Done: 154/500
    Collected: 87/165
    Done: 187/500
    Collected: 87/198
    Done: 220/500
    Collected: 87/231
    Done: 253/500
    Collected: 87/264
    Done: 286/500
    Collected: 87/297
    Done: 308/500
    Collected: 87/319
    Done: 330/500
    Collected: 87/341
    Done: 352/500
    Collected: 87/363
    Done: 385/500
    Collected: 87/396
    Done: 407/500
    Collected: 87/418
    Done: 429/500
    Collected: 87/440
    Done: 462/500
    Collected: 87/473
    Done: 484/500
    Collected: 87/495
    Done: 500/500
    Collected: 87/500
    Done: 500/500
    Collected: 87/500
    Done: 500/500
    Collected: 87/500
    Done: 500/500
    Collected: 87/500
    Done: 500/500
    Collected: 87/500
    Done: 500/500
    Collected: ...

Reason parallel/test-http2-client-upload
Type JS_TEST_FAILURE
Failed PR 2 (nodejs/node#23053, nodejs/node#23177)
Appeared test-rackspace-win2012r2-x64-5, test-azure_msft-win10-x64-4
First CI https://ci.nodejs.org/job/node-test-pull-request/17519/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17534/
Example
not ok 251 parallel/test-http2-client-upload
  ---
  duration_ms: 0.282
  severity: fail
  exitcode: 1
  stack: |-
    events.js:167
          throw er; // Unhandled 'error' event
          ^
    
    Error: read ECONNRESET
        at TCP.onStreamRead (internal/stream_base_commons.js:112:27)
    Emitted 'error' event at:
        at emitErrorNT (internal/streams/destroy.js:82:8)
        at emitErrorAndCloseNT (internal/streams/destroy.js:50:3)
        at process._tickCallback (internal/process/next_tick.js:63:19)
  ...


Git Failure

Reason hudson.plugins.git.GitException: Command "git fetch --no-tags --progress [email protected]:nodejs/node.git +refs/heads/:refs/remotes/origin/" returned status code 143:
Type GIT_FAILURE
Failed PR 4 (nodejs/node#23115, nodejs/node#23117, nodejs/node#23120, nodejs/node#23132)
Appeared test-softlayer-ubuntu1604-x64-1
First CI https://ci.nodejs.org/job/node-test-pull-request/17527/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17532/
Example
hudson.plugins.git.GitException: Command "git fetch --no-tags --progress [email protected]:nodejs/node.git +refs/heads/*:refs/remotes/origin/*" returned status code 143:
stdout: 
stderr: remote: Enumerating objects: 45, done.        
remote: Counting objects:   2% (1/45)           remote: Counting objects:   4% (2/45)           remote: Counting objects:   6% (3/45)           remote: Counting objects:   8% (4/45)           remote: Counting objects:  11% (5/45)           remote: Counting objects:  13% (6/45)           remote: Counting objects:  15% (7/45)           remote: Counting objects:  17% (8/45)           remote: Counting objects:  20% (9/45)           remote: Counting objects:  22% (10/45)           remote: Counting objects:  24% (11/45)           remote: Counting objects:  26% (12/45)           remote: Counting objects:  28% (13/45)           remote: Counting objects:  31% (14/45)           remote: Counting objects:  33% (15/45)           remote: Counting objects:  35% (16/45)           remote: Counting o...

Reason hudson.plugins.git.GitException: Command "git fetch --tags --progress [email protected]:nodejs/node.git +refs/heads/:refs/remotes/origin/" returned status code 143:
Type GIT_FAILURE
Failed PR 3 (nodejs/node#23117, nodejs/node#23120, nodejs/node#23132)
Appeared test-softlayer-ubuntu1604-x64-1
First CI https://ci.nodejs.org/job/node-test-pull-request/17528/
Last CI https://ci.nodejs.org/job/node-test-pull-request/17532/
Example
hudson.plugins.git.GitException: Command "git fetch --tags --progress [email protected]:nodejs/node.git +refs/heads/*:refs/remotes/origin/*" returned status code 143:
stdout: 
stderr: remote: Enumerating objects: 45, done.        
remote: Counting objects:   2% (1/45)           remote: Counting objects:   4% (2/45)           remote: Counting objects:   6% (3/45)           remote: Counting objects:   8% (4/45)           remote: Counting objects:  11% (5/45)           remote: Counting objects:  13% (6/45)           remote: Counting objects:  15% (7/45)           remote: Counting objects:  17% (8/45)           remote: Counting objects:  20% (9/45)           remote: Counting objects:  22% (10/45)           remote: Counting objects:  24% (11/45)           remote: Counting objects:  26% (12/45)           remote: Counting objects:  28% (13/45)           remote: Counting objects:  31% (14/45)           remote: Counting objects:  33% (15/45)           remote: Counting objects:  35% (16/45)           remote: Counting obje...
@joyeecheung
Copy link
Member Author

joyeecheung commented Oct 2, 2018

Progress

  • sh: line 42: pgrep: command not found (2)
  • message/max_tick_depth (2)
  • parallel/test-crypto-pbkdf2 (2)
  • parallel/test-crypto-scrypt (2)
  • parallel/test-http2-client-upload (2)
  • hudson.plugins.git.GitException: Command "git fetch --no-tags --progress [email protected]:nodejs/node.git +refs/heads/*:refs/remotes/origin/*" returned status code 143: (4)
  • hudson.plugins.git.GitException: Command "git fetch --tags --progress [email protected]:nodejs/node.git +refs/heads/*:refs/remotes/origin/*" returned status code 143: (3)

In Progress

  • Failed in Propagate Binaries phase (git-nodesource-update-reference) (5): Can't rebuild arm anymore build#1512
  • Marking as flaky in test: mark some flakes node#23208
    • sequential/test-gc-http-client-timeout (12)
    • parallel/test-net-connect-options-port (11)
    • sequential/test-gc-http-client (10)
    • sequential/test-gc-http-client-onerror (10)
    • sequential/test-gc-net-timeout (10)
    • parallel/test-gc-http-client-connaborted (8)
    • sequential/test-gc-http-client-connaborted (8)
    • parallel/test-gc-http-client (2)
    • parallel/test-gc-http-client-onerror (2)
    • parallel/test-gc-http-client-timeout (2)
    • parallel/test-gc-net-timeout (2)

@refack
Copy link

refack commented Oct 2, 2018

  • Failed in Propagate Binaries phase (git-nodesource-update-reference) tracked in Can't rebuild arm anymore build#1512
    ATM node-test-commit-arm-fanned (i.e. the PI job) is disabled.

  • sequential/test-gc-http-client-timeout
    parallel/test-net-connect-options-port
    sequential/test-gc-http-client
    sequential/test-gc-http-client-onerror
    sequential/test-gc-net-timeout
    parallel/test-gc-http-client-connaborted
    sequential/test-gc-http-client-connaborted
    parallel/test-gc-http-client
    parallel/test-gc-http-client-onerror
    parallel/test-gc-http-client-timeout
    parallel/test-gc-net-timeout
    Making as flaky in test: mark some flakes node#23208

@refack
Copy link

refack commented Oct 2, 2018

(BTW I still have not implemented the RESUME_FAILURE idea so some of the build failures/git failures may still be resume failures)

Windows fanned can now resume. RPI job disabled.

@refack
Copy link

refack commented Oct 2, 2018

@Joyee could you add a few lines before the match?
for example
hudson.plugins.git.GitException: Command "git fetch --tags --progress [email protected]:nodejs/node.git +refs/heads/:refs/remotes/origin/" returned status code 143:

Shows:

hudson.plugins.git.GitException: Command "git fetch --tags --progress [email protected]:nodejs/node.git +refs/heads/*:refs/remotes/origin/*" returned status code 143:
stdout: 
stderr: remote: Enumerating objects: 45, done.        
remote: Counting objects:   2% (1/45)           remote: Counting objects:   4% (2/45)

but a key line is 2 line Before:

> git fetch --no-tags --progress [email protected]:nodejs/node.git +refs/heads/*:refs/remotes/origin/* # timeout=20
12:28:22 ERROR: Timeout after 20 minutes
12:28:22 ERROR: Error cloning remote repo 'origin'
12:28:22 hudson.plugins.git.GitException: Command "git fetch --no-tags --progress [email protected]:nodejs/node.git +refs/heads/*:refs/remotes/origin/*" returned status code 143:

@joyeecheung
Copy link
Member Author

joyeecheung commented Oct 2, 2018

@refack I tried to do that before but not every real cause of hudson.plugins.git.GitException is 2 lines before, it seemed (I think I previously set it to 1 for another type of error but then that didn't work on another error and showed a meaningless line), so I compromised to the exception line so that we can still reliably get the command from there.

I could strip the hudson.plugins.git.GitException: part, though, probably makes more sense anyway

@Trott
Copy link
Member

Trott commented Oct 4, 2018

After talking a bit with @addaleax, I did a bisect using a series of stress tests to determine what commit introduced the unreliability for test-gc-net-timeout (and likely the other test-gc-* files). The commit where the problem was introduced is nodejs/node@2790db5.

Taking our "approving the PRl means you are willing to take responsibility for the code" statement seriously, /ping @cjihrig @jasnell @mcollina @BridgeAR

Ref: https://gist.github.com/Trott/d3c3b5e4419497fdc8651ac2d5f805c7

@refack
Copy link

refack commented Oct 4, 2018

Since it's a libuv bump, It might need further unpacking:

https://github.com/libuv/libuv/releases/tag/v1.23.1:
Changes since version 1.23.0:

  • unix,win: limit concurrent DNS calls to nthreads/2 (Anna Henningsen)

  • doc: add addaleax to maintainers (Anna Henningsen)

  • doc: add missing slash in stream.rst (Emil Bay)

  • unix,fs: use utimes & friends for uv_fs_utime (Jeremiah Senkpiel)

  • unix,fs: remove linux fallback from utimesat() (Jeremiah Senkpiel)

  • unix,fs: remove uv__utimesat() syscall fallback (Jeremiah Senkpiel)

  • doc: fix argument name in tcp.rts (Emil Bay)

  • doc: notes on running tests, benchmarks, tools (Jamie Davis)

  • linux: remove epoll syscall wrappers (Ben Noordhuis)

  • linux: drop code path for epoll_pwait-less kernels (Ben Noordhuis)

  • Partially revert "win,code: remove GetQueuedCompletionStatus-based poller"
    (Jameson Nash)

  • build: add compile for android arm64/x86/x86-64 (Andy Zhang)

  • doc: clarify that some remarks apply to windows (Bert Belder)

  • test: fix compiler warnings (Jamie Davis)

  • ibmi: return 0 from uv_resident_set_memory() (dmabupt)

  • win: fix uv_udp_recv_start() error translation (Ryan Liptak)

  • win,doc: improve uv_os_setpriority() documentation (Bartosz Sosnowski)

  • test: increase upper bound in condvar_5 (Jamie Davis)

  • win,tty: remove deadcode (Jameson Nash)

  • stream: autodetect direction (Jameson Nash)

@addaleax
Copy link
Member

addaleax commented Oct 4, 2018

Going through the changelog:

unix,win: limit concurrent DNS calls to nthreads/2 (Anna Henningsen)

libuv/libuv@90891b4 – Wouldn’t be obvious why, but this seems like a potential culprit in that it affects kinda fundamental libuv behaviour cross-platform.

Maybe.

doc: add addaleax to maintainers (Anna Henningsen)

libuv/libuv@1391a3d – I sure hope this isn’t it :)

doc: add missing slash in stream.rst (Emil Bay)

libuv/libuv@ff45b0d – No.

unix,fs: use utimes & friends for uv_fs_utime (Jeremiah Senkpiel)
unix,fs: remove linux fallback from utimesat() (Jeremiah Senkpiel)
unix,fs: remove uv__utimesat() syscall fallback (Jeremiah Senkpiel)

libuv/libuv@ff45b0d...fa5c1d9 – This seems unlikely because it doesn’t affect Windows?

doc: fix argument name in tcp.rts (Emil Bay)

libuv/libuv@89a9ea6 – No.

doc: notes on running tests, benchmarks, tools (Jamie Davis)

libuv/libuv@c0c672e – No.

linux: remove epoll syscall wrappers (Ben Noordhuis)
linux: drop code path for epoll_pwait-less kernels (Ben Noordhuis)

libuv/libuv@c0c672e...f43c663 – This doesn’t affect Windows.

Partially revert "win,code: remove GetQueuedCompletionStatus-based poller"
(Jameson Nash)

libuv/libuv@153ea11 – This only affects Windows, making it very unlikely as well, if I understand correctly?

build: add compile for android arm64/x86/x86-64 (Andy Zhang)

libuv/libuv@baa8146 – No.

doc: clarify that some remarks apply to windows (Bert Belder)

libuv/libuv@baa621c – No.

test: fix compiler warnings (Jamie Davis)

libuv/libuv@abe9e01 – No.

ibmi: return 0 from uv_resident_set_memory() (dmabupt)

libuv/libuv@8813dca – No.

win: fix uv_udp_recv_start() error translation (Ryan Liptak)

libuv/libuv@57b3363 – Only affects UDP on Windows, doesn’t seem likely at all.

win,doc: improve uv_os_setpriority() documentation (Bartosz Sosnowski)

libuv/libuv@b721891 – No.

test: increase upper bound in condvar_5 (Jamie Davis)

libuv/libuv@bb1a49e – No.

win,tty: remove deadcode (Jameson Nash)

libuv/libuv@956bf6b – I’d go with No.

stream: autodetect direction (Jameson Nash)

libuv/libuv@4049879 – Hm… I also wouldn’t understand why, but this is, again, a cross-platform change that affects a number of things. The Windows-specific part is very small, though – essentially one added call to GetNumberOfConsoleInputEvents(). I don’t think that should have any side effects?

Very soft Maybe.


Here’s a stress test for libuv/libuv@90891b4 being reverted:
https://ci.nodejs.org/job/node-stress-single-test/2058/
https://ci.nodejs.org/job/node-stress-single-test/2059/

@refack
Copy link

refack commented Oct 4, 2018

Going through the changelog:

Thanks @addaleax I was just stating to do the same 🥇

On a more systematic approach, I'll try to devise a CI stress test that we should use for future uv and V8 bumps.

@addaleax
Copy link
Member

addaleax commented Oct 4, 2018

I think the root cause might be that my patch prevents all of the 500 DNS requests per test from finishing under some circumstances.

GC might be a red herring here, it might only be showing up in these tests because they are the only ones stress-testing the threadpool like that.

Also, I think I screwed up the stress test because I rebuild one of Rich’s ones, but that was for a commit where the tests were still in parallel/, not sequential/. New attempt: https://ci.nodejs.org/job/node-stress-single-test/2059/

@addaleax
Copy link
Member

addaleax commented Oct 4, 2018

The stress test is green on FreeBSD. That is probably not conclusive evidence, though, because most of Rich's bad runs also seem to have passed on FreeBSD.

I'm building on Windows to see if I can reproduce + try to address the issue there. My first guess would be a race condition on slow_io_work_running. That looks like it's a real bug, it's just not clear whether it's the cause of the test failures. addaleax/node@18fb4ab would be a naive fix for that (although there's probably a way to avoid the extra lock/unlock operations by moving the decrement into the next loop iteration).

addaleax added a commit to addaleax/libuv that referenced this issue Oct 4, 2018
90891b4 introduced a race
condition when accessing `slow_io_work_running` – it is being
increased and later decreased as part of the worker thread loop,
but was accessed with different mutexes during these operations.

This fixes the race condition by making sure both accesses
are protected through the global `mutex` of `threadpool.c`.

This fixes a number of flaky Node.js tests.

Refs: libuv#1845
Refs: nodejs/reliability#18
Refs: nodejs/node#23089
Refs: nodejs/node#23067
Refs: nodejs/node#23066
Refs: nodejs/node#23219
addaleax added a commit to addaleax/node that referenced this issue Oct 4, 2018
90891b4232e91dbd7a2e2077e4d23d16a374b41d introduced a race
condition when accessing `slow_io_work_running` – it is being
increased and later decreased as part of the worker thread loop,
but was accessed with different mutexes during these operations.

This fixes the race condition by making sure both accesses
are protected through the global `mutex` of `threadpool.c`.

This fixes a number of flaky Node.js tests.

Refs: libuv/libuv#1845
Refs: nodejs/reliability#18
Refs: nodejs#23089
Refs: nodejs#23067
Refs: nodejs#23066
Refs: nodejs#23219
@addaleax
Copy link
Member

addaleax commented Oct 4, 2018

The Windows stress test doesn’t have the characteristic failures that we’re looking for, although there are other failures in there:

not ok 1 sequential/test-gc-net-timeout
  ---
  duration_ms: 0.320
  severity: fail
  exitcode: 1
  stack: |-
    We should do 500 requests
    events.js:167
          throw er; // Unhandled 'error' event
          ^
    
    Error: connect EADDRINUSE 127.0.0.1:52620
        at TCPConnectWrap.afterConnect [as oncomplete] (net.js:1113:14)
    Emitted 'error' event at:
        at emitErrorNT (internal/streams/destroy.js:82:8)
        at emitErrorAndCloseNT (internal/streams/destroy.js:50:3)
        at process._tickCallback (internal/process/next_tick.js:63:19)

I’m not sure how that can happen, but I believe it’s independent of the bug that is making the test flaky in non-stress-test situations. Maybe Windows is running out of ports in some way…? @nodejs/platform-windows

@addaleax
Copy link
Member

addaleax commented Oct 4, 2018

Likely libuv fix is up @ libuv/libuv#2021

@Trott
Copy link
Member

Trott commented Oct 4, 2018

The stress test is green on FreeBSD. That is probably not conclusive evidence, though, because most of Rich's bad runs also seem to have passed on FreeBSD.

FreeBSD ended up being a canary. If it was red, something else was wrong than the test. Windows was the definitively problematic host and AIX was also problematic, but at 1/10 the rate that Windows was experiencing.

@richardlau
Copy link
Member

The Windows stress test doesn’t have the characteristic failures that we’re looking for, although there are other failures in there:

not ok 1 sequential/test-gc-net-timeout
  ---
  duration_ms: 0.320
  severity: fail
  exitcode: 1
  stack: |-
    We should do 500 requests
    events.js:167
          throw er; // Unhandled 'error' event
          ^
    
    Error: connect EADDRINUSE 127.0.0.1:52620
        at TCPConnectWrap.afterConnect [as oncomplete] (net.js:1113:14)
    Emitted 'error' event at:
        at emitErrorNT (internal/streams/destroy.js:82:8)
        at emitErrorAndCloseNT (internal/streams/destroy.js:50:3)
        at process._tickCallback (internal/process/next_tick.js:63:19)

I’m not sure how that can happen, but I believe it’s independent of the bug that is making the test flaky in non-stress-test situations. Maybe Windows is running out of ports in some way…? @nodejs/platform-windows

EADDRINUSE is an odd error to get when connecting. libuv does map errors on Windows so it could be one of WSAEADDRINUSE or ERROR_ADDRESS_ALREADY_ASSOCIATED:
https://github.com/libuv/libuv/blob/e4087dedf837f415056a45a838f639a3d9dc3ced/src/win/error.c#L75-L76

@addaleax
Copy link
Member

addaleax commented Oct 5, 2018

The socket's local address is already in use and the socket was not marked to allow address reuse with SO_REUSEADDR.

I guess that could be the case even for connect() without bind()… and this libuv comment (although for server sockets) sounds like setting SO_REUSEADDR would also be a bad idea:

https://github.com/libuv/libuv/blob/v1.x/src/win/tcp.c#L283-L291

addaleax added a commit to libuv/libuv that referenced this issue Oct 7, 2018
90891b4 introduced a race
condition when accessing `slow_io_work_running` – it is being
increased and later decreased as part of the worker thread loop,
but was accessed with different mutexes during these operations.

This fixes the race condition by making sure both accesses
are protected through the global `mutex` of `threadpool.c`.

This fixes a number of flaky Node.js tests.

Refs: #1845
Refs: nodejs/reliability#18
Refs: nodejs/node#23089
Refs: nodejs/node#23067
Refs: nodejs/node#23066
Refs: nodejs/node#23219
PR-URL: #2021
Reviewed-By: Santiago Gimeno <[email protected]>
Reviewed-By: Colin Ihrig <[email protected]>
Reviewed-By: Ben Noordhuis <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants