Nomad client high memory usage #11547

Closed
bubejur opened this issue Nov 22, 2021 · 38 comments · Fixed by #11741

Comments

bubejur commented Nov 22, 2021

Nomad version

Nomad v1.1.6 (b83d623)

Operating system and Environment details

CentOS 7

Issue

High memory usage by nomad processes

Reproduction steps

It does not happen frequently, but you can try to reproduce it.

Expected Result

Lower memory consumption.

Actual Result

Memory consumption is currently too high.

Job file (if appropriate)

job "PsHub_Host" {
  datacenters = ["nl5"]
  type = "service"

  update {
    max_parallel      = 50
    health_check      = "checks"
    min_healthy_time  = "10s"
    healthy_deadline  = "10m"
    progress_deadline = "15m"
    auto_revert       = true
    canary            = 0
    stagger           = "30s"
  }

  group "PsHub_Host-group-1" {
    count = 50

    restart {
      interval = "1m"
      attempts = 10
      delay    = "1s"
      mode     = "delay"
    }

    # disk is not actually a valuable attribute here due to raw exec driver
    ephemeral_disk {
      size = 51
    }

    task "PsHub_Host-worker-1" {
      driver = "raw_exec"

      constraint {
        attribute = "${meta.type}"
        value     = "PS_HOST"
      }

      constraint {
        attribute = "${meta.line_version}"
        value     = "1"
      }

      constraint {
        attribute = "${meta.line_type}"
        value     = "production"
      }

      resources {
        cpu    = 100
        memory = 128
      }

      # max_files * max_file_size should be < ephemeral_disk size
      logs {
        max_files     = 5
        max_file_size = 10
      }

      config {
        command = "/usr/bin/php"
        args = [
          "/data/ecogate/cli/worker.php", "PsHub_Host"
        ]
      }

      leader = false
      kill_timeout = "90s"
    }

    # TODO log-shipper task
  }

  group "PsHub_Host-group-2" {
    count = 50

    restart {
      interval = "1m"
      attempts = 10
      delay    = "1s"
      mode     = "delay"
    }

    # disk is not actually a valuable attribute here due to raw exec driver
    ephemeral_disk {
      size = 51
    }

    task "PsHub_Host-worker-2" {
      driver = "raw_exec"

      constraint {
        attribute = "${meta.type}"
        value     = "PS_HOST"
      }

      constraint {
        attribute = "${meta.line_version}"
        value     = "2"
      }

      constraint {
        attribute = "${meta.line_type}"
        value     = "production"
      }

      resources {
        cpu    = 100
        memory = 128
      }

      # max_files * max_file_size should be < ephemeral_disk size
      logs {
        max_files     = 5
        max_file_size = 10
      }

      config {
        command = "/usr/bin/php"
        args = [
          "/data/ecogate/cli/worker.php", "PsHub_Host"
        ]
      }

      leader = false
      kill_timeout = "90s"
    }

    # TODO log-shipper task
  }
}
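
The job file's comment says that max_files * max_file_size should stay below the ephemeral_disk size. A minimal sketch of that arithmetic, using the values from the job above and assuming all three settings are in MB (the function name is hypothetical and only for illustration):

```python
# Values copied from the job file above; all three are assumed to be in MB.
MAX_FILES = 5
MAX_FILE_SIZE_MB = 10
EPHEMERAL_DISK_MB = 51


def log_budget_mb(max_files: int, max_file_size_mb: int) -> int:
    """Return the log budget that the job file's comment refers to."""
    return max_files * max_file_size_mb


budget = log_budget_mb(MAX_FILES, MAX_FILE_SIZE_MB)
fits = budget < EPHEMERAL_DISK_MB
print(f"log budget: {budget} MB, ephemeral_disk: {EPHEMERAL_DISK_MB} MB, fits: {fits}")
```

With these values the budget is 50 MB, which is just under the 51 MB of ephemeral disk, so the constraint stated in the comment holds for both groups.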

Nomad Client logs (if appropriate)

I guess it's nothing but this:
Nov 21 04:50:51 worker36.2p-l1.nl5 consul[1044]: 2021-11-21T04:50:51.890Z [WARN]  agent: grpc: addrConn.createTransport failed to connect to {nl5-10.19.4.252:8300 0 nomad02.inf.nl5 <nil>}. Err :connection error: desc = "transport: Error while dialing dial tcp 10.19.19.136:0->10.19.4.252:8300: operation was canceled". Reconnecting...
Nov 21 04:50:51 worker36.2p-l1.nl5 consul: 2021-11-21T04:50:51.890Z [WARN]  agent: grpc: addrConn.createTransport failed to connect to {nl5-10.19.4.252:8300 0 nomad02.inf.nl5 <nil>}. Err :connection error: desc = "transport: Error while dialing dial tcp 10.19.19.136:0->10.19.4.252:8300: operation was canceled". Reconnecting...

image

Contributor

lgfa29 commented Nov 25, 2021

Thanks for the report @bubejur

Does this happen in clients or servers? Or does it affect multiple agents?

And just to give us an idea of scale, would you be able to share rough estimates of your cluster size? Like the number of servers, clients, jobs, etc.

Thanks!

Author

bubejur commented Nov 25, 2021

It happens on clients. It affects multiple agents, but not all of them. I don't think cluster size is the cause.

Author

bubejur commented Nov 27, 2021

We have already worked on this with @tgross

Author

bubejur commented Nov 27, 2021

but the memory leak still exists

Author

bubejur commented Nov 29, 2021

The old issue is #9858

@anastazya

We are seeing similar behaviour across all our workers: the same OOM effect, but now the progression is slower (~3 weeks). We are investigating this and will post more relevant data as we get it.

@anastazya

To try to help, I'm providing the output of ps -eo pid,tid,class,rtprio,stat,vsz,rss,comm from several production nodes below.

   WORKER25

   PID    TID CLS RTPRIO STAT    VSZ   RSS COMMAND
     1      1 TS       - Ss    52760  4752 systemd
     2      2 TS       - S         0     0 kthreadd
     4      4 TS       - S<        0     0 kworker/0:0H
     6      6 TS       - S         0     0 ksoftirqd/0
     7      7 FF      99 S         0     0 migration/0
     8      8 TS       - S         0     0 rcu_bh
     9      9 TS       - S         0     0 rcu_sched
    10     10 TS       - S<        0     0 lru-add-drain
    11     11 FF      99 S         0     0 watchdog/0
    12     12 FF      99 S         0     0 watchdog/1
    13     13 FF      99 S         0     0 migration/1
    14     14 TS       - S         0     0 ksoftirqd/1
    16     16 TS       - S<        0     0 kworker/1:0H
    17     17 FF      99 S         0     0 watchdog/2
    18     18 FF      99 S         0     0 migration/2
    19     19 TS       - S         0     0 ksoftirqd/2
    21     21 TS       - S<        0     0 kworker/2:0H
    22     22 FF      99 S         0     0 watchdog/3
    23     23 FF      99 S         0     0 migration/3
    24     24 TS       - S         0     0 ksoftirqd/3
    26     26 TS       - S<        0     0 kworker/3:0H
    27     27 FF      99 S         0     0 watchdog/4
    28     28 FF      99 S         0     0 migration/4
    29     29 TS       - S         0     0 ksoftirqd/4
    31     31 TS       - S<        0     0 kworker/4:0H
    32     32 FF      99 S         0     0 watchdog/5
    33     33 FF      99 S         0     0 migration/5
    34     34 TS       - S         0     0 ksoftirqd/5
    36     36 TS       - S<        0     0 kworker/5:0H
    37     37 FF      99 S         0     0 watchdog/6
    38     38 FF      99 S         0     0 migration/6
    39     39 TS       - S         0     0 ksoftirqd/6
    41     41 TS       - S<        0     0 kworker/6:0H
    42     42 FF      99 S         0     0 watchdog/7
    43     43 FF      99 S         0     0 migration/7
    44     44 TS       - S         0     0 ksoftirqd/7
    46     46 TS       - S<        0     0 kworker/7:0H
    47     47 FF      99 S         0     0 watchdog/8
    48     48 FF      99 S         0     0 migration/8
    49     49 TS       - S         0     0 ksoftirqd/8
    51     51 TS       - S<        0     0 kworker/8:0H
    52     52 FF      99 S         0     0 watchdog/9
    53     53 FF      99 S         0     0 migration/9
    54     54 TS       - S         0     0 ksoftirqd/9
    56     56 TS       - S<        0     0 kworker/9:0H
    57     57 FF      99 S         0     0 watchdog/10
    58     58 FF      99 S         0     0 migration/10
    59     59 TS       - S         0     0 ksoftirqd/10
    61     61 TS       - S<        0     0 kworker/10:0H
    62     62 FF      99 S         0     0 watchdog/11
    63     63 FF      99 S         0     0 migration/11
    64     64 TS       - S         0     0 ksoftirqd/11
    66     66 TS       - S<        0     0 kworker/11:0H
    68     68 TS       - S         0     0 kdevtmpfs
    69     69 TS       - S<        0     0 netns
    70     70 TS       - S         0     0 khungtaskd
    71     71 TS       - S<        0     0 writeback
    72     72 TS       - S<        0     0 kintegrityd
    73     73 TS       - S<        0     0 bioset
    74     74 TS       - S<        0     0 bioset
    75     75 TS       - S<        0     0 bioset
    76     76 TS       - S<        0     0 kblockd
    77     77 TS       - S<        0     0 md
    78     78 TS       - S<        0     0 edac-poller
    79     79 TS       - S<        0     0 watchdogd
    85     85 TS       - S         0     0 kswapd0
    86     86 TS       - SN        0     0 ksmd
    87     87 TS       - SN        0     0 khugepaged
    88     88 TS       - S<        0     0 crypto
    96     96 TS       - S<        0     0 kthrotld
    98     98 TS       - S<        0     0 kmpath_rdacd
    99     99 TS       - S<        0     0 kaluad
   100    100 TS       - S<        0     0 kpsmoused
   101    101 TS       - S<        0     0 ipv6_addrconf
   115    115 TS       - S<        0     0 deferwq
   153    153 TS       - S         0     0 kauditd
   361    361 TS       - S<        0     0 ata_sff
   365    365 TS       - S         0     0 scsi_eh_0
   366    366 TS       - S<        0     0 scsi_tmf_0
   367    367 TS       - S         0     0 scsi_eh_1
   368    368 TS       - S<        0     0 scsi_tmf_1
   373    373 TS       - S<        0     0 ttm_swap
   380    380 TS       - S<        0     0 virtscsi-scan
   381    381 TS       - S         0     0 scsi_eh_2
   382    382 TS       - S<        0     0 scsi_tmf_2
   418    418 TS       - S<        0     0 kworker/1:1H
   422    422 TS       - S<        0     0 kworker/0:1H
   460    460 TS       - S<        0     0 kdmflush
   461    461 TS       - S<        0     0 bioset
   472    472 TS       - S<        0     0 kdmflush
   473    473 TS       - S<        0     0 bioset
   486    486 TS       - S<        0     0 bioset
   487    487 TS       - S<        0     0 xfsalloc
   488    488 TS       - S<        0     0 xfs_mru_cache
   489    489 TS       - S<        0     0 xfs-buf/dm-0
   490    490 TS       - S<        0     0 xfs-data/dm-0
   491    491 TS       - S<        0     0 xfs-conv/dm-0
   492    492 TS       - S<        0     0 xfs-cil/dm-0
   493    493 TS       - S<        0     0 xfs-reclaim/dm-
   494    494 TS       - S<        0     0 xfs-log/dm-0
   495    495 TS       - S<        0     0 xfs-eofblocks/d
   496    496 TS       - S         0     0 xfsaild/dm-0
   579    579 TS       - Ss   203308 114388 systemd-journal
   598    598 TS       - Ss   198572  1352 lvmetad
   616    616 TS       - Ss    45356  1844 systemd-udevd
   654    654 TS       - S         0     0 hwrng
   666    666 TS       - S<        0     0 kworker/5:1H
   723    723 TS       - S<        0     0 nfit
   741    741 TS       - S         0     0 jbd2/sda1-8
   742    742 TS       - S<        0     0 ext4-rsv-conver
   781    781 TS       - S<sl  55532  1040 auditd
   785    785 TS       - S<        0     0 rpciod
   786    786 TS       - S<        0     0 xprtiod
   809    809 TS       - Ss    12032   936 statsd-aggregat
   810    810 TS       - Ss    69256  1400 rpcbind
   824    824 TS       - Ss    58220  2424 dbus-daemon
   829    829 TS       - Ss    47296  2456 ntpd
   834    834 TS       - Ss    21592  1284 irqbalance
   854    854 TS       - Ssl  612360 11176 polkitd
   857    857 TS       - Ss    44352  2572 qemu-ga
   859    859 TS       - Ssl  195208  2016 gssproxy
   882    882 TS       - Ss    26812  2236 systemd-logind
   932    932 TS       - Ssl  528040 17816 python
   943    943 TS       - Ss   126420  1640 crond
  1001   1001 TS       - S<        0     0 kworker/3:1H
  1002   1002 TS       - S<        0     0 kworker/4:1H
  1058   1058 TS       - Ss   116760  4408 lldpd
  1059   1059 TS       - Ssl  574280 19340 tuned
  1060   1060 TS       - Ssl  251356 43268 puppet
  1061   1061 TS       - Ssl  698612 56964 mcollectived
  1063   1063 TS       - S    116760  1616 lldpd
  1074   1074 TS       - Ss   112940  4360 sshd
  1079   1079 TS       - Ssl  698336 65460 rsyslogd
  1114   1114 TS       - Ssl  793528  2548 collectd
  1168   1168 TS       - Ssl  1574852 35248 containerd
  1210   1210 TS       - Ss   100072 18548 haproxy
  1216   1216 TS       - SNsl 117444  5084 osqueryd
  1415   1415 TS       - SNl  816540 41400 osqueryd
  1457   1457 TS       - Ss+  110204   864 agetty
  1463   1463 TS       - Ssl  134592 16016 unbound
  1507   1507 TS       - S<        0     0 kworker/11:1H
  2013   2013 TS       - S<        0     0 kworker/6:1H
  2014   2014 TS       - S<        0     0 kworker/7:1H
  2698   2698 TS       - S<        0     0 kworker/10:1H
  2749   2749 TS       - S<        0     0 kworker/2:1H
  3342   3342 TS       - S<        0     0 kworker/9:1H
  3597   3597 TS       - Sl   1451428 15556 nomad
  3613   3613 TS       - Sl   1516708 15592 nomad
  3656   3656 TS       - Sl   1647844 15504 nomad
  5322   5322 TS       - S<        0     0 kworker/8:1H
  7497   7497 TS       - Ssl  1153192 56248 dockerd
  7703   7703 TS       - Ssl  8015548 5989664 nomad
  8149   8149 TS       - S         0     0 kworker/6:1
  9834   9834 TS       - S         0     0 kworker/5:2
  9990   9990 TS       - Ssl  1724148 19712 nomad
 10004  10004 TS       - S    696172 153272 php
 10619  10619 TS       - Ssl  1593332 20380 nomad
 10633  10633 TS       - S    689788 148932 php
 10801  10801 TS       - Ssl  1658868 21284 nomad
 10816  10816 TS       - S    547864 93532 php
 11116  11116 TS       - Ssl  1527796 21144 nomad
 11130  11130 TS       - S    691304 149408 php
 11767  11767 TS       - S         0     0 kworker/6:2
 12193  12193 TS       - S         0     0 kworker/11:1
 14280  14280 TS       - S         0     0 kworker/8:2
 16023  16023 TS       - S         0     0 kworker/2:1
 20181  20181 TS       - S     89852  4100 pickup
 21170  21170 TS       - Ssl  1592820 20656 nomad
 21185  21185 TS       - S    527168 72920 php
 21349  21349 TS       - Ssl  1593076 20788 nomad
 21364  21364 TS       - S    562664 106324 php
 22804  22804 TS       - Ssl  1593076 20604 nomad
 22820  22820 TS       - S    636436 97848 php
 23196  23196 TS       - Ssl  1584880 20240 nomad
 23210  23210 TS       - S    561876 106188 php
 23352  23352 TS       - S         0     0 kworker/1:1
 23790  23790 TS       - S         0     0 kworker/3:1
 23971  23971 TS       - Ssl  1518832 20584 nomad
 23985  23985 TS       - S    526876 71164 php
 28259  28259 TS       - Ssl  1584368 19832 nomad
 28272  28272 TS       - S    527020 72396 php
 29484  29484 TS       - S         0     0 kworker/2:0
 30919  30919 TS       - Sl   1692180 22356 sensu-client
 32107  32107 TS       - Ssl  1519344 20400 nomad
 32122  32122 TS       - S    549928 95312 php
 32228  32228 TS       - Ssl  1519856 21056 nomad
 32242  32242 TS       - S    617000 78688 php
 32566  32566 TS       - Sl   1648100 14524 nomad
 32567  32567 TS       - Sl   1516964 14776 nomad
 34453  34453 TS       - Ssl  1518832 20092 nomad
 34468  34468 TS       - S    644912 104284 php
 35557  35557 TS       - Ssl  1650160 20476 nomad
 35573  35573 TS       - S    527020 72404 php
 36522  36522 TS       - S         0     0 kworker/4:2
 36586  36586 TS       - S         0     0 kworker/0:0
 36650  36650 TS       - S         0     0 kworker/2:2
 36795  36795 TS       - S         0     0 kworker/8:0
 36855  36855 TS       - Ssl  1519344 20760 nomad
 36869  36869 TS       - S    655464 115236 php
 37850  37850 TS       - S    182504  2600 crond
 37853  37853 TS       - Ss   113280  1200 sh
 37854  37854 TS       - S    113284  1436 puppet_cron.sh
 38290  38290 TS       - Ssl  1650416 20604 nomad
 38304  38304 TS       - S    526976 71868 php
 38434  38434 TS       - Ssl  1584880 20436 nomad
 38464  38464 TS       - S    531836 77676 php
 38480  38480 TS       - Ssl  1388272 20652 nomad
 38481  38481 TS       - Ssl  1584624 20820 nomad
 38506  38506 TS       - S    571648 116752 php
 38507  38507 TS       - S    561944 107152 php
 38548  38548 TS       - Ssl  1518832 20632 nomad
 38564  38564 TS       - S    573680 119772 php
 38993  38993 TS       - Ssl  1781232 19812 nomad
 39007  39007 TS       - S    526876 71196 php
 39008  39008 TS       - Ssl  1461748 20016 nomad
 39021  39021 TS       - S    526876 71204 php
 39035  39035 TS       - Ssl  1584624 20224 nomad
 39036  39036 TS       - Ssl  1650416 20688 nomad
 39062  39062 TS       - S    526976 71188 php
 39064  39064 TS       - S    526876 71192 php
 39078  39078 TS       - Ssl  1461492 20464 nomad
 39092  39092 TS       - S    526876 71196 php
 39507  39507 TS       - Ssl  1650160 20588 nomad
 39521  39521 TS       - S    530844 76776 php
 39994  39994 TS       - Ssl  1650416 20544 nomad
 40009  40009 TS       - S    577016 121916 php
 40155  40155 TS       - Ssl  1584112 20544 nomad
 40169  40169 TS       - S    526876 71192 php
 40262  40262 TS       - Ssl  1584112 20732 nomad
 40288  40288 TS       - S    533688 78772 php
 42446  42446 TS       - S         0     0 kworker/7:2
 43609  43609 TS       - Ssl  1518832 19976 nomad
 43625  43625 TS       - S    529056 73520 php
 43953  43953 TS       - Ssl  1649904 20588 nomad
 43968  43968 TS       - S    526876 71704 php
 43988  43988 TS       - Ssl  1519088 20688 nomad
 44003  44003 TS       - S    526876 71152 php
 44302  44302 TS       - Ssl  1453552 20008 nomad
 44317  44317 TS       - S    524972 70728 php
 44655  44655 TS       - S    108052   348 sleep
 44822  44822 TS       - Ssl  1649648 20176 nomad
 44837  44837 TS       - S    616324 77528 php
 44956  44956 TS       - Ssl  1650416 20248 nomad
 44972  44972 TS       - S    577312 122176 php
 45098  45098 TS       - Ssl  1658356 20644 nomad
 45112  45112 TS       - S    524972 70712 php
 45136  45136 TS       - Ssl  1649904 19904 nomad
 45150  45150 TS       - S    619672 80976 php
 45232  45232 TS       - Ssl  1649648 20632 nomad
 45246  45246 TS       - S    616184 77528 php
 45456  45456 TS       - Ssl  1519088 19968 nomad
 45471  45471 TS       - S    524972 70720 php
 45561  45561 TS       - Ssl  1510380 19908 nomad
 45576  45576 TS       - S    527352 72556 php
 45602  45602 TS       - Ssl  1453296 19772 nomad
 45617  45617 TS       - S    619596 81280 php
 45844  45844 TS       - Ssl  1649904 20724 nomad
 45858  45858 TS       - S    524972 70696 php
 45949  45949 TS       - Ssl  1584624 20012 nomad
 45963  45963 TS       - S    524928 70740 php
 46109  46109 TS       - Ssl  1650160 19544 nomad
 46119  46119 TS       - S    524972 70712 php
 46129  46129 TS       - Ssl  1584368 20892 nomad
 46143  46143 TS       - S    524928 70696 php
 46192  46192 TS       - Ssl  1584368 19768 nomad
 46207  46207 TS       - S    524972 70696 php
 46245  46245 TS       - Ssl  1518576 19948 nomad
 46259  46259 TS       - S    575684 121504 php
 46453  46453 TS       - Ssl  1715440 19836 nomad
 46467  46467 TS       - S    647552 190896 php
 46590  46590 TS       - Ssl  1510380 19976 nomad
 46605  46605 TS       - S    524972 70700 php
 46855  46855 TS       - Ssl  1650416 20532 nomad
 46869  46869 TS       - S    524972 70724 php
 46878  46878 TS       - Ssl  1649904 20004 nomad
 46887  46887 TS       - S    524972 70728 php
 46977  46977 TS       - Ssl  1502184 20264 nomad
 46991  46991 TS       - S    524972 70704 php
 47085  47085 TS       - Ssl  1650160 20000 nomad
 47100  47100 TS       - S    531528 76288 php
 47168  47168 TS       - Ssl  1584368 19920 nomad
 47183  47183 TS       - S    526876 71180 php
 47266  47266 TS       - Ssl  720648 16928 node_exporter
 47331  47331 TS       - Ssl  1584624 20404 nomad
 47346  47346 TS       - S    524972 70696 php
 47714  47714 TS       - Ssl  1575404 19496 nomad
 47727  47727 TS       - S    524972 70740 php
 47811  47811 TS       - Ssl  1518576 20804 nomad
 47825  47825 TS       - S    651688 119584 php
 47896  47896 TS       - Ssl  1501928 20544 nomad
 47911  47911 TS       - S    524972 70720 php
 47920  47920 TS       - Ssl  1493732 19960 nomad
 47934  47934 TS       - S    670844 191496 php
 48153  48153 TS       - Ssl  1436648 19812 nomad
 48168  48168 TS       - S    524972 70728 php
 48175  48175 TS       - Ssl  1567208 19468 nomad
 48189  48189 TS       - S    545664 90228 php
 48231  48231 TS       - Ss   158944  5660 sshd
 48257  48257 TS       - D    159256  2624 sshd
 48258  48258 TS       - Ss   116476  3020 bash
 48296  48296 TS       - S    241368  4740 sudo
 48297  48297 TS       - S    191876  2356 su
 48298  48298 TS       - S    116492  3052 bash
 48332  48332 TS       - Ssl  1436136 19364 nomad
 48346  48346 TS       - S    628828 88404 php
 48427  48427 TS       - Ssl  1346268 19816 nomad
 48441  48441 TS       - S    530424 76624 php
 48454  48454 TS       - Ssl  1215196 19436 nomad
 48468  48468 TS       - S    540644 86904 php
 48469  48469 TS       - Ssl  1280220 19036 nomad
 48483  48483 TS       - S    526876 71164 php
 48497  48497 TS       - Ssl  1485280 18816 nomad
 48512  48512 TS       - S    526876 71200 php
 48522  48522 TS       - R+   153324  1504 ps
 81187  81187 TS       - Sl   914716 24236 haproxy
 90692  90692 TS       - S         0     0 kworker/11:0
 91784  91784 TS       - S         0     0 kworker/u384:0
 94145  94145 TS       - Sl   1582244 14836 nomad
 94191  94191 TS       - Sl   1582564 14416 nomad
 94249  94249 TS       - Sl   1451172 14684 nomad
 94284  94284 TS       - Sl   1581796 14616 nomad
 94356  94356 TS       - Sl   1647524 14888 nomad
 94383  94383 TS       - Sl   1647780 14440 nomad
 94384  94384 TS       - Sl   1516772 14468 nomad
 94520  94520 TS       - Sl   1451492 15040 nomad
 94608  94608 TS       - Sl   1450916 14652 nomad
 94829  94829 TS       - Sl   1582244 14892 nomad
 94865  94865 TS       - Sl   1648036 15180 nomad
 94942  94942 TS       - Sl   1451172 14592 nomad
 95069  95069 TS       - Sl   1516452 14788 nomad
 95106  95106 TS       - Sl   1582500 15096 nomad
 95143  95143 TS       - Sl   1656232 14700 nomad
 95183  95183 TS       - Sl   1516772 15248 nomad
 95689  95689 TS       - Sl   1450916 14344 nomad
 95748  95748 TS       - Sl   1450916 14388 nomad
 95796  95796 TS       - Sl   1385124 14652 nomad
 95830  95830 TS       - Sl   1516708 14556 nomad
 96005  96005 TS       - Sl   1647524 14644 nomad
 96104  96104 TS       - Sl   1517028 15012 nomad
 96137  96137 TS       - Sl   1581988 14872 nomad
 96293  96293 TS       - Sl   1582564 14728 nomad
 96353  96353 TS       - Sl   1516452 14672 nomad
 96446  96446 TS       - Sl   1516580 14864 nomad
 96518  96518 TS       - Sl   1516452 14228 nomad
 96740  96740 TS       - Sl   1525160 16136 nomad
 96783  96783 TS       - Sl   1582564 15256 nomad
 96839  96839 TS       - Sl   1525160 17156 nomad
 96937  96937 TS       - Sl   1451236 15092 nomad
 96987  96987 TS       - Sl   1451492 14772 nomad
 97058  97058 TS       - Sl   1516964 15240 nomad
 97115  97115 TS       - Sl   1451172 14560 nomad
 97179  97179 TS       - Sl   1451236 14624 nomad
 97219  97219 TS       - Sl   1590696 17596 nomad
 97251  97251 TS       - Sl   1459880 16916 nomad
 97318  97318 TS       - Sl   1648036 15184 nomad
 97361  97361 TS       - Sl   1516516 14676 nomad
 97418  97418 TS       - Sl   1647012 14616 nomad
 97471  97471 TS       - Sl   1582500 14980 nomad
 97512  97512 TS       - Sl   1516196 14332 nomad
 97575  97575 TS       - Sl   1525224 17632 nomad
 97677  97677 TS       - Sl   1516708 14588 nomad
 97729  97729 TS       - Sl   1516964 15828 nomad
 97776  97776 TS       - Sl   1582564 15152 nomad
 97816  97816 TS       - Sl   1582500 15160 nomad
 97868  97868 TS       - Sl   1517220 14512 nomad
 97936  97936 TS       - Sl   1516708 14564 nomad
 97974  97974 TS       - Sl   1450980 15180 nomad
 98182  98182 TS       - Sl   1582244 14860 nomad
109272 109272 TS       - S         0     0 kworker/7:1
114348 114348 TS       - Ssl  854044 118264 consul
119099 119099 TS       - S         0     0 kworker/10:2
127048 127048 TS       - S         0     0 kworker/1:2
130846 130846 TS       - S         0     0 kworker/4:1
131939 131939 TS       - Ssl   40540 22144 consul-template
151492 151492 TS       - S         0     0 kworker/0:1
154777 154777 TS       - S         0     0 kworker/9:0
165022 165022 TS       - Sl   1581540 14332 nomad
165034 165034 TS       - Sl   1524904 17376 nomad
165087 165087 TS       - Sl   1647780 14824 nomad
165121 165121 TS       - Sl   1450404 14848 nomad
165165 165165 TS       - Sl   1450660 14512 nomad
165177 165177 TS       - Sl   1516708 15188 nomad
165209 165209 TS       - Sl   1319588 14460 nomad
165244 165244 TS       - Sl   1451428 14956 nomad
165272 165272 TS       - Sl   1647524 14448 nomad
165320 165320 TS       - Sl   1517220 14224 nomad
173411 173411 TS       - SNl  5387976 524076 java
176073 176073 TS       - S         0     0 kworker/9:2
182634 182634 TS       - Sl   1516452 14408 nomad
184025 184025 TS       - S         0     0 kworker/5:0
184867 184867 TS       - S         0     0 kworker/3:0
184868 184868 TS       - S         0     0 kworker/10:0
188342 188342 TS       - Ssl  907840 103796 vault
189736 189736 TS       - Ss    89748  2176 master
189738 189738 TS       - S     90032  4268 qmgr
190244 190244 TS       - S         0     0 kworker/u384:1
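
A listing this long is easier to read when it is ranked by RSS. The following is a minimal sketch, not part of the original report, that parses output in the column layout shown above and reports the largest process and the RSS total per command. The sample rows are copied from the WORKER25 listing; the function names are hypothetical, and the RSS column is assumed to be in KiB, as procps reports it.

```python
from collections import defaultdict

# A few rows copied from the WORKER25 listing above (RSS assumed to be KiB).
SAMPLE = """\
   PID    TID CLS RTPRIO STAT    VSZ   RSS COMMAND
  3597   3597 TS       - Sl   1451428 15556 nomad
  7497   7497 TS       - Ssl  1153192 56248 dockerd
  7703   7703 TS       - Ssl  8015548 5989664 nomad
  9990   9990 TS       - Ssl  1724148 19712 nomad
 10004  10004 TS       - S    696172 153272 php
173411 173411 TS       - SNl  5387976 524076 java
"""


def parse_ps(text):
    """Yield (pid, rss_kib, command) for each data row of the ps output."""
    for line in text.splitlines():
        # Eight columns: PID TID CLS RTPRIO STAT VSZ RSS COMMAND.
        fields = line.split(None, 7)
        # Skip blank lines, the header row, and host labels such as WORKER25.
        if len(fields) != 8 or not fields[0].isdigit():
            continue
        yield int(fields[0]), int(fields[6]), fields[7].strip()


def rss_by_command(rows):
    """Sum RSS (KiB) per command name."""
    totals = defaultdict(int)
    for _pid, rss_kib, command in rows:
        totals[command] += rss_kib
    return dict(totals)


rows = list(parse_ps(SAMPLE))
top = max(rows, key=lambda row: row[1])
totals = rss_by_command(rows)

print(f"largest process: pid={top[0]} comm={top[2]} rss={top[1] / 1024**2:.1f} GiB")
for command, rss_kib in sorted(totals.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{command:<10} {rss_kib / 1024:>8.0f} MiB")
```

On the sample rows, the largest process is the nomad process with PID 7703 at 5989664 KiB RSS (about 5.7 GiB), while the other nomad processes in the listing sit at roughly 14 to 22 MiB each. To run it against a live node, replace SAMPLE with the real ps output, for example by reading it from standard input.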

   WORKER27

   PID    TID CLS RTPRIO STAT    VSZ   RSS COMMAND
     1      1 TS       - Ss    52608  4724 systemd
     2      2 TS       - S         0     0 kthreadd
     4      4 TS       - S<        0     0 kworker/0:0H
     6      6 TS       - S         0     0 ksoftirqd/0
     7      7 FF      99 S         0     0 migration/0
     8      8 TS       - S         0     0 rcu_bh
     9      9 TS       - S         0     0 rcu_sched
    10     10 TS       - S<        0     0 lru-add-drain
    11     11 FF      99 S         0     0 watchdog/0
    12     12 FF      99 S         0     0 watchdog/1
    13     13 FF      99 S         0     0 migration/1
    14     14 TS       - S         0     0 ksoftirqd/1
    16     16 TS       - S<        0     0 kworker/1:0H
    17     17 FF      99 S         0     0 watchdog/2
    18     18 FF      99 S         0     0 migration/2
    19     19 TS       - S         0     0 ksoftirqd/2
    21     21 TS       - S<        0     0 kworker/2:0H
    22     22 FF      99 S         0     0 watchdog/3
    23     23 FF      99 S         0     0 migration/3
    24     24 TS       - S         0     0 ksoftirqd/3
    26     26 TS       - S<        0     0 kworker/3:0H
    27     27 FF      99 S         0     0 watchdog/4
    28     28 FF      99 S         0     0 migration/4
    29     29 TS       - S         0     0 ksoftirqd/4
    31     31 TS       - S<        0     0 kworker/4:0H
    32     32 FF      99 S         0     0 watchdog/5
    33     33 FF      99 S         0     0 migration/5
    34     34 TS       - S         0     0 ksoftirqd/5
    36     36 TS       - S<        0     0 kworker/5:0H
    37     37 FF      99 S         0     0 watchdog/6
    38     38 FF      99 S         0     0 migration/6
    39     39 TS       - S         0     0 ksoftirqd/6
    41     41 TS       - S<        0     0 kworker/6:0H
    42     42 FF      99 S         0     0 watchdog/7
    43     43 FF      99 S         0     0 migration/7
    44     44 TS       - S         0     0 ksoftirqd/7
    46     46 TS       - S<        0     0 kworker/7:0H
    47     47 FF      99 S         0     0 watchdog/8
    48     48 FF      99 S         0     0 migration/8
    49     49 TS       - S         0     0 ksoftirqd/8
    51     51 TS       - S<        0     0 kworker/8:0H
    52     52 FF      99 S         0     0 watchdog/9
    53     53 FF      99 S         0     0 migration/9
    54     54 TS       - S         0     0 ksoftirqd/9
    56     56 TS       - S<        0     0 kworker/9:0H
    57     57 FF      99 S         0     0 watchdog/10
    58     58 FF      99 S         0     0 migration/10
    59     59 TS       - S         0     0 ksoftirqd/10
    61     61 TS       - S<        0     0 kworker/10:0H
    62     62 FF      99 S         0     0 watchdog/11
    63     63 FF      99 S         0     0 migration/11
    64     64 TS       - S         0     0 ksoftirqd/11
    66     66 TS       - S<        0     0 kworker/11:0H
    68     68 TS       - S         0     0 kdevtmpfs
    69     69 TS       - S<        0     0 netns
    70     70 TS       - S         0     0 khungtaskd
    71     71 TS       - S<        0     0 writeback
    72     72 TS       - S<        0     0 kintegrityd
    73     73 TS       - S<        0     0 bioset
    74     74 TS       - S<        0     0 bioset
    75     75 TS       - S<        0     0 bioset
    76     76 TS       - S<        0     0 kblockd
    77     77 TS       - S<        0     0 md
    78     78 TS       - S<        0     0 edac-poller
    79     79 TS       - S<        0     0 watchdogd
    85     85 TS       - S         0     0 kswapd0
    86     86 TS       - SN        0     0 ksmd
    87     87 TS       - SN        0     0 khugepaged
    88     88 TS       - S<        0     0 crypto
    96     96 TS       - S<        0     0 kthrotld
    98     98 TS       - S<        0     0 kmpath_rdacd
    99     99 TS       - S<        0     0 kaluad
   101    101 TS       - S<        0     0 kpsmoused
   102    102 TS       - S<        0     0 ipv6_addrconf
   115    115 TS       - S<        0     0 deferwq
   152    152 TS       - S         0     0 kauditd
   359    359 TS       - S<        0     0 ata_sff
   364    364 TS       - S         0     0 scsi_eh_0
   365    365 TS       - S<        0     0 scsi_tmf_0
   366    366 TS       - S         0     0 scsi_eh_1
   367    367 TS       - S<        0     0 scsi_tmf_1
   370    370 TS       - S<        0     0 ttm_swap
   379    379 TS       - S<        0     0 virtscsi-scan
   380    380 TS       - S         0     0 scsi_eh_2
   381    381 TS       - S<        0     0 scsi_tmf_2
   415    415 TS       - S<        0     0 kworker/1:1H
   419    419 TS       - S<        0     0 kworker/0:1H
   457    457 TS       - S<        0     0 kdmflush
   458    458 TS       - S<        0     0 bioset
   469    469 TS       - S<        0     0 kdmflush
   470    470 TS       - S<        0     0 bioset
   483    483 TS       - S<        0     0 bioset
   484    484 TS       - S<        0     0 xfsalloc
   485    485 TS       - S<        0     0 xfs_mru_cache
   486    486 TS       - S<        0     0 xfs-buf/dm-0
   487    487 TS       - S<        0     0 xfs-data/dm-0
   488    488 TS       - S<        0     0 xfs-conv/dm-0
   489    489 TS       - S<        0     0 xfs-cil/dm-0
   490    490 TS       - S<        0     0 xfs-reclaim/dm-
   491    491 TS       - S<        0     0 xfs-log/dm-0
   492    492 TS       - S<        0     0 xfs-eofblocks/d
   493    493 TS       - S         0     0 xfsaild/dm-0
   577    577 TS       - Ss   251684 146208 systemd-journal
   596    596 TS       - Ss   124840  1316 lvmetad
   616    616 TS       - Ss    45356  1840 systemd-udevd
   655    655 TS       - S         0     0 hwrng
   712    712 TS       - S<        0     0 nfit
   726    726 TS       - S         0     0 jbd2/sda1-8
   727    727 TS       - S<        0     0 ext4-rsv-conver
   781    781 TS       - S<sl  55532  1052 auditd
   785    785 TS       - S<        0     0 rpciod
   786    786 TS       - S<        0     0 xprtiod
   810    810 TS       - Ss    69256  1400 rpcbind
   812    812 TS       - Ss    58208  2380 dbus-daemon
   833    833 TS       - Ss    21592  1284 irqbalance
   845    845 TS       - Ss    26820  2248 systemd-logind
   851    851 TS       - Ss    47296  2452 ntpd
   853    853 TS       - Ssl  612360 11692 polkitd
   857    857 TS       - Ss    44352  2604 qemu-ga
   861    861 TS       - Ss    12032   932 statsd-aggregat
   885    885 TS       - Ssl  528040 17808 python
   915    915 TS       - Ssl  195208  1928 gssproxy
   940    940 TS       - Ss   126420  1648 crond
  1000   1000 TS       - S<        0     0 kworker/7:1H
  1057   1057 TS       - Ssl  699636 56692 mcollectived
  1059   1059 TS       - Ssl  574280 18984 tuned
  1064   1064 TS       - Ssl  793528  2512 collectd
  1107   1107 TS       - Ssl  699184 79484 rsyslogd
  1120   1120 TS       - Ss   116760  4260 lldpd
  1133   1133 TS       - Ssl  251304 43280 puppet
  1144   1144 TS       - Ss   112940  4348 sshd
  1155   1155 TS       - S    116760  1584 lldpd
  1178   1178 TS       - Ss   100072 18548 haproxy
  1197   1197 TS       - Ssl  1443524 37488 containerd
  1199   1199 TS       - SNsl 117444  5088 osqueryd
  1318   1318 TS       - SNl  947784 43044 osqueryd
  1380   1380 TS       - Ss+  110204   860 agetty
  1399   1399 TS       - Ssl  134328 15888 unbound
  1445   1445 TS       - S<        0     0 kworker/4:1H
  2013   2013 TS       - S<        0     0 kworker/11:1H
  2226   2226 TS       - S<        0     0 kworker/6:1H
  2249   2249 TS       - S<        0     0 kworker/2:1H
  2738   2738 TS       - S<        0     0 kworker/10:1H
  2886   2886 TS       - S<        0     0 kworker/8:1H
  2911   2911 TS       - S<        0     0 kworker/9:1H
  3303   3303 TS       - S<        0     0 kworker/3:1H
  3497   3497 TS       - S<        0     0 kworker/5:1H
  6492   6492 TS       - S     89852  4100 pickup
  8999   8999 TS       - S         0     0 kworker/7:0
 18602  18602 TS       - S         0     0 kworker/4:0
 20196  20196 TS       - S         0     0 kworker/0:0
 20210  20210 TS       - S         0     0 kworker/3:1
 25862  25862 TS       - S         0     0 kworker/9:0
 26772  26772 TS       - S         0     0 kworker/u384:0
 27485  27485 TS       - Ssl  907840 106232 vault
 36701  36701 TS       - S         0     0 kworker/5:0
 36737  36737 TS       - S         0     0 kworker/6:2
 40227  40227 TS       - Ss    89748  2176 master
 40229  40229 TS       - S     90032  4264 qmgr
 41666  41666 TS       - Ssl  1527540 20452 nomad
 41681  41681 TS       - S    527020 72644 php
 42797  42797 TS       - Ssl  1658356 21012 nomad
 42811  42811 TS       - S    561892 105624 php
 43435  43435 TS       - Ssl  1658100 20524 nomad
 43449  43449 TS       - S    650924 108560 php
 44133  44133 TS       - Ssl  1349800 56352 dockerd
 44299  44299 TS       - Ssl  8986716 4774540 nomad
 44344  44344 TS       - Ssl  1593332 21280 nomad
 44359  44359 TS       - S    561252 105244 php
 45348  45348 TS       - S         0     0 kworker/1:2
 46229  46229 TS       - Ssl  1527540 21208 nomad
 46240  46240 TS       - S    527968 72408 php
 47283  47283 TS       - Ssl  1453552 20284 nomad
 47298  47298 TS       - S    640552 101208 php
 47456  47456 TS       - Ssl   33940 17848 consul-template
 47529  47529 TS       - S         0     0 kworker/u384:1
 49464  49464 TS       - S         0     0 kworker/10:1
 51923  51923 TS       - S         0     0 kworker/2:2
 55140  55140 TS       - S         0     0 kworker/5:2
 55676  55676 TS       - Ssl  1462260 20996 nomad
 55691  55691 TS       - S    527020 72388 php
 55862  55862 TS       - Sl   1582244 14156 nomad
 56252  56252 TS       - Ssl  1519600 20972 nomad
 56267  56267 TS       - S    623852 84376 php
 57396  57396 TS       - Ssl  1658868 20692 nomad
 57408  57408 TS       - S    621764 83264 php
 57486  57486 TS       - S         0     0 kworker/2:0
 57515  57515 TS       - Ssl  1519088 20744 nomad
 57528  57528 TS       - S    531836 78020 php
 57533  57533 TS       - Ssl  1461748 20180 nomad
 57549  57549 TS       - S    563980 108776 php
 57998  57998 TS       - Ssl  1658356 20096 nomad
 58013  58013 TS       - S    526876 71196 php
 58014  58014 TS       - Ssl  1723636 19944 nomad
 58028  58028 TS       - S    526876 71192 php
 58064  58064 TS       - Ssl  1585136 20528 nomad
 58077  58077 TS       - S    526876 71188 php
 58113  58113 TS       - Ssl  1519088 20804 nomad
 58114  58114 TS       - Ssl  1519344 20724 nomad
 58139  58139 TS       - S    526876 71196 php
 58143  58143 TS       - S    526976 71180 php
 58552  58552 TS       - S         0     0 kworker/11:1
 58812  58812 TS       - Ssl  1584880 20896 nomad
 58824  58824 TS       - S    526976 71192 php
 58945  58945 TS       - Ssl  1584880 20456 nomad
 58960  58960 TS       - S    632140 91824 php
 59661  59661 TS       - S         0     0 kworker/8:0
 59733  59733 TS       - S         0     0 kworker/1:1
 61275  61275 TS       - Ssl  1715952 20260 nomad
 61290  61290 TS       - S    526876 71712 php
 61325  61325 TS       - Ssl  1527284 21212 nomad
 61340  61340 TS       - S    529056 73520 php
 61348  61348 TS       - Ssl  1658356 19896 nomad
 61363  61363 TS       - S    526876 71160 php
 61464  61464 TS       - Ssl  1658868 20032 nomad
 61479  61479 TS       - S    553672 99288 php
 61526  61526 TS       - Ssl  1584880 20116 nomad
 61543  61543 TS       - S    532964 78812 php
 62390  62390 TS       - Ssl  1592308 20800 nomad
 62404  62404 TS       - S    616756 78212 php
 62439  62439 TS       - Ssl  1650416 20428 nomad
 62454  62454 TS       - S    616924 78312 php
 62501  62501 TS       - Ssl  1593332 20372 nomad
 62516  62516 TS       - S    527440 72552 php
 64024  64024 TS       - Ssl  1584624 20292 nomad
 64038  64038 TS       - S    524928 70720 php
 64160  64160 TS       - Ssl  1518832 20424 nomad
 64175  64175 TS       - S    563844 108440 php
 64211  64211 TS       - Ssl  1519088 20224 nomad
 64226  64226 TS       - S    524972 70704 php
 64368  64368 TS       - S         0     0 kworker/1:0
 64455  64455 TS       - Ssl  1527028 20684 nomad
 64456  64456 TS       - Ssl  1715696 20616 nomad
 64487  64487 TS       - S    524928 70732 php
 64488  64488 TS       - S    524972 70724 php
 64492  64492 TS       - Ssl  1584368 20752 nomad
 64514  64514 TS       - Ssl  1453296 20368 nomad
 64525  64525 TS       - S    524972 70704 php
 64535  64535 TS       - S    524928 70700 php
 64571  64571 TS       - Ssl  1650160 20392 nomad
 64586  64586 TS       - S    526876 71188 php
 64822  64822 TS       - Ssl  1649904 20368 nomad
 64837  64837 TS       - S    559832 104376 php
 64849  64849 TS       - Ssl  1584368 19900 nomad
 64863  64863 TS       - S    547328 92536 php
 64952  64952 TS       - Ssl  1584368 20612 nomad
 64967  64967 TS       - S    524972 70732 php
 65212  65212 TS       - Ssl  1519344 19972 nomad
 65227  65227 TS       - S    526876 71168 php
 65270  65270 TS       - Ssl  1649648 20484 nomad
 65284  65284 TS       - S    524972 70700 php
 65474  65474 TS       - Ssl  1650160 20352 nomad
 65489  65489 TS       - S    524972 70704 php
 65571  65571 TS       - Ssl  1584624 19724 nomad
 65586  65586 TS       - S    645108 104688 php
 65649  65649 TS       - Ssl  1453040 20524 nomad
 65663  65663 TS       - S    524972 70712 php
 65692  65692 TS       - Ssl  1584624 20936 nomad
 65707  65707 TS       - S    532832 78232 php
 65770  65770 TS       - Ssl  1519088 19944 nomad
 65781  65781 TS       - S    642752 103024 php
 66054  66054 TS       - Ssl  1387504 19584 nomad
 66068  66068 TS       - S    527168 72920 php
 66994  66994 TS       - Ssl  1649648 21024 nomad
 67008  67008 TS       - S    557648 102272 php
 67368  67368 TS       - S         0     0 kworker/11:2
 67448  67448 TS       - S         0     0 kworker/2:1
 67774  67774 TS       - Ssl  1453552 20628 nomad
 67791  67791 TS       - S    529124 75276 php
 67836  67836 TS       - Ssl  1518576 20416 nomad
 67852  67852 TS       - S    670820 192016 php
 68024  68024 TS       - Ssl  1445356 20072 nomad
 68039  68039 TS       - S    608636 154116 php
 68553  68553 TS       - Ssl  1510892 20388 nomad
 68567  68567 TS       - S    524972 70712 php
 68614  68614 TS       - Ssl  1632744 19832 nomad
 68628  68628 TS       - S    645592 114012 php
 68719  68719 TS       - Ssl  1575660 19836 nomad
 68735  68735 TS       - S    627180 88776 php
 68751  68751 TS       - Ssl  1567208 19088 nomad
 68766  68766 TS       - S    585464 128788 php
 68776  68776 TS       - Ss   158944  5664 sshd
 68778  68778 TS       - D    159256  2632 sshd
 68779  68779 TS       - Ss   116476  3020 bash
 68838  68838 TS       - Ssl  1576172 19444 nomad
 68853  68853 TS       - S    545844 90816 php
 68854  68854 TS       - S    241368  4740 sudo
 68856  68856 TS       - S    191876  2360 su
 68857  68857 TS       - S    116620  3096 bash
 69322  69322 TS       - Ssl  1510380 19404 nomad
 69336  69336 TS       - S    605540 149436 php
 69411  69411 TS       - R+   153324  1504 ps
 73301  73301 TS       - Ssl  853788 119552 consul
 79370  79370 TS       - Ssl  720648 15720 node_exporter
 86410  86410 TS       - S         0     0 kworker/11:0
100861 100861 TS       - S         0     0 kworker/7:1
108957 108957 TS       - S         0     0 kworker/8:2
112171 112171 TS       - Sl   1648292 15388 nomad
112183 112183 TS       - Sl   1516964 15356 nomad
112237 112237 TS       - Sl   1451428 15220 nomad
128957 128957 TS       - Sl   1516708 14304 nomad
129006 129006 TS       - Sl   1516964 14440 nomad
129046 129046 TS       - Sl   1451236 14784 nomad
129079 129079 TS       - Sl   1582756 14424 nomad
129169 129169 TS       - Sl   1516964 14684 nomad
129215 129215 TS       - Sl   1582052 14348 nomad
129242 129242 TS       - Sl   1516452 14584 nomad
129316 129316 TS       - Sl   1451236 14940 nomad
129454 129454 TS       - Sl   1581988 14996 nomad
129745 129745 TS       - Sl   1450916 15008 nomad
129788 129788 TS       - Sl   1656232 15048 nomad
129896 129896 TS       - Sl   1451172 14464 nomad
129941 129941 TS       - Sl   1450724 14380 nomad
129974 129974 TS       - Sl   1582500 15172 nomad
130011 130011 TS       - Sl   1582756 15124 nomad
130049 130049 TS       - Sl   1582500 15476 nomad
130492 130492 TS       - Sl   1581988 14524 nomad
130528 130528 TS       - Sl   1582500 14736 nomad
130713 130713 TS       - Sl   1582308 14524 nomad
130876 130876 TS       - Sl   1713316 14544 nomad
130922 130922 TS       - Sl   1516964 14688 nomad
130978 130978 TS       - Sl   1516772 15260 nomad
131089 131089 TS       - Sl   1451172 14716 nomad
131136 131136 TS       - Sl   1385124 14508 nomad
131182 131182 TS       - Sl   1648100 15096 nomad
131217 131217 TS       - Sl   1516708 14492 nomad
131417 131417 TS       - Sl   1451492 16440 nomad
131455 131455 TS       - Sl   1516964 14996 nomad
131530 131530 TS       - Sl   1590696 17556 nomad
131629 131629 TS       - Sl   1517028 15336 nomad
131667 131667 TS       - Sl   1516708 14608 nomad
131702 131702 TS       - Sl   1582564 15404 nomad
131734 131734 TS       - Sl   1385636 15308 nomad
131772 131772 TS       - Sl   1517220 14620 nomad
131814 131814 TS       - Sl   1516964 15012 nomad
131854 131854 TS       - Sl   1656232 17428 nomad
131914 131914 TS       - Sl   1590696 16696 nomad
131953 131953 TS       - Sl   1648036 15168 nomad
131993 131993 TS       - Sl   1581988 15064 nomad
132033 132033 TS       - Sl   1385124 14172 nomad
132068 132068 TS       - Sl   1451428 15224 nomad
132108 132108 TS       - Sl   1582244 14864 nomad
132160 132160 TS       - Sl   1787304 16788 nomad
132274 132274 TS       - Sl   1582564 14660 nomad
132337 132337 TS       - Sl   1525224 15460 nomad
132377 132377 TS       - Sl   1656488 14696 nomad
132412 132412 TS       - Sl   1647524 15220 nomad
132444 132444 TS       - Sl   1516964 14872 nomad
132484 132484 TS       - Sl   1516516 14504 nomad
132511 132511 TS       - Sl   1582500 15108 nomad
132723 132723 TS       - Sl   1385636 15236 nomad
138779 138779 TS       - Sl   1516452 14972 nomad
156503 156503 TS       - Sl   914852 22972 haproxy
159931 159931 TS       - Sl   1692160 21428 sensu-client
160314 160314 TS       - S         0     0 kworker/9:1
161600 161600 TS       - SNl  5387976 532584 java
167674 167674 TS       - S         0     0 kworker/6:0
178917 178917 TS       - S         0     0 kworker/3:0
178969 178969 TS       - R         0     0 kworker/0:1
188634 188634 TS       - S         0     0 kworker/10:0
193694 193694 TS       - S         0     0 kworker/4:1

   WORKER28

   PID    TID CLS RTPRIO STAT    VSZ   RSS COMMAND
     1      1 TS       - Ss    52720  4556 systemd
     2      2 TS       - S         0     0 kthreadd
     4      4 TS       - S<        0     0 kworker/0:0H
     6      6 TS       - S         0     0 ksoftirqd/0
     7      7 FF      99 S         0     0 migration/0
     8      8 TS       - S         0     0 rcu_bh
     9      9 TS       - S         0     0 rcu_sched
    10     10 TS       - S<        0     0 lru-add-drain
    11     11 FF      99 S         0     0 watchdog/0
    12     12 FF      99 S         0     0 watchdog/1
    13     13 FF      99 S         0     0 migration/1
    14     14 TS       - S         0     0 ksoftirqd/1
    16     16 TS       - S<        0     0 kworker/1:0H
    17     17 FF      99 S         0     0 watchdog/2
    18     18 FF      99 S         0     0 migration/2
    19     19 TS       - S         0     0 ksoftirqd/2
    21     21 TS       - S<        0     0 kworker/2:0H
    22     22 FF      99 S         0     0 watchdog/3
    23     23 FF      99 S         0     0 migration/3
    24     24 TS       - S         0     0 ksoftirqd/3
    26     26 TS       - S<        0     0 kworker/3:0H
    27     27 FF      99 S         0     0 watchdog/4
    28     28 FF      99 S         0     0 migration/4
    29     29 TS       - S         0     0 ksoftirqd/4
    31     31 TS       - S<        0     0 kworker/4:0H
    32     32 FF      99 S         0     0 watchdog/5
    33     33 FF      99 S         0     0 migration/5
    34     34 TS       - S         0     0 ksoftirqd/5
    36     36 TS       - S<        0     0 kworker/5:0H
    37     37 FF      99 S         0     0 watchdog/6
    38     38 FF      99 S         0     0 migration/6
    39     39 TS       - S         0     0 ksoftirqd/6
    41     41 TS       - S<        0     0 kworker/6:0H
    42     42 FF      99 S         0     0 watchdog/7
    43     43 FF      99 S         0     0 migration/7
    44     44 TS       - S         0     0 ksoftirqd/7
    46     46 TS       - S<        0     0 kworker/7:0H
    47     47 FF      99 S         0     0 watchdog/8
    48     48 FF      99 S         0     0 migration/8
    49     49 TS       - S         0     0 ksoftirqd/8
    51     51 TS       - S<        0     0 kworker/8:0H
    52     52 FF      99 S         0     0 watchdog/9
    53     53 FF      99 S         0     0 migration/9
    54     54 TS       - S         0     0 ksoftirqd/9
    56     56 TS       - S<        0     0 kworker/9:0H
    57     57 FF      99 S         0     0 watchdog/10
    58     58 FF      99 S         0     0 migration/10
    59     59 TS       - S         0     0 ksoftirqd/10
    61     61 TS       - S<        0     0 kworker/10:0H
    62     62 FF      99 S         0     0 watchdog/11
    63     63 FF      99 S         0     0 migration/11
    64     64 TS       - S         0     0 ksoftirqd/11
    66     66 TS       - S<        0     0 kworker/11:0H
    68     68 TS       - S         0     0 kdevtmpfs
    69     69 TS       - S<        0     0 netns
    70     70 TS       - S         0     0 khungtaskd
    71     71 TS       - S<        0     0 writeback
    72     72 TS       - S<        0     0 kintegrityd
    73     73 TS       - S<        0     0 bioset
    74     74 TS       - S<        0     0 bioset
    75     75 TS       - S<        0     0 bioset
    76     76 TS       - S<        0     0 kblockd
    77     77 TS       - S<        0     0 md
    78     78 TS       - S<        0     0 edac-poller
    79     79 TS       - S<        0     0 watchdogd
    85     85 TS       - S         0     0 kswapd0
    86     86 TS       - SN        0     0 ksmd
    87     87 TS       - SN        0     0 khugepaged
    88     88 TS       - S<        0     0 crypto
    96     96 TS       - S<        0     0 kthrotld
    98     98 TS       - S<        0     0 kmpath_rdacd
    99     99 TS       - S<        0     0 kaluad
   101    101 TS       - S<        0     0 kpsmoused
   102    102 TS       - S<        0     0 ipv6_addrconf
   115    115 TS       - S<        0     0 deferwq
   153    153 TS       - S         0     0 kauditd
   363    363 TS       - S<        0     0 ata_sff
   367    367 TS       - S         0     0 scsi_eh_0
   368    368 TS       - S<        0     0 scsi_tmf_0
   369    369 TS       - S         0     0 scsi_eh_1
   370    370 TS       - S<        0     0 scsi_tmf_1
   375    375 TS       - S<        0     0 ttm_swap
   381    381 TS       - S<        0     0 virtscsi-scan
   382    382 TS       - S         0     0 scsi_eh_2
   383    383 TS       - S<        0     0 scsi_tmf_2
   418    418 TS       - S<        0     0 kworker/0:1H
   459    459 TS       - S<        0     0 kdmflush
   460    460 TS       - S<        0     0 bioset
   471    471 TS       - S<        0     0 kdmflush
   472    472 TS       - S<        0     0 bioset
   485    485 TS       - S<        0     0 bioset
   486    486 TS       - S<        0     0 xfsalloc
   487    487 TS       - S<        0     0 xfs_mru_cache
   488    488 TS       - S<        0     0 xfs-buf/dm-0
   489    489 TS       - S<        0     0 xfs-data/dm-0
   490    490 TS       - S<        0     0 xfs-conv/dm-0
   491    491 TS       - S<        0     0 xfs-cil/dm-0
   492    492 TS       - S<        0     0 xfs-reclaim/dm-
   493    493 TS       - S<        0     0 xfs-log/dm-0
   494    494 TS       - S<        0     0 xfs-eofblocks/d
   495    495 TS       - S         0     0 xfsaild/dm-0
   577    577 TS       - Ss   121232 50128 systemd-journal
   595    595 TS       - Ss   198572  1344 lvmetad
   611    611 TS       - Ss    45356  1760 systemd-udevd
   653    653 TS       - S         0     0 hwrng
   666    666 TS       - S<        0     0 kworker/4:1H
   707    707 TS       - S<        0     0 nfit
   758    758 TS       - S         0     0 jbd2/sda1-8
   759    759 TS       - S<        0     0 ext4-rsv-conver
   783    783 TS       - S<sl  55532  1016 auditd
   787    787 TS       - S<        0     0 rpciod
   788    788 TS       - S<        0     0 xprtiod
   812    812 TS       - Ss    69372  1360 rpcbind
   813    813 TS       - Ss    58204  2084 dbus-daemon
   814    814 TS       - Ssl  195208  1996 gssproxy
   820    820 TS       - Ssl  612360 10256 polkitd
   835    835 TS       - Ss    26808  2128 systemd-logind
   849    849 TS       - Ss    44220  2392 qemu-ga
   866    866 TS       - Ss    21592  1216 irqbalance
   872    872 TS       - Ss    47296  2212 ntpd
   874    874 TS       - Ss    12032   892 statsd-aggregat
   899    899 TS       - Ssl  528040 15472 python
   917    917 TS       - Ss   126416  1588 crond
  1004   1004 TS       - S<        0     0 kworker/6:1H
  1068   1068 TS       - Ssl  251380 41764 puppet
  1069   1069 TS       - Ssl  574280 16804 tuned
  1072   1072 TS       - Ss   116760  3480 lldpd
  1078   1078 TS       - S    116760  1660 lldpd
  1084   1084 TS       - Ss   112940  3828 sshd
  1088   1088 TS       - Ssl  699632 54760 mcollectived
  1097   1097 TS       - Ssl  793528  4344 collectd
  1147   1147 TS       - Ssl  637076 32092 rsyslogd
  1170   1170 TS       - Ssl  1566400 34748 containerd
  1263   1263 TS       - SNsl 117444  4012 osqueryd
  1324   1324 TS       - Ss   100072 18512 haproxy
  1374   1374 TS       - Ssl  134592 15548 unbound
  1439   1439 TS       - SNl  816616 27356 osqueryd
  1478   1478 TS       - Ss+  110204   824 agetty
  1568   1568 TS       - Ssl  1592820 20720 nomad
  1583   1583 TS       - S    562336 106108 php
  2013   2013 TS       - S<        0     0 kworker/3:1H
  2314   2314 TS       - S<        0     0 kworker/2:1H
  2353   2353 TS       - S<        0     0 kworker/1:1H
  2407   2407 TS       - S<        0     0 kworker/11:1H
  2722   2722 TS       - S<        0     0 kworker/5:1H
  2876   2876 TS       - Ssl  1593332 20664 nomad
  2891   2891 TS       - S    527020 72644 php
  2912   2912 TS       - S<        0     0 kworker/8:1H
  2988   2988 TS       - S<        0     0 kworker/9:1H
  3105   3105 TS       - S<        0     0 kworker/7:1H
  3175   3175 TS       - S<        0     0 kworker/10:1H
  4116   4116 TS       - Ssl  1724148 20556 nomad
  4131   4131 TS       - S    562372 105704 php
  4973   4973 TS       - Ssl  1519344 20364 nomad
  4987   4987 TS       - S    528136 72948 php
  6939   6939 TS       - S         0     0 kworker/6:2
  7773   7773 TS       - Ss    89748  2176 master
  7775   7775 TS       - S     90032  4268 qmgr
  8108   8108 TS       - Ssl  1461748 20064 nomad
  8122   8122 TS       - S    579360 124520 php
  9667   9667 TS       - R         0     0 kworker/10:2
 10207  10207 TS       - Ssl  1593332 20404 nomad
 10222  10222 TS       - S    533820 79792 php
 10886  10886 TS       - Ssl  1658868 21136 nomad
 10900  10900 TS       - S    638516 98520 php
 11676  11676 TS       - S         0     0 kworker/1:1
 12000  12000 TS       - Ssl  1658868 21636 nomad
 12014  12014 TS       - S    617236 78956 php
 12404  12404 TS       - Ssl  1592820 20424 nomad
 12418  12418 TS       - S    552740 98044 php
 12496  12496 TS       - S         0     0 kworker/2:1
 15887  15887 TS       - Ssl  1592820 19988 nomad
 15902  15902 TS       - S    527020 72384 php
 16263  16263 TS       - Ssl  1658100 21128 nomad
 16277  16277 TS       - S    527168 72912 php
 16976  16976 TS       - Ssl  1715696 20516 nomad
 16990  16990 TS       - S    623952 84692 php
 17335  17335 TS       - S         0     0 kworker/9:2
 18607  18607 TS       - Ssl  1519088 20608 nomad
 18621  18621 TS       - S    531840 77716 php
 19016  19016 TS       - Ssl  1592628 20672 nomad
 19030  19030 TS       - S    526876 71196 php
 19036  19036 TS       - Ssl  1526772 20740 nomad
 19051  19051 TS       - S    526876 71200 php
 19121  19121 TS       - Ssl  1789940 19952 nomad
 19140  19140 TS       - S    526976 71208 php
 19229  19229 TS       - S         0     0 kworker/10:0
 19255  19255 TS       - Ssl  1715696 20180 nomad
 19272  19272 TS       - S    526876 71188 php
 19283  19283 TS       - Ssl  1519600 19976 nomad
 19299  19299 TS       - S    526876 71192 php
 19416  19416 TS       - S         0     0 kworker/2:2
 19925  19925 TS       - Ssl  1658100 21168 nomad
 19938  19938 TS       - S    526876 71188 php
 20258  20258 TS       - S         0     0 kworker/3:1
 20807  20807 TS       - Ssl  1593076 20344 nomad
 20822  20822 TS       - S    527032 71880 php
 21220  21220 TS       - Ssl  1658612 20552 nomad
 21235  21235 TS       - S    637024 97180 php
 21365  21365 TS       - S         0     0 kworker/5:2
 22378  22378 TS       - S         0     0 kworker/8:0
 23287  23287 TS       - Ssl  1723636 21648 nomad
 23301  23301 TS       - S    529056 73712 php
 23579  23579 TS       - Ssl  1593076 20300 nomad
 23593  23593 TS       - S    615448 76156 php
 23657  23657 TS       - Ssl  1649904 21056 nomad
 23672  23672 TS       - S    526876 71156 php
 24375  24375 TS       - Ssl  1453552 20556 nomad
 24391  24391 TS       - S    585288 129780 php
 24563  24563 TS       - Ssl  1584368 20448 nomad
 24577  24577 TS       - S    617068 78576 php
 24593  24593 TS       - Ssl  1519088 20468 nomad
 24609  24609 TS       - S    617188 78944 php
 24852  24852 TS       - Ssl  1649904 20672 nomad
 24866  24866 TS       - S    621584 82504 php
 24942  24942 TS       - Ssl  1650416 20600 nomad
 24957  24957 TS       - S    527588 72552 php
 26081  26081 TS       - Ssl  1453552 19980 nomad
 26096  26096 TS       - S    526876 71180 php
 27109  27109 TS       - Ssl  1658356 19888 nomad
 27124  27124 TS       - S    630072 91648 php
 27289  27289 TS       - S         0     0 kworker/4:0
 27509  27509 TS       - Ssl  1650160 20492 nomad
 27523  27523 TS       - S    526876 71160 php
 27536  27536 TS       - Ssl  1649904 20200 nomad
 27551  27551 TS       - S    561956 106836 php
 27569  27569 TS       - Ssl  1649904 20520 nomad
 27584  27584 TS       - S    549480 95188 php
 27628  27628 TS       - Ssl  1584624 20356 nomad
 27642  27642 TS       - S    565584 111600 php
 27991  27991 TS       - Ssl  1519088 20392 nomad
 28005  28005 TS       - S    647024 107504 php
 28111  28111 TS       - Ssl  1584624 20388 nomad
 28126  28126 TS       - S    532948 78248 php
 28375  28375 TS       - Ssl  1592820 20624 nomad
 28390  28390 TS       - S    534988 80764 php
 28480  28480 TS       - Ssl  1650160 20384 nomad
 28494  28494 TS       - S    645160 105196 php
 29479  29479 TS       - Ssl  1519600 20892 nomad
 29498  29498 TS       - S    575232 120104 php
 31138  31138 TS       - S         0     0 kworker/2:0
 31380  31380 TS       - Ssl  1649904 20624 nomad
 31394  31394 TS       - S    524972 70696 php
 31722  31722 TS       - Ssl  1518576 20020 nomad
 31737  31737 TS       - S    524972 70700 php
 31820  31820 TS       - Ssl  1453040 19624 nomad
 31834  31834 TS       - S    524972 70720 php
 31835  31835 TS       - Ssl  1388016 20464 nomad
 31849  31849 TS       - S    524972 70720 php
 31867  31867 TS       - Ssl  1387504 20708 nomad
 31882  31882 TS       - S    536872 82084 php
 32001  32001 TS       - Ssl  1453552 20848 nomad
 32021  32021 TS       - S    524972 70692 php
 32115  32115 TS       - Ssl  1518640 20208 nomad
 32129  32129 TS       - S    524972 70720 php
 32169  32169 TS       - Ssl  1453296 20224 nomad
 32183  32183 TS       - S    524972 70700 php
 32260  32260 TS       - Ssl  1781232 20728 nomad
 32282  32282 TS       - S    524972 70716 php
 32300  32300 TS       - Ssl  1584112 20720 nomad
 32317  32317 TS       - R    586008 128920 php
 32408  32408 TS       - Ssl  1584112 20348 nomad
 32423  32423 TS       - S    524972 70684 php
 32571  32571 TS       - Ssl  1584880 20424 nomad
 32586  32586 TS       - S    524972 70728 php
 32600  32600 TS       - S         0     0 kworker/0:1
 32860  32860 TS       - Ssl  1723636 20104 nomad
 32875  32875 TS       - S    524972 70716 php
 32971  32971 TS       - Ssl  1584368 21104 nomad
 32985  32985 TS       - S    524972 70700 php
 33097  33097 TS       - Ssl  1649712 20876 nomad
 33111  33111 TS       - S    524972 70700 php
 33138  33138 TS       - Ssl  1641964 19824 nomad
 33153  33153 TS       - S    579864 127076 php
 33348  33348 TS       - Ssl  1585136 20224 nomad
 33362  33362 TS       - S    664076 208792 php
 33457  33457 TS       - S         0     0 kworker/4:2
 33462  33462 TS       - S         0     0 kworker/0:3
 33463  33463 TS       - S         0     0 kworker/0:4
 33464  33464 TS       - S         0     0 kworker/0:5
 33465  33465 TS       - S         0     0 kworker/0:6
 33466  33466 TS       - S         0     0 kworker/0:7
 33467  33467 TS       - S         0     0 kworker/0:8
 33468  33468 TS       - S         0     0 kworker/0:9
 33469  33469 TS       - S         0     0 kworker/0:10
 33470  33470 TS       - S         0     0 kworker/0:11
 33471  33471 TS       - S         0     0 kworker/0:12
 33472  33472 TS       - S         0     0 kworker/0:13
 33473  33473 TS       - S         0     0 kworker/0:14
 33474  33474 TS       - S         0     0 kworker/0:15
 33475  33475 TS       - S         0     0 kworker/0:16
 33476  33476 TS       - S         0     0 kworker/0:17
 33477  33477 TS       - S         0     0 kworker/0:18
 33478  33478 TS       - S         0     0 kworker/0:19
 33479  33479 TS       - S         0     0 kworker/0:20
 33480  33480 TS       - S         0     0 kworker/0:21
 33481  33481 TS       - S         0     0 kworker/0:22
 33482  33482 TS       - S         0     0 kworker/0:23
 33560  33560 TS       - Ssl  1649904 20308 nomad
 33577  33577 TS       - S    555684 99704 php
 33611  33611 TS       - S         0     0 kworker/0:24
 33612  33612 TS       - S         0     0 kworker/0:25
 33613  33613 TS       - S         0     0 kworker/0:26
 33614  33614 TS       - S         0     0 kworker/0:27
 33615  33615 TS       - S         0     0 kworker/0:28
 33616  33616 TS       - S         0     0 kworker/0:29
 33617  33617 TS       - S         0     0 kworker/0:30
 33618  33618 TS       - S         0     0 kworker/0:31
 33619  33619 TS       - S         0     0 kworker/0:32
 33620  33620 TS       - S         0     0 kworker/0:33
 33621  33621 TS       - S         0     0 kworker/0:34
 33622  33622 TS       - S         0     0 kworker/0:35
 33623  33623 TS       - S         0     0 kworker/0:36
 33624  33624 TS       - S         0     0 kworker/0:37
 33625  33625 TS       - S         0     0 kworker/0:38
 33626  33626 TS       - S         0     0 kworker/0:39
 33627  33627 TS       - S         0     0 kworker/0:40
 33628  33628 TS       - S         0     0 kworker/0:41
 33629  33629 TS       - S         0     0 kworker/0:42
 33630  33630 TS       - S         0     0 kworker/0:43
 33631  33631 TS       - S         0     0 kworker/0:44
 33632  33632 TS       - S         0     0 kworker/0:45
 33633  33633 TS       - S         0     0 kworker/0:46
 33634  33634 TS       - S         0     0 kworker/0:47
 33635  33635 TS       - S         0     0 kworker/0:48
 33636  33636 TS       - S         0     0 kworker/0:49
 33637  33637 TS       - S         0     0 kworker/0:50
 33638  33638 TS       - S         0     0 kworker/0:51
 33639  33639 TS       - S         0     0 kworker/0:52
 33640  33640 TS       - S         0     0 kworker/0:53
 33641  33641 TS       - S         0     0 kworker/0:54
 33642  33642 TS       - S         0     0 kworker/0:55
 33643  33643 TS       - S         0     0 kworker/0:56
 33644  33644 TS       - S         0     0 kworker/0:57
 33645  33645 TS       - S         0     0 kworker/0:58
 33646  33646 TS       - S         0     0 kworker/3:0
 33956  33956 TS       - Ssl  1584368 20392 nomad
 33970  33970 TS       - S    542660 87804 php
 33986  33986 TS       - S         0     0 kworker/11:0
 34155  34155 TS       - S         0     0 kworker/10:1
 34160  34160 TS       - S    182500  2592 crond
 34164  34164 TS       - Ss   113280  1200 sh
 34165  34165 TS       - S    113284  1432 puppet_cron.sh
 34167  34167 TS       - S         0     0 kworker/u384:1
 34173  34173 TS       - S    108052   356 sleep
 34558  34558 TS       - Ssl  1584112 20104 nomad
 34574  34574 TS       - S    530612 75324 php
 34766  34766 TS       - Ss   158944  5660 sshd
 34769  34769 TS       - D    159256  2624 sshd
 34770  34770 TS       - Ss   116476  3016 bash
 34828  34828 TS       - S    241368  4744 sudo
 34829  34829 TS       - S    191876  2360 su
 34830  34830 TS       - S    116620  3096 bash
 34948  34948 TS       - Ssl  1567464 19524 nomad
 34966  34966 TS       - S    531144 75800 php
 35445  35445 TS       - Ssl  1501672 19772 nomad
 35460  35460 TS       - S    639472 108312 php
 35493  35493 TS       - Ssl  1354208 18376 nomad
 35508  35508 TS       - S    530084 75692 php
 35524  35524 TS       - Ssl  1346012 19784 nomad
 35538  35538 TS       - S    540360 86412 php
 35556  35556 TS       - R+   153324  1504 ps
 36344  36344 TS       - S         0     0 kworker/7:0
 45235  45235 TS       - S         0     0 kworker/3:2
 61034  61034 TS       - SNl  5387976 541016 java
 63818  63818 TS       - S         0     0 kworker/5:0
 71447  71447 TS       - S         0     0 kworker/u384:0
 74875  74875 TS       - Sl   1582500 16440 nomad
 75990  75990 TS       - S         0     0 kworker/11:1
 80831  80831 TS       - Sl   914480 24384 haproxy
 90102  90102 TS       - S         0     0 kworker/0:0
117763 117763 TS       - Sl   1692156 22444 sensu-client
120164 120164 TS       - S         0     0 kworker/1:0
123252 123252 TS       - Ssl  908096 84540 vault
125875 125875 TS       - Ssl   34196 17584 consul-template
141096 141096 TS       - Ssl  1144996 57084 dockerd
141262 141262 TS       - Ssl  8976564 6887240 nomad
141883 141883 TS       - S         0     0 kworker/6:1
147328 147328 TS       - Sl   1516708 15088 nomad
147381 147381 TS       - Sl   1516196 14936 nomad
147475 147475 TS       - Sl   1647844 14900 nomad
147510 147510 TS       - Sl   1647524 14632 nomad
147547 147547 TS       - Sl   1516196 14372 nomad
147584 147584 TS       - Sl   1647780 14344 nomad
147615 147615 TS       - Sl   1582564 14740 nomad
147694 147694 TS       - Sl   1451428 14512 nomad
147729 147729 TS       - Sl   1516708 14872 nomad
147851 147851 TS       - Sl   1516452 14952 nomad
147862 147862 TS       - Ssl  853788 114140 consul
147898 147898 TS       - Sl   1582244 14476 nomad
147984 147984 TS       - Sl   1450980 14204 nomad
148136 148136 TS       - Sl   1582244 14692 nomad
148200 148200 TS       - Sl   1517028 15412 nomad
148377 148377 TS       - Sl   1451428 14864 nomad
148480 148480 TS       - Sl   1582564 14484 nomad
148537 148537 TS       - Sl   1516452 15220 nomad
148568 148568 TS       - Sl   1451172 15544 nomad
148659 148659 TS       - Sl   1524904 15040 nomad
149012 149012 TS       - Sl   1590504 14720 nomad
149047 149047 TS       - Sl   1451428 14680 nomad
149127 149127 TS       - Sl   1451172 14704 nomad
149299 149299 TS       - Sl   1516516 14780 nomad
149447 149447 TS       - Sl   1516452 15108 nomad
149499 149499 TS       - Sl   1385892 15240 nomad
149667 149667 TS       - Sl   1516452 14956 nomad
149740 149740 TS       - Sl   1451236 14768 nomad
149776 149776 TS       - Sl   1582500 14432 nomad
149827 149827 TS       - Sl   1582500 14616 nomad
149887 149887 TS       - Sl   1581988 14428 nomad
150110 150110 TS       - Sl   1582500 15332 nomad
150185 150185 TS       - Sl   1451428 15916 nomad
150231 150231 TS       - Sl   1582244 15216 nomad
150322 150322 TS       - Sl   1525160 17512 nomad
150387 150387 TS       - Sl   1516964 14644 nomad
150459 150459 TS       - Sl   1582500 15320 nomad
150552 150552 TS       - Sl   1385892 15200 nomad
150585 150585 TS       - Sl   1647780 15504 nomad
150624 150624 TS       - Sl   1582500 15020 nomad
150686 150686 TS       - Sl   1451172 15352 nomad
150757 150757 TS       - Sl   1656232 15064 nomad
150805 150805 TS       - Sl   1451428 14140 nomad
150881 150881 TS       - Sl   1459688 17424 nomad
150924 150924 TS       - Sl   1459880 16396 nomad
150962 150962 TS       - Sl   1648356 15212 nomad
151027 151027 TS       - Sl   1516708 14572 nomad
151087 151087 TS       - Sl   1582244 14760 nomad
151128 151128 TS       - Sl   1582500 15408 nomad
151179 151179 TS       - Sl   1581732 14760 nomad
151263 151263 TS       - Sl   1525416 17504 nomad
151299 151299 TS       - Sl   1516964 15040 nomad
151358 151358 TS       - Sl   1647780 14912 nomad
151398 151398 TS       - Sl   1459688 17260 nomad
151439 151439 TS       - Sl   1451172 14480 nomad
151475 151475 TS       - Sl   1451236 15896 nomad
151518 151518 TS       - Sl   1516964 15036 nomad
151555 151555 TS       - Sl   1590696 14944 nomad
151649 151649 TS       - Sl   1516964 15228 nomad
151709 151709 TS       - Sl   1582244 15300 nomad
151779 151779 TS       - Sl   1582308 15256 nomad
151993 151993 TS       - Sl   1581988 15064 nomad
153197 153197 TS       - S         0     0 kworker/7:2
158337 158337 TS       - Sl   1582308 14256 nomad
160946 160946 TS       - S         0     0 kworker/11:2
161667 161667 TS       - S         0     0 kworker/u384:2
168527 168527 TS       - S         0     0 kworker/0:2
169901 169901 TS       - S         0     0 kworker/4:1
179453 179453 TS       - S         0     0 kworker/9:0
183405 183405 TS       - Sl   1647332 14452 nomad
184325 184325 TS       - S     89852  4096 pickup
186937 186937 TS       - Sl   1451492 15036 nomad
186943 186943 TS       - Sl   1517220 15764 nomad
187000 187000 TS       - Sl   1451428 15112 nomad
187885 187885 TS       - Ssl  720648 15408 node_exporter
193203 193203 TS       - S         0     0 kworker/8:1

Please let us know what other info we might collect and share if it helps.

@anastazya

Another one bit the dust today.

Screenshot 2021-12-02 at 15 24 50

   WORKER 28

   PID    TID CLS RTPRIO STAT    VSZ   RSS COMMAND
     1      1 TS       - Ss    52720  4556 systemd
     2      2 TS       - S         0     0 kthreadd
     4      4 TS       - S<        0     0 kworker/0:0H
     6      6 TS       - S         0     0 ksoftirqd/0
     7      7 FF      99 S         0     0 migration/0
     8      8 TS       - S         0     0 rcu_bh
     9      9 TS       - S         0     0 rcu_sched
    10     10 TS       - S<        0     0 lru-add-drain
    11     11 FF      99 S         0     0 watchdog/0
    12     12 FF      99 S         0     0 watchdog/1
    13     13 FF      99 S         0     0 migration/1
    14     14 TS       - S         0     0 ksoftirqd/1
    16     16 TS       - S<        0     0 kworker/1:0H
    17     17 FF      99 S         0     0 watchdog/2
    18     18 FF      99 S         0     0 migration/2
    19     19 TS       - S         0     0 ksoftirqd/2
    21     21 TS       - S<        0     0 kworker/2:0H
    22     22 FF      99 S         0     0 watchdog/3
    23     23 FF      99 S         0     0 migration/3
    24     24 TS       - S         0     0 ksoftirqd/3
    26     26 TS       - S<        0     0 kworker/3:0H
    27     27 FF      99 S         0     0 watchdog/4
    28     28 FF      99 S         0     0 migration/4
    29     29 TS       - S         0     0 ksoftirqd/4
    31     31 TS       - S<        0     0 kworker/4:0H
    32     32 FF      99 S         0     0 watchdog/5
    33     33 FF      99 S         0     0 migration/5
    34     34 TS       - S         0     0 ksoftirqd/5
    36     36 TS       - S<        0     0 kworker/5:0H
    37     37 FF      99 S         0     0 watchdog/6
    38     38 FF      99 S         0     0 migration/6
    39     39 TS       - S         0     0 ksoftirqd/6
    41     41 TS       - S<        0     0 kworker/6:0H
    42     42 FF      99 S         0     0 watchdog/7
    43     43 FF      99 S         0     0 migration/7
    44     44 TS       - S         0     0 ksoftirqd/7
    46     46 TS       - S<        0     0 kworker/7:0H
    47     47 FF      99 S         0     0 watchdog/8
    48     48 FF      99 S         0     0 migration/8
    49     49 TS       - S         0     0 ksoftirqd/8
    51     51 TS       - S<        0     0 kworker/8:0H
    52     52 FF      99 S         0     0 watchdog/9
    53     53 FF      99 S         0     0 migration/9
    54     54 TS       - S         0     0 ksoftirqd/9
    56     56 TS       - S<        0     0 kworker/9:0H
    57     57 FF      99 S         0     0 watchdog/10
    58     58 FF      99 S         0     0 migration/10
    59     59 TS       - S         0     0 ksoftirqd/10
    61     61 TS       - S<        0     0 kworker/10:0H
    62     62 FF      99 S         0     0 watchdog/11
    63     63 FF      99 S         0     0 migration/11
    64     64 TS       - S         0     0 ksoftirqd/11
    66     66 TS       - S<        0     0 kworker/11:0H
    68     68 TS       - S         0     0 kdevtmpfs
    69     69 TS       - S<        0     0 netns
    70     70 TS       - S         0     0 khungtaskd
    71     71 TS       - S<        0     0 writeback
    72     72 TS       - S<        0     0 kintegrityd
    73     73 TS       - S<        0     0 bioset
    74     74 TS       - S<        0     0 bioset
    75     75 TS       - S<        0     0 bioset
    76     76 TS       - S<        0     0 kblockd
    77     77 TS       - S<        0     0 md
    78     78 TS       - S<        0     0 edac-poller
    79     79 TS       - S<        0     0 watchdogd
    85     85 TS       - S         0     0 kswapd0
    86     86 TS       - SN        0     0 ksmd
    87     87 TS       - SN        0     0 khugepaged
    88     88 TS       - S<        0     0 crypto
    96     96 TS       - S<        0     0 kthrotld
    98     98 TS       - S<        0     0 kmpath_rdacd
    99     99 TS       - S<        0     0 kaluad
   101    101 TS       - S<        0     0 kpsmoused
   102    102 TS       - S<        0     0 ipv6_addrconf
   115    115 TS       - S<        0     0 deferwq
   153    153 TS       - S         0     0 kauditd
   363    363 TS       - S<        0     0 ata_sff
   367    367 TS       - S         0     0 scsi_eh_0
   368    368 TS       - S<        0     0 scsi_tmf_0
   369    369 TS       - S         0     0 scsi_eh_1
   370    370 TS       - S<        0     0 scsi_tmf_1
   375    375 TS       - S<        0     0 ttm_swap
   381    381 TS       - S<        0     0 virtscsi-scan
   382    382 TS       - S         0     0 scsi_eh_2
   383    383 TS       - S<        0     0 scsi_tmf_2
   418    418 TS       - S<        0     0 kworker/0:1H
   459    459 TS       - S<        0     0 kdmflush
   460    460 TS       - S<        0     0 bioset
   471    471 TS       - S<        0     0 kdmflush
   472    472 TS       - S<        0     0 bioset
   485    485 TS       - S<        0     0 bioset
   486    486 TS       - S<        0     0 xfsalloc
   487    487 TS       - S<        0     0 xfs_mru_cache
   488    488 TS       - S<        0     0 xfs-buf/dm-0
   489    489 TS       - S<        0     0 xfs-data/dm-0
   490    490 TS       - S<        0     0 xfs-conv/dm-0
   491    491 TS       - S<        0     0 xfs-cil/dm-0
   492    492 TS       - S<        0     0 xfs-reclaim/dm-
   493    493 TS       - S<        0     0 xfs-log/dm-0
   494    494 TS       - S<        0     0 xfs-eofblocks/d
   495    495 TS       - S         0     0 xfsaild/dm-0
   577    577 TS       - Ss   226776 128096 systemd-journal
   595    595 TS       - Ss   198572  1344 lvmetad
   611    611 TS       - Ss    45356  1756 systemd-udevd
   653    653 TS       - S         0     0 hwrng
   666    666 TS       - S<        0     0 kworker/4:1H
   707    707 TS       - S<        0     0 nfit
   758    758 TS       - S         0     0 jbd2/sda1-8
   759    759 TS       - S<        0     0 ext4-rsv-conver
   783    783 TS       - S<sl  55532  1016 auditd
   787    787 TS       - S<        0     0 rpciod
   788    788 TS       - S<        0     0 xprtiod
   812    812 TS       - Ss    69372  1360 rpcbind
   813    813 TS       - Ss    58204  2084 dbus-daemon
   814    814 TS       - Ssl  195208  1964 gssproxy
   820    820 TS       - Ssl  612360 10256 polkitd
   835    835 TS       - Ss    26808  2128 systemd-logind
   849    849 TS       - Ss    44220  2392 qemu-ga
   866    866 TS       - Ss    21592  1216 irqbalance
   872    872 TS       - Ss    47296  2212 ntpd
   874    874 TS       - Ss    12032   892 statsd-aggregat
   899    899 TS       - Ssl  528040 15476 python
   917    917 TS       - Ss   126416  1588 crond
  1004   1004 TS       - S<        0     0 kworker/6:1H
  1068   1068 TS       - Ssl  251380 41764 puppet
  1069   1069 TS       - Ssl  574280 16800 tuned
  1072   1072 TS       - Ss   116760  3480 lldpd
  1078   1078 TS       - S    116760  1660 lldpd
  1088   1088 TS       - Ssl  699632 54600 mcollectived
  1097   1097 TS       - Ssl  793528  4344 collectd
  1147   1147 TS       - Ssl  696040 70824 rsyslogd
  1170   1170 TS       - Ssl  1566400 34748 containerd
  1263   1263 TS       - SNsl 117444  4012 osqueryd
  1324   1324 TS       - Ss   100192 18488 haproxy
  1374   1374 TS       - Ssl  134592 15548 unbound
  1439   1439 TS       - SNl  882152 38596 osqueryd
  1478   1478 TS       - Ss+  110204   824 agetty
  2013   2013 TS       - S<        0     0 kworker/3:1H
  2314   2314 TS       - S<        0     0 kworker/2:1H
  2353   2353 TS       - S<        0     0 kworker/1:1H
  2407   2407 TS       - S<        0     0 kworker/11:1H
  2722   2722 TS       - S<        0     0 kworker/5:1H
  2912   2912 TS       - S<        0     0 kworker/8:1H
  2988   2988 TS       - S<        0     0 kworker/9:1H
  3105   3105 TS       - S<        0     0 kworker/7:1H
  3175   3175 TS       - S<        0     0 kworker/10:1H
  7082   7082 TS       - S         0     0 kworker/8:0
  7773   7773 TS       - Ss    89748  2176 master
  7775   7775 TS       - S     90032  4160 qmgr
 11829  11829 TS       - S         0     0 kworker/5:1
 32357  32357 TS       - Sl   914692 23396 haproxy
 56174  56174 TS       - S         0     0 kworker/0:0
 61034  61034 TS       - SNl  5387976 540992 java
 65563  65563 TS       - S     89852  4096 pickup
 71244  71244 TS       - S         0     0 kworker/11:0
 74875  74875 TS       - Sl   1582500 16360 nomad
 89110  89110 TS       - S         0     0 kworker/4:0
 96889  96889 TS       - Ss   112940  4356 sshd
 98669  98669 TS       - S         0     0 kworker/1:2
 99767  99767 TS       - S         0     0 kworker/5:2
101920 101920 TS       - S         0     0 kworker/4:2
102132 102132 TS       - S         0     0 kworker/9:2
102541 102541 TS       - Ssl  1584880 20116 nomad
102557 102557 TS       - S    526876 71184 php
102564 102564 TS       - Ssl  1723892 20600 nomad
102579 102579 TS       - S    526876 71168 php
102625 102625 TS       - Ssl  1593332 21056 nomad
102640 102640 TS       - S    526876 71160 php
102791 102791 TS       - Ssl  1519088 20836 nomad
102806 102806 TS       - S    526976 71204 php
102807 102807 TS       - Ssl  1592820 21204 nomad
102821 102821 TS       - S    526876 71192 php
103574 103574 TS       - Ssl  1519344 21496 nomad
103590 103590 TS       - S    526876 71164 php
104518 104518 TS       - Ssl  1593332 20432 nomad
104546 104546 TS       - S    533020 78512 php
104785 104785 TS       - Ssl  1462068 20392 nomad
104801 104801 TS       - S    526976 72116 php
106991 106991 TS       - Ssl  1462260 21220 nomad
107006 107006 TS       - S    620300 81832 php
107592 107592 TS       - S         0     0 kworker/10:1
107884 107884 TS       - Ssl  1527284 20952 nomad
107898 107898 TS       - S    529056 75388 php
108096 108096 TS       - Ssl  1584880 20396 nomad
108111 108111 TS       - S    528296 73336 php
108332 108332 TS       - Ssl  1715952 20632 nomad
108354 108354 TS       - S    615524 76216 php
109440 109440 TS       - Ssl  1592820 20868 nomad
109454 109454 TS       - S    595520 139612 php
109651 109651 TS       - Ssl  1527028 20004 nomad
109662 109662 TS       - S    621752 81756 php
109695 109695 TS       - Ssl  1585136 20788 nomad
109709 109709 TS       - S    621748 81604 php
110958 110958 TS       - S         0     0 kworker/3:2
111290 111290 TS       - S         0     0 kworker/0:2
111345 111345 TS       - Ssl  1593332 20300 nomad
111361 111361 TS       - S    526876 71164 php
111705 111705 TS       - Ssl  1461748 20652 nomad
111720 111720 TS       - S    556664 102212 php
111751 111751 TS       - Ssl  1585136 21416 nomad
111764 111764 TS       - S    556460 101152 php
112020 112020 TS       - Ssl  1585136 20752 nomad
112034 112034 TS       - S    577128 123328 php
112531 112531 TS       - S         0     0 kworker/7:2
113846 113846 TS       - Ssl  1724404 20612 nomad
113861 113861 TS       - S    617080 78460 php
114102 114102 TS       - Ssl  1658356 20344 nomad
114116 114116 TS       - S    545676 91108 php
115834 115834 TS       - S         0     0 kworker/7:1
115854 115854 TS       - S         0     0 kworker/u384:1
117763 117763 TS       - Sl   1692156 22444 sensu-client
117912 117912 TS       - Ssl  1650416 20056 nomad
117926 117926 TS       - S    526876 71180 php
118109 118109 TS       - Ssl  1592820 21476 nomad
118125 118125 TS       - S    531616 77716 php
118462 118462 TS       - Ssl  1650416 20908 nomad
118477 118477 TS       - S    527020 72648 php
118842 118842 TS       - S         0     0 kworker/2:1
120097 120097 TS       - Ssl  1584368 20644 nomad
120111 120111 TS       - S    550108 95432 php
120141 120141 TS       - Ssl  1592564 20160 nomad
120156 120156 TS       - S    552948 98796 php
120188 120188 TS       - Ssl  1527028 20800 nomad
120202 120202 TS       - S    527020 72652 php
121729 121729 TS       - S         0     0 kworker/8:1
123252 123252 TS       - Ssl  908288 94028 vault
123590 123590 TS       - S         0     0 kworker/6:0
123628 123628 TS       - Ssl  1789940 19976 nomad
123642 123642 TS       - S    526876 71172 php
124350 124350 TS       - R         0     0 kworker/11:1
125275 125275 TS       - Ssl  1592820 19920 nomad
125290 125290 TS       - S    655612 115244 php
125875 125875 TS       - Ssl   34196 16984 consul-template
126019 126019 TS       - Ssl  1519600 20828 nomad
126033 126033 TS       - S    638104 98824 php
126449 126449 TS       - Ssl  1584880 20664 nomad
126463 126463 TS       - S    655324 114876 php
126609 126609 TS       - Ssl  1518832 20796 nomad
126623 126623 TS       - S    623804 84068 php
126690 126690 TS       - Ssl  1658356 19992 nomad
126704 126704 TS       - S    623804 83972 php
128259 128259 TS       - S         0     0 kworker/10:2
128265 128265 TS       - S         0     0 kworker/1:0
128420 128420 TS       - Ssl  1650416 20060 nomad
128435 128435 TS       - S    567560 113432 php
128507 128507 TS       - S         0     0 kworker/u384:0
128697 128697 TS       - Ssl  1724148 20404 nomad
128712 128712 TS       - S    533188 79436 php
129487 129487 TS       - S         0     0 kworker/2:0
129760 129760 TS       - Ssl  1518896 20980 nomad
129775 129775 TS       - S    524972 70688 php
129864 129864 TS       - Ssl  1724148 19872 nomad
129879 129879 TS       - S    524972 70716 php
129886 129886 TS       - Ssl  1658612 20252 nomad
129900 129900 TS       - S    524972 70704 php
131290 131290 TS       - Ssl  1585136 20832 nomad
131304 131304 TS       - S    573820 120084 php
131314 131314 TS       - Ssl  1519088 20740 nomad
131328 131328 TS       - S    569540 114800 php
131538 131538 TS       - Ssl  1584880 20640 nomad
131553 131553 TS       - S    545012 91432 php
131892 131892 TS       - Ssl  1723636 19888 nomad
131907 131907 TS       - S    662056 208484 php
132338 132338 TS       - Ssl  1592564 20492 nomad
132353 132353 TS       - S    584024 127384 php
132638 132638 TS       - Ssl  1453296 20412 nomad
132652 132652 TS       - S    582400 126976 php
132696 132696 TS       - Ssl  1453296 20476 nomad
132711 132711 TS       - S    662448 128512 php
132763 132763 TS       - S         0     0 kworker/5:0
132843 132843 TS       - Ssl  1715696 20288 nomad
132858 132858 TS       - S    527172 72644 php
133047 133047 TS       - S         0     0 kworker/10:0
133314 133314 TS       - Ssl  1650160 21236 nomad
133329 133329 TS       - S    629748 89396 php
133358 133358 TS       - Ssl  1518832 19964 nomad
133372 133372 TS       - S    629780 89308 php
133585 133585 TS       - Ssl  1436392 20116 nomad
133598 133598 TS       - S    524972 70708 php
133677 133677 TS       - S    182500  2592 crond
133679 133679 TS       - Ss   113280  1200 sh
133681 133681 TS       - S    113284  1436 puppet_cron.sh
133689 133689 TS       - S    108052   352 sleep
133824 133824 TS       - Ssl  1518832 20456 nomad
133836 133836 TS       - S    528728 74848 php
133867 133867 TS       - Ssl  1706988 20124 nomad
133882 133882 TS       - S    524928 70704 php
133960 133960 TS       - Ssl  1518832 19452 nomad
133975 133975 TS       - R    583160 126800 php
134032 134032 TS       - Ssl  1501672 19824 nomad
134046 134046 TS       - S    673120 195104 php
134054 134054 TS       - Ssl  1575916 20740 nomad
134068 134068 TS       - S    581208 124844 php
134075 134075 TS       - Ssl  1519344 20420 nomad
134090 134090 TS       - S    524972 70700 php
134101 134101 TS       - Ssl  1576172 19980 nomad
134116 134116 TS       - S    524972 70700 php
134190 134190 TS       - Ss   158944  5660 sshd
134196 134196 TS       - D    159256  2628 sshd
134197 134197 TS       - Ss   116476  3016 bash
134264 134264 TS       - Ssl  1370600 20976 nomad
134279 134279 TS       - S    524972 70696 php
134280 134280 TS       - S    241368  4740 sudo
134281 134281 TS       - S    191876  2356 su
134282 134282 TS       - S    116492  3076 bash
134306 134306 TS       - Ssl  1567720 20456 nomad
134320 134320 TS       - S    524972 70700 php
134446 134446 TS       - Ssl  1370600 19168 nomad
134460 134460 TS       - S    524972 70712 php
134461 134461 TS       - Ssl  1445100 18784 nomad
134479 134479 TS       - S    527980 74040 php
134510 134510 TS       - Ssl  1706988 19912 nomad
134524 134524 TS       - S    524928 70740 php
134531 134531 TS       - Ssl  1501672 20268 nomad
134551 134551 TS       - S    524972 70696 php
134576 134576 TS       - Ssl  1501672 20172 nomad
134590 134590 TS       - S    532588 77588 php
134618 134618 TS       - Ssl  1444844 19828 nomad
134633 134633 TS       - S    524972 70724 php
134765 134765 TS       - Ssl  1279964 19704 nomad
134779 134779 TS       - S    540744 86900 php
134791 134791 TS       - Ssl  1289184 19652 nomad
134806 134806 TS       - R    539312 86348 php
134807 134807 TS       - Ssl  1346012 19328 nomad
134821 134821 TS       - S    524972 70708 php
134834 134834 TS       - R+   153324  1504 ps
141096 141096 TS       - Ssl  1144996 55060 dockerd
141262 141262 TS       - Ssl  10754192 8654884 nomad
143638 143638 TS       - S         0     0 kworker/9:1
145744 145744 TS       - S         0     0 kworker/3:0
147328 147328 TS       - Sl   1516708 15152 nomad
147381 147381 TS       - Sl   1516196 14716 nomad
147475 147475 TS       - Sl   1647844 14652 nomad
147510 147510 TS       - Sl   1647524 14656 nomad
147547 147547 TS       - Sl   1516196 14804 nomad
147584 147584 TS       - Sl   1647780 14728 nomad
147615 147615 TS       - Sl   1582564 15064 nomad
147694 147694 TS       - Sl   1451428 14640 nomad
147729 147729 TS       - Sl   1516708 14860 nomad
147851 147851 TS       - Sl   1516452 15040 nomad
147862 147862 TS       - Ssl  854044 116116 consul
147898 147898 TS       - Sl   1582244 14484 nomad
147984 147984 TS       - Sl   1450980 14664 nomad
148136 148136 TS       - Sl   1582244 15116 nomad
148200 148200 TS       - Sl   1517028 15480 nomad
148377 148377 TS       - Sl   1451428 14860 nomad
148480 148480 TS       - Sl   1582564 14432 nomad
148537 148537 TS       - Sl   1516452 15316 nomad
148568 148568 TS       - Sl   1451172 15472 nomad
148659 148659 TS       - Sl   1524904 15520 nomad
149012 149012 TS       - Sl   1590504 14908 nomad
149047 149047 TS       - Sl   1451428 14732 nomad
149127 149127 TS       - Sl   1451172 14828 nomad
149299 149299 TS       - Sl   1516516 14856 nomad
149447 149447 TS       - Sl   1516708 15236 nomad
149499 149499 TS       - Sl   1385892 15032 nomad
149667 149667 TS       - Sl   1516708 14596 nomad
149740 149740 TS       - Sl   1451236 14956 nomad
149776 149776 TS       - Sl   1582500 14612 nomad
149827 149827 TS       - Sl   1582500 14536 nomad
149887 149887 TS       - Sl   1581988 14540 nomad
150110 150110 TS       - Sl   1582500 15208 nomad
150185 150185 TS       - Sl   1451428 16460 nomad
150231 150231 TS       - Sl   1582500 15416 nomad
150322 150322 TS       - Sl   1525160 17252 nomad
150387 150387 TS       - Sl   1516964 15164 nomad
150459 150459 TS       - Sl   1582500 15336 nomad
150552 150552 TS       - Sl   1385892 15056 nomad
150585 150585 TS       - Sl   1647780 15200 nomad
150624 150624 TS       - Sl   1582500 14888 nomad
150686 150686 TS       - Sl   1451172 15280 nomad
150757 150757 TS       - Sl   1656232 15184 nomad
150805 150805 TS       - Sl   1451428 14668 nomad
150881 150881 TS       - Sl   1459688 17532 nomad
150924 150924 TS       - Sl   1459880 17212 nomad
150962 150962 TS       - Sl   1648356 15204 nomad
151027 151027 TS       - Sl   1516964 14588 nomad
151087 151087 TS       - Sl   1582244 14692 nomad
151128 151128 TS       - Sl   1582500 15032 nomad
151179 151179 TS       - Sl   1581988 15004 nomad
151263 151263 TS       - Sl   1525416 16052 nomad
151299 151299 TS       - Sl   1516964 15236 nomad
151358 151358 TS       - Sl   1647780 15100 nomad
151398 151398 TS       - Sl   1459688 17840 nomad
151439 151439 TS       - Sl   1451428 14852 nomad
151475 151475 TS       - Sl   1451236 15812 nomad
151518 151518 TS       - Sl   1516964 14984 nomad
151555 151555 TS       - Sl   1590696 14864 nomad
151649 151649 TS       - Sl   1516964 15260 nomad
151709 151709 TS       - Sl   1582500 15548 nomad
151779 151779 TS       - Sl   1582564 15340 nomad
151993 151993 TS       - Sl   1582244 15340 nomad
158337 158337 TS       - Sl   1582308 14656 nomad
160009 160009 TS       - S         0     0 kworker/6:1
183405 183405 TS       - Sl   1647588 14668 nomad
186937 186937 TS       - Sl   1451492 15012 nomad
186943 186943 TS       - Sl   1517220 15692 nomad
187000 187000 TS       - Sl   1451428 15464 nomad
187885 187885 TS       - Ssl  720648 14560 node_exporter

systemctl restart nomad fixes the problem temporarily. We re-enabled the daily cron restart.

@bubejur
Author

bubejur commented Dec 3, 2021

@tgross @notnoop Hi! Could you please help us?

@tgross
Member

tgross commented Dec 3, 2021

This looks like it's #9858 but that was supposed to have been fixed in 1.1.6, so it looks like we might have missed something. We may need to try to build custom binaries again with the patch we did in release-1.1.5...b-logmon-inspect and then dig into the memory profiles again.

Some options we have here:

  • If @anastazya and @bubejur are game to try to apply that patch themselves and run it on their infrastructure, we could look at those memory profiles.
  • We know from #9858 (high memory usage in logmon) that the jobs restart a lot because they're cron-like, so that may be enough for us to do a bench reproduction ourselves and examine the memory profiles.
  • If neither of those options works out, we could build the binaries for y'all again and take the same approach to diagnostics as we did in #9858 (high memory usage in logmon).

@notnoop
Contributor

notnoop commented Dec 4, 2021

Thanks @tgross for taking it on. I wanted to note that issues #9858 and #11261 focused on the logmon tasks. @bubejur The screenshot above highlights the nomad agent processes rather than logmon; can you confirm if you are still seeing high logmon usage as well?

I suspect that another memory leak exists on the agent side and that the pprof endpoint data will be sufficient without needing a custom binary.

@tgross
Member

tgross commented Dec 6, 2021

Thanks for pointing that out @notnoop. I want to back up a bit and get more specific about the symptoms and diagnostics, because the information provided is looking at two different things and there's not a lot of context for what's been given.

(1) The ps output shows that there are many many nomad processes, all but one of which will be either the shims or the logmon processes. If I grab one of your outputs and do echo $blah | grep nomad | sort -k7, then it looks like the patch to the logmon and shim processes is working as I'd have expected. There are 134 nomad processes, and only 1 of them is showing more than ~20MB of physical memory (RSS) in use. I think using up 20MB per logmon kind of stinks, but that's a known issue. The giant 6+GB process would be the Nomad agent. That means we don't need any more htop or ps dumps, because we've narrowed down the process in question. That also means this problem is not the same as the one in #9858. That's good!
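
For anyone who wants to repeat that filtering on their own dump, here is a rough Python sketch of the same idea. It assumes the column layout shown in the dumps above (PID TID CLS RTPRIO STAT VSZ RSS COMMAND, with RSS in KiB). The function name `nomad_rss` and the 25,000 KiB cut-off (a little above the ~20MB seen per shim/logmon) are illustrative choices, not anything Nomad provides.

```python
def nomad_rss(ps_text, threshold_kb=25_000):
    """Count `nomad` rows in ps output and list the ones above threshold_kb.

    Assumes whitespace-separated columns:
    PID TID CLS RTPRIO STAT VSZ RSS COMMAND (RSS reported in KiB).
    """
    count = 0
    offenders = []
    for line in ps_text.splitlines():
        fields = line.split()
        # Skip headers, blank lines, and anything that is not a nomad process.
        if len(fields) < 8 or fields[7] != "nomad" or not fields[6].isdigit():
            continue
        count += 1
        rss_kb = int(fields[6])
        if rss_kb > threshold_kb:
            offenders.append((int(fields[0]), rss_kb))
    return count, offenders


# A few rows copied from the dump above.
sample = """\
   PID    TID CLS RTPRIO STAT    VSZ   RSS COMMAND
 33956  33956 TS       - Ssl  1584368 20392 nomad
141096 141096 TS       - Ssl  1144996 57084 dockerd
141262 141262 TS       - Ssl  8976564 6887240 nomad
147328 147328 TS       - Sl   1516708 15088 nomad
"""

count, offenders = nomad_rss(sample)
print(f"{count} nomad processes, above threshold: {offenders}")
```

On a full dump, the single PID left in `offenders` would be the agent; everything else is a shim or logmon process.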

(2) The htop output in the screenshots shows a single Nomad agent process (and all its userland threads, which are the green rows). The RES column (~8.5GB) for the agent does seem high for a client.

The first question this raises for me: this agent is just a client and not also a server, right? A production client should not also be a server, because the server needs to use a lot of memory to hold the state store and that makes that memory unavailable for workloads.

Assuming that's correct, then @notnoop's suggestion to get pprof data is the next move. Set enable_debug = true, wait for the memory to get high, and then hit the following endpoints on that client (not the servers):

  • /debug/pprof/heap
  • /debug/pprof/goroutine
  • /debug/pprof/allocs

You can then bundle those files up into a tarball and send it to [email protected] with this issue number in the subject line. Alternately, you can run nomad operator debug -pprof-duration=10s -node-id=$node_id and upload that bundle instead, but it'll include server logs, etc.

Once that's done I'll acknowledge receipt on this issue and post back once I've done an analysis. Thanks!

@tgross self-assigned this Dec 6, 2021
@tgross changed the title from "Nomad high memory usage" to "Nomad client high memory usage" Dec 6, 2021
@bubejur
Author

bubejur commented Dec 10, 2021

@notnoop @tgross hello again! Can you describe in more detail what exactly I need to do? This issue is not about logmon; logmon is fine. Now nomad itself is taking a lot of memory... So please tell me what I need to do.

@tgross
Member

tgross commented Dec 10, 2021

@bubejur we've asked for two things:

A question we need the answer to: Is the Nomad agent running as both a server and a client? If so, then I would expect the memory usage to be very high as the server holds all the state. This is not a recommended production practice.

If the answer to that question is no...

What to do next

Set enable_debug = true in the client configuration, wait for the memory to get high, and then hit the following endpoints on that client (not the servers). That means finding the client's IP address, and visiting these addresses in your browser:

  • ${address}:4646/debug/pprof/heap
  • ${address}:4646/debug/pprof/goroutine
  • ${address}:4646/debug/pprof/allocs

Those endpoints will cause a file to be downloaded. You can then bundle those files up into a tarball and send it to [email protected] with this issue number in the subject line.
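
If a browser is not convenient, the download and tarball steps can also be scripted. The following is only a sketch under a few assumptions: the client's HTTP API is reachable over plain HTTP (no TLS, no ACL token) on port 4646, and the helper names (`profile_url`, `download`, `bundle_profiles`) and the `.pprof` file suffix are made up for this example.

```python
import io
import tarfile
import urllib.request

# The three endpoints listed above.
PROFILES = ("heap", "goroutine", "allocs")


def profile_url(address, name, port=4646):
    """Build the pprof URL for one profile (assumes plain HTTP)."""
    return f"http://{address}:{port}/debug/pprof/{name}"


def download(url):
    """Fetch one profile and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=60) as resp:
        return resp.read()


def bundle_profiles(address, out_path, fetch=download):
    """Download every profile from the client and write them to a .tar.gz."""
    with tarfile.open(out_path, "w:gz") as tar:
        for name in PROFILES:
            data = fetch(profile_url(address, name))
            info = tarfile.TarInfo(name=f"{name}.pprof")
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return out_path
```

Calling `bundle_profiles("<client address>", "pprof.tar.gz")` on a machine that can reach the client would produce the tarball to attach to the email. The `fetch` parameter exists only so the bundling logic can be exercised without a live agent.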

@bubejur
Author

bubejur commented Dec 13, 2021

Ok. Done, waiting for high memory usage!

@tgross
Member

tgross commented Dec 13, 2021

@bubejur we're still looking for the answer from you about the first question:

Is the Nomad agent running as both a server and a client?

@bubejur
Author

bubejur commented Dec 13, 2021

@bubejur we're still looking for the answer from you about the first question:

Is the Nomad agent running as both a server and a client?

yes, you're right

root       997 21.4 10.2 4929000 1248172 ?     Ssl  Nov16 8471:31 /usr/local/bin/nomad agent -config=/etc/nomad

this is server side

@tgross
Member

tgross commented Dec 13, 2021

Sorry, maybe I'm not communicating well. What I'm asking is if the agent that's using a lot of memory is configured for both server { enabled = true } and client { enabled = true }.

@bubejur
Author

bubejur commented Dec 14, 2021

@tgross
server side:

    "client": {
        "enabled": false
    },
    "server": {
        "enabled": true
    }

client side:

    "client": {
        "enabled": true
    },
    "server": {
        "enabled": false
    }

@bubejur
Author

bubejur commented Dec 14, 2021

Also sent an email to [email protected] with the tarball!

@tgross
Member

tgross commented Dec 14, 2021

Thanks @bubejur. I took a look at that heap profile with go tool pprof -http localhost:8001 ./heap and it looks like the biggest offenders in memory usage are the golang runtime and the joincontext objects in the task runner. I can't upload an SVG into GitHub, but here's a screenshot of the heap profile showing those two items:

Screen Shot 2021-12-14 at 4 56 34 PM

My suspicion is that we're either leaking a goroutine or that we're leaking these join context objects in the prestart tasks. It might help if you ran the same profiles on the same machine again now, because that would tell us which section was growing.

@bubejur
Author

bubejur commented Dec 15, 2021

Oh... thanks. I have only two nodes with those tasks. I stopped one of them, so all tasks are now running only on the first node. I also restarted the nomad agent on it. I'll tell you more this evening.

@tgross
Member

tgross commented Dec 15, 2021

@bubejur what I was really looking for there was growth of the same process memory over time, so if you could get the heap dump from the agent, and then get it again from the same process some time later (say 4 or 5 hours), that'd be useful.

@bubejur
Author

bubejur commented Dec 15, 2021

@tgross sent you the 1st part of the files. I'll send you the 2nd part tomorrow morning.

@bubejur
Author

bubejur commented Dec 16, 2021

Morning! Sent you 2nd part of files

@anastazya

@tgross - I sent an e-mail containing dumps before OOM and dumps just as the server was going into swap. Hope they help !

@tgross
Member

tgross commented Dec 20, 2021

Thanks. I've been swamped looking into another issue the last few days but I hope to look at those dumps soon.

@tgross
Member

tgross commented Dec 21, 2021

Ok, looks pretty clear that we're leaking those join contexts. Those values keep climbing and all the associated goroutines are parked. I'll see if I can work up a reproduction that doesn't involve having to run a cluster for an extended period of time and get back to you folks once I've done so.

@tgross
Member

tgross commented Dec 22, 2021

Dropping some more detail on my comment above. The most recent goroutine dump shows that we've got 1532687 goroutines and 1529664 of them are in the joincontext select found at context.go#L73 (this is up from ~520k goroutines in the profile from 4 days prior). While we'd expect there to be roughly one of these in the taskrunner for each task running, presumably we don't have ~1.5M tasks running on this box! 😀

      File: nomad
Build ID: 249aa8299e799c4a2eec2fd25d12ecccd1fc0faf
Type: goroutine
Time: Dec 20, 2021 at 2:39am (EST)
Showing nodes accounting for 1534138, 100% of 1534138 total
----------------------------------------------------------+-------------
      flat  flat%   sum%        cum   cum%   calls calls% + context 	 	 
----------------------------------------------------------+-------------
                                           1532687 99.91% |   runtime.selectgo runtime/select.go:327
                                               669 0.044% |   runtime.netpollblock runtime/netpoll.go:438
                                               551 0.036% |   runtime.chanrecv runtime/chan.go:576
                                               225 0.015% |   runtime.goparkunlock runtime/proc.go:342
                                                 1 6.5e-05% |   time.Sleep runtime/time.go:193
   1534133   100%   100%    1534133   100%                | runtime.gopark runtime/proc.go:336
----------------------------------------------------------+-------------
...
         0     0%   100%    1529664 99.71%                | github.com/LK4D4/joincontext.(*joinContext).run github.com/LK4D4/[email protected]/context.go:73
                                           1529664   100% |   runtime.selectgo runtime/select.go:327
...

The heap space likewise shows that we've got a bunch of memory being used by the runtime (to be expected b/c of all those goroutines) and 236+MB of space from allocating new joincontexts at context.go#L32, up from ~72MB in the profile 4 days prior:

      File: nomad
Build ID: 249aa8299e799c4a2eec2fd25d12ecccd1fc0faf
Type: inuse_space
Time: Dec 20, 2021 at 2:38am (EST)
Showing nodes accounting for 1305.07MB, 100% of 1305.07MB total
----------------------------------------------------------+-------------
      flat  flat%   sum%        cum   cum%   calls calls% + context 	 	 
----------------------------------------------------------+-------------
                                          546.20MB   100% |   runtime.newproc1 runtime/proc.go:4065
  546.20MB 41.85% 41.85%   546.20MB 41.85%                | runtime.malg runtime/proc.go:3988
----------------------------------------------------------+-------------
                                          236.02MB   100% |   github.com/hashicorp/nomad/client/allocrunner/taskrunner.(*TaskRunner).prestart github.com/hashicorp/nomad/client/allocrunner/taskrunner/task_runner_hooks.go:240
  236.02MB 18.08% 59.94%   236.02MB 18.08%                | github.com/LK4D4/joincontext.Join github.com/LK4D4/[email protected]/context.go:32
----------------------------------------------------------+-------------

I've been digging thru the code and I'm still trying to work up a reproduction, but I feel pretty confident that joincontext is the object being leaked at this point.

@tgross
Member

tgross commented Dec 22, 2021

Ok, I've got a working reproduction here and it looks like we needed to make sure we're canceling the join context even though I'd have expected the parent contexts to have done that. I've got a patch below that fixes the leak (also on branch b-leak-joincontext), but I want to make sure we're not breaking anything else by doing this (in particular client shutdown; this code was added in #6395 to fix #6382).

We also have this exact same pattern at drivers/client.go#L87 and device/client.go#L103, so we're most likely leaking contexts in the plugin clients too, on fingerprinting.

I'm getting towards the end of my day here, but I'll pick this back up again tomorrow. Thanks for your patience @anastazya and @bubejur!

$ git diff client/allocrunner/taskrunner/task_runner_hooks.go
diff --git a/client/allocrunner/taskrunner/task_runner_hooks.go b/client/allocrunner/taskrunner/task_runner_hooks.go
index c9ff34441..72a509e05 100644
--- a/client/allocrunner/taskrunner/task_runner_hooks.go
+++ b/client/allocrunner/taskrunner/task_runner_hooks.go
@@ -237,7 +237,9 @@ func (tr *TaskRunner) prestart() error {
                // Run the prestart hook
                // use a joint context to allow any blocking pre-start hooks
                // to be canceled by either killCtx or shutdownCtx
-               joinedCtx, _ := joincontext.Join(tr.killCtx, tr.shutdownCtx)
+               joinedCtx, joinedCancel := joincontext.Join(tr.killCtx, tr.shutdownCtx)
+               defer joinedCancel()
+
                var resp interfaces.TaskPrestartResponse
                if err := pre.Prestart(joinedCtx, &req, &resp); err != nil {
                        tr.emitHookError(err, name)

Reproduction

I took this batch job:

jobspec
job "example" {
  type        = "batch"
  datacenters = ["dc1"]

  parameterized {
    payload = "required"
  }

  group "group" {
    task "task" {

      driver = "docker"

      config {
        image   = "busybox:1"
        command = "/bin/sh"
        args    = ["-c", "cat local/payload.txt; sleep 1"]
      }

      dispatch_payload {
        file = "local/payload.txt"
      }

      resources {
        cpu    = 64
        memory = 64
      }

    }
  }
}

Run this job and dispatch it a few hundred times by leaving this loop running for a while: while true; do nomad job dispatch example ./payload.json; sleep 2; done. Wait for all the tasks to complete. I then use go tool pprof -http localhost:8001 "http://localhost:4646/debug/pprof/goroutine" and see hundreds of joincontext goroutines floating around:

      File: nomad
Build ID: 43a315242ed58014f4fe60b1071ff22f69e145b4
Type: goroutine
Time: Dec 22, 2021 at 4:17pm (EST)
Showing nodes accounting for 544, 100% of 544 total
----------------------------------------------------------+-------------
      flat  flat%   sum%        cum   cum%   calls calls% + context
----------------------------------------------------------+-------------
                                               521 96.30% |   runtime.selectgo runtime/select.go:327
                                                10  1.85% |   runtime.netpollblock runtime/netpoll.go:454
                                                 9  1.66% |   runtime.goparkunlock runtime/proc.go:372
                                                 1  0.18% |   time.Sleep runtime/time.go:193
       541 99.45% 99.45%        541 99.45%                | runtime.gopark runtime/proc.go:366
...
----------------------------------------------------------+-------------
         0     0%   100%        384 70.59%                | github.com/LK4D4/joincontext.(*joinContext).run github.com/LK4D4/[email protected]/context.go:73
                                               384   100% |   runtime.selectgo runtime/select.go:327

before

Apply the patch and run again. Once all the tasks have completed, there are no more joincontexts floating around:

      File: nomad
Build ID: c0c30bd205428c7ce7acdbf69c76f490526b518e
Type: goroutine
Time: Dec 22, 2021 at 4:21pm (EST)
Showing nodes accounting for 162, 100% of 162 total
----------------------------------------------------------+-------------
      flat  flat%   sum%        cum   cum%   calls calls% + context
----------------------------------------------------------+-------------
                                               138 86.79% |   runtime.selectgo runtime/select.go:327
                                                11  6.92% |   runtime.netpollblock runtime/netpoll.go:454
                                                 9  5.66% |   runtime.goparkunlock runtime/proc.go:372
                                                 1  0.63% |   time.Sleep runtime/time.go:193
       159 98.15% 98.15%        159 98.15%                | runtime.gopark runtime/proc.go:366
----------------------------------------------------------+-------------

after

@tgross
Member

tgross commented Dec 22, 2021

Going to add an addendum (mostly for my own notes) that I suspect the reason we're seeing this so easily with batch jobs is that they exit on their own, rather than being shut down by the client. Service jobs run until they crash (not frequent) or until the server stops them, which cancels one of the two contexts that make up the joined context. So I suspect we're not closing out the contexts at the end of the task runner run like we should be; I'll test that theory tomorrow.

@bubejur
Author

bubejur commented Dec 23, 2021

@tgross that looks great! Thanks a lot for your time!

@tgross
Member

tgross commented Dec 23, 2021

A small program that closely matches how Nomad's taskrunner is set up demonstrates the problem:

main.go
package main

import (
	"context"
	"fmt"
	"log"
	"net/http"
	_ "net/http/pprof"
	"os"
	"os/signal"
	"runtime"
	"syscall"
	"time"

	"github.com/LK4D4/joincontext"
)

const numTaskRunners = 100

// you can play around with batchTime vs pauseTime to take
// a goroutine dump at various stages to see the behavior
const batchTime = 30
const pauseTime = 5

func main() {
	go func() {
		log.Println(http.ListenAndServe("localhost:6060", nil))
	}()

	shutdownCtx, shutdownFn := context.WithCancel(context.Background())

	go trap(shutdownFn)

	serviceRunners := []*TaskRunner{}

	fmt.Println("starting runners")
	for i := 0; i < numTaskRunners; i++ {
		r := NewTaskRunner(shutdownCtx, "service")
		serviceRunners = append(serviceRunners, r)
		go r.Run()
	}
	for i := 0; i < numTaskRunners; i++ {
		r := NewTaskRunner(shutdownCtx, "batch")
		go r.Run()
	}

	// make sure we have at least one runner going so that we can
	// expect to see exactly 1 goroutine for joincontext when we're
	// done
	waiter := NewTaskRunner(shutdownCtx, "service")
	go waiter.Run()
	time.Sleep(pauseTime * time.Second)

	fmt.Println("stopping runners")
	for _, r := range serviceRunners {
		r.Stop()
	}

	runtime.GC()
	<-shutdownCtx.Done()
}

func trap(shutdownFn context.CancelFunc) {
	c := make(chan os.Signal, 1)
	signal.Notify(c, syscall.SIGINT)
	<-c
	shutdownFn()
}

type TaskRunner struct {
	taskType string

	preStartHooksRun int
	shutdownCtx      context.Context
	killCtx          context.Context
	killCancelFn     context.CancelFunc
	waitCh           chan struct{}
}

func NewTaskRunner(shutdownCtx context.Context, taskType string) *TaskRunner {
	ctx, cancel := context.WithCancel(context.Background())
	return &TaskRunner{
		taskType:     taskType,
		shutdownCtx:  shutdownCtx,
		killCtx:      ctx,
		killCancelFn: cancel,
		waitCh:       make(chan struct{}),
	}
}

func (tr *TaskRunner) Run() {
	joinedCtx, joinCancel := joincontext.Join(tr.killCtx, tr.shutdownCtx)
	// NOTE: leave this out and we'll end up with 100
	// leaked join contexts from the batch job
	defer joinCancel()

	for i := 0; i < 10; i++ {
		tr.prestartHook(joinedCtx)
	}
	go tr.runTask()
}

func (tr *TaskRunner) prestartHook(ctx context.Context) {
	tr.preStartHooksRun++
}

// runTask simulates the task plugin interface
func (tr *TaskRunner) runTask() {

	if tr.taskType == "batch" {
		go func() {
			<-time.After(batchTime * time.Second)
			close(tr.waitCh)
		}()
	}

	select {
	case <-tr.killCtx.Done():
	case <-tr.shutdownCtx.Done():
	case <-tr.waitCh:

	}
}

func (tr *TaskRunner) Stop() {
	tr.killCancelFn()
}

func (tr *TaskRunner) Kill(ctx context.Context) error {

	tr.killCancelFn()

	select {
	case <-tr.waitCh:
	case <-ctx.Done():
		return ctx.Err()
	}

	return nil
}

@tgross
Member

tgross commented Dec 23, 2021

#11741 is up with that patch and should go out with the next scheduled release (shortly after the new year), but it'd be a safe patch to backport to your own build if you want to land a fix in your environment before that.

@bubejur
Author

bubejur commented Jan 10, 2022

Thanks a lot! Will be waiting for new release!

@bubejur
Author

bubejur commented Jan 25, 2022

@notnoop @tgross hi guys! I updated to 1.2.4 but hit another issue with Consul templating:

Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:53.234Z [INFO]  client.alloc_runner.task_runner.task_hook.logmon.nomad: opening fifo: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon path=/data/nomad/alloc/a7c04d65-2f29-c778-c34c-2513d29f25f4/alloc/logs/.worker-mpi-resolver.stdout.fifo timestamp=2022-01-25T08:44:53.234Z
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad[4342]: client.alloc_runner.task_runner.task_hook.logmon.nomad: opening fifo: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon path=/data/nomad/alloc/a7c04d65-2f29-c778-c34c-2513d29f25f4/alloc/logs/.worker-mpi-resolver.stdout.fifo timestamp=2022-01-25T08:44:53.234Z
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:53.234Z [INFO]  client.alloc_runner.task_runner.task_hook.logmon.nomad: opening fifo: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon path=/data/nomad/alloc/a7c04d65-2f29-c778-c34c-2513d29f25f4/alloc/logs/.worker-mpi-resolver.stderr.fifo timestamp=2022-01-25T08:44:53.234Z
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad[4342]: client.alloc_runner.task_runner.task_hook.logmon.nomad: opening fifo: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon path=/data/nomad/alloc/a7c04d65-2f29-c778-c34c-2513d29f25f4/alloc/logs/.worker-mpi-resolver.stderr.fifo timestamp=2022-01-25T08:44:53.234Z
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:53.965Z [INFO]  agent: (runner) creating new runner (dry: false, once: false)
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad[4342]: agent: (runner) creating new runner (dry: false, once: false)
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:53.966Z [INFO]  agent: (runner) creating watcher
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:53.966Z [INFO]  agent: (runner) starting
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad[4342]: agent: (runner) creating watcher
Jan 25 08:44:53 microworker03.te01-shr.nl3 nomad[4342]: agent: (runner) starting
Jan 25 08:44:54 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:54.307Z [INFO]  client.gc: marking allocation for GC: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4
Jan 25 08:44:54 microworker03.te01-shr.nl3 nomad[4342]: client.gc: marking allocation for GC: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4
Jan 25 08:44:58 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:58.309Z [WARN]  client.alloc_runner.task_runner.task_hook.logmon.nomad: timed out waiting for read-side of process output pipe to close: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon timestamp=2022-01-25T08:44:58.309Z
Jan 25 08:44:58 microworker03.te01-shr.nl3 nomad[4342]: client.alloc_runner.task_runner.task_hook.logmon.nomad: timed out waiting for read-side of process output pipe to close: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon timestamp=2022-01-25T08:44:58.309Z
Jan 25 08:44:58 microworker03.te01-shr.nl3 nomad: 2022-01-25T08:44:58.309Z [WARN]  client.alloc_runner.task_runner.task_hook.logmon.nomad: timed out waiting for read-side of process output pipe to close: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon timestamp=2022-01-25T08:44:58.309Z
Jan 25 08:44:58 microworker03.te01-shr.nl3 nomad[4342]: client.alloc_runner.task_runner.task_hook.logmon.nomad: timed out waiting for read-side of process output pipe to close: alloc_id=a7c04d65-2f29-c778-c34c-2513d29f25f4 task=worker-mpi-resolver @module=logmon timestamp=2022-01-25T08:44:58.309Z

Nomad side:

Template failed: /data/nomad/alloc/3a20b272-9965-8c1f-6ab0-c841e303b623/worker-mpi-resolver/local/platformConfig/nl3.tmpl: execute: template: :1:36: executing "" at <plugin "/data/tools/consul.php">: error calling plugin: function is disabled
--



@bubejur
Author

bubejur commented Jan 25, 2022

image

@github-actions

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Oct 12, 2022