Use USS (unique set size) instead of PSS for all the things #16570

jrafanie · 2017-11-30T20:54:44Z

Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1479356

Builds on top of #16569 [MERGED]

- Change worker validation to check USS, not PSS
- Show USS instead of PSS in rake evm:status
- Also log the server/worker unique set size (USS)

The first bullet point should alleviate the false positives where a large server process caused newly forked workers to exceed the PSS memory thresholds very quickly. If the server process grows over time, that's a separate problem. We shouldn't flag and kill workers that merely inherited a large amount of shared memory, only ones that grow beyond the thresholds on their own.
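To make the false-positive case concrete, here is a minimal sketch of a threshold check run against both metrics. It is not the ManageIQ validation code: `WorkerSample`, `exceeds_threshold?`, the 400 MB threshold, and the sample numbers are all hypothetical and chosen only for illustration.

```ruby
# Hypothetical snapshot of one worker's memory usage, in bytes.
WorkerSample = Struct.new(:name, :pss, :uss)

MEGABYTE  = 1024 * 1024
THRESHOLD = 400 * MEGABYTE # illustrative limit, not a real default

# Hypothetical check: true when the chosen metric (:pss or :uss) is over the limit.
def exceeds_threshold?(sample, threshold, metric)
  sample.public_send(metric) > threshold
end

# Freshly forked worker: charged a big share of the parent's pages, little private memory.
FRESH = WorkerSample.new("fresh worker", 450 * MEGABYTE, 60 * MEGABYTE)
# Worker whose own allocations have grown past the limit.
LEAKY = WorkerSample.new("leaky worker", 700 * MEGABYTE, 650 * MEGABYTE)

[FRESH, LEAKY].each do |worker|
  by_pss = exceeds_threshold?(worker, THRESHOLD, :pss)
  by_uss = exceeds_threshold?(worker, THRESHOLD, :uss)
  puts "#{worker.name}: flagged by PSS=#{by_pss}, flagged by USS=#{by_uss}"
end
```

With these made-up numbers, the fresh worker trips the PSS check without having done any work, while the USS check flags only the worker that grew on its own.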

For prior versions such as Euwe and Fine, we can accomplish a similar result by storing the USS in the existing PSS column (see the "Store unique set size (USS) in the PSS column" PRs referenced in this thread).

jrafanie · 2017-11-30T21:18:55Z

@miq-bot add_label gaprindashvili/yes
@miq-bot add_label performance
@miq-bot add_label core/workers

jrafanie · 2017-12-01T19:02:12Z

@gtanzillo @Fryguy this is ready for review now that the prior PR is merged. cc @dmetzger57 @NickLaMuro

carbonin

Looks good to me ... any reason this can't go in?

@jrafanie does this address a specific bug or is this a general fix?

jrafanie · 2017-12-15T21:53:04Z

Good point, let me put a BZ link @carbonin

https://bugzilla.redhat.com/show_bug.cgi?id=1479356

Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1479356

Why? USS is a more reliable mechanism for tracking workers with runaway memory growth. PSS works well until the server process that forks new workers grows large. Each newly forked worker is then charged a proportional share of the parent process's large memory and therefore starts with a large PSS, possibly exceeding our limits before doing any work. USS measures only a process's private memory, so it is a better indicator that a process is itself allocating too much memory without freeing it.
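For reference, on Linux both numbers can be derived from `/proc/<pid>/smaps`: PSS is the sum of the `Pss` fields, and USS is the sum of `Private_Clean` and `Private_Dirty`. The sketch below shows that arithmetic on a made-up smaps excerpt. `smaps_totals` and `SAMPLE_SMAPS` are hypothetical names for illustration, and this is not necessarily how ManageIQ collects the values.

```ruby
# Sum the PSS and USS (in kB) from the text of a /proc/<pid>/smaps file.
def smaps_totals(smaps_text)
  totals = Hash.new(0)
  smaps_text.each_line do |line|
    # Lines of interest look like "Pss:                 100 kB".
    match = line.match(/\A(Pss|Private_Clean|Private_Dirty):\s+(\d+)\s+kB/)
    next unless match
    totals[match[1]] += match[2].to_i
  end

  {
    :pss_kb => totals["Pss"],
    # USS counts only pages that no other process has mapped.
    :uss_kb => totals["Private_Clean"] + totals["Private_Dirty"]
  }
end

# Made-up excerpt: one mapping shared with other processes, one fully private.
SAMPLE_SMAPS = <<~SMAPS
  00400000-00452000 r-xp 00000000 08:02 173521 /usr/bin/ruby
  Size:                328 kB
  Rss:                 300 kB
  Pss:                 100 kB
  Shared_Clean:        300 kB
  Shared_Dirty:          0 kB
  Private_Clean:         0 kB
  Private_Dirty:         0 kB
  7f0000000000-7f0000100000 rw-p 00000000 00:00 0
  Size:               1024 kB
  Rss:                 512 kB
  Pss:                 512 kB
  Shared_Clean:          0 kB
  Shared_Dirty:          0 kB
  Private_Clean:        12 kB
  Private_Dirty:       500 kB
SMAPS

p smaps_totals(SAMPLE_SMAPS)
```

On a Linux host the same helper could be pointed at a live process, for example `smaps_totals(File.read("/proc/#{Process.pid}/smaps"))`, assuming the caller has permission to read that file.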

miq-bot · 2017-12-15T21:58:24Z

Checked commits jrafanie/manageiq@2d75877~...ef84a88 with ruby 2.3.3, rubocop 0.47.1, haml-lint 0.20.0, and yamllint 1.10.0
4 files checked, 0 offenses detected
Everything looks fine. 🍪

Use USS (unique set size) instead of PSS for all the things
(cherry picked from commit eee0570)
Fixes https://bugzilla.redhat.com/show_bug.cgi?id=1527093

simaishi · 2017-12-18T14:43:38Z

Gaprindashvili backport details:

$ git log -1
commit 0c79e86b72da95bde62b998f274d3080080f0f58
Author: Nick Carboni <[email protected]>
Date:   Fri Dec 15 17:20:56 2017 -0500

    Merge pull request #16570 from jrafanie/USS_all_the_things
    
    Use USS (unique set size) instead of PSS for all the things
    (cherry picked from commit eee05705f47ca91d3785dc4caed3fd7188f5817c)
    
    Fixes https://bugzilla.redhat.com/show_bug.cgi?id=1527093

miq-bot added the wip label Nov 30, 2017

jrafanie force-pushed the USS_all_the_things branch from 2100ee9 to 76db86c Compare November 30, 2017 21:17

miq-bot added gaprindashvili/yes performance core/workers labels Nov 30, 2017

This was referenced Nov 30, 2017

[GAPRINDASHVILI][POC] Store unique set size (USS) in the PSS column ManageIQ/manageiq-gems-pending#314

Closed

[POC] Store unique set size (USS) in the PSS column ManageIQ/manageiq-gems-pending#312

Closed

jrafanie closed this Dec 1, 2017

jrafanie reopened this Dec 1, 2017

jrafanie changed the title ~~[WIP] Use USS (unique set size) instead of PSS for all the things~~ Use USS (unique set size) instead of PSS for all the things Dec 1, 2017

jrafanie force-pushed the USS_all_the_things branch from 76db86c to 14d890a Compare December 1, 2017 19:01

jrafanie removed the wip label Dec 1, 2017

This was referenced Dec 14, 2017

[EUWE] [POC] Store unique set size (USS) in the PSS column #16480

Merged

[FINE][POC] Store unique set size (USS) in the PSS column ManageIQ/manageiq-gems-pending#313

Merged

carbonin approved these changes Dec 15, 2017

jrafanie added 2 commits December 15, 2017 16:55

Also log the server/worker unique set size (USS)

2d75877

https://bugzilla.redhat.com/show_bug.cgi?id=1479356

Show USS instead of PSS in rake evm:status

eb028e0

https://bugzilla.redhat.com/show_bug.cgi?id=1479356

jrafanie force-pushed the USS_all_the_things branch from 14d890a to 04c8995 Compare December 15, 2017 21:55

jrafanie force-pushed the USS_all_the_things branch from 04c8995 to ef84a88 Compare December 15, 2017 21:55

carbonin self-assigned this Dec 15, 2017

carbonin added the bug label Dec 15, 2017

carbonin added this to the Sprint 76 Ending Jan 1, 2018 milestone Dec 15, 2017

carbonin merged commit eee0570 into ManageIQ:master Dec 15, 2017

simaishi added gaprindashvili/backported and removed gaprindashvili/yes labels Dec 18, 2017

jrafanie deleted the USS_all_the_things branch January 24, 2018 20:37
