
feat: configure flavors in Nova automatically #499

Open · skrobul wants to merge 19 commits into base: main from nova-flavor-monitor

Conversation

@skrobul (Collaborator) commented Nov 19, 2024

This PR adds a small operator-style application that monitors a directory of YAML machine-flavor specifications. Whenever any file in the linked repository changes, the application parses all the specs in the directory, then compares them against the flavors configured in Nova and reconciles any differences.

Please note that this will NOT delete (prune) flavors that have been removed from the repository.

Closes https://rackspace.atlassian.net/browse/PUC-598
Supersedes #454

Merge with https://github.com/RSS-Engineering/undercloud-deploy/pull/224
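
For illustration only, the monitor-and-reconcile loop described above could look roughly like the sketch below (all names are assumptions, not the actual code in this PR; requires PyYAML):

    import time
    from pathlib import Path
    from typing import Callable

    import yaml

    def load_specs(spec_dir: Path) -> dict:
        """Parse every YAML flavor spec in the directory."""
        specs = {}
        for path in sorted(spec_dir.glob("*.yaml")):
            spec = yaml.safe_load(path.read_text())
            specs[spec["name"]] = spec  # 'name' key is an assumed field
        return specs

    def watch(spec_dir: Path, reconcile: Callable[[dict], None],
              interval: float = 30.0) -> None:
        """Poll the spec directory; on any change, parse all specs and reconcile."""
        last_seen = None
        while True:
            snapshot = {p: p.stat().st_mtime for p in spec_dir.glob("*.yaml")}
            if snapshot != last_seen:
                reconcile(load_specs(spec_dir))  # removed flavors are not pruned
                last_seen = snapshot
            time.sleep(interval)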

@skrobul force-pushed the nova-flavor-monitor branch 9 times, most recently from 72c0159 to 87d585e on November 20, 2024 16:15
@skrobul changed the title from "WIP: Nova flavor monitor" to "feat: configure flavors in Nova automatically" on Nov 20, 2024
@skrobul marked this pull request as ready for review on November 20, 2024 16:17
@skrobul requested a review from a team on November 20, 2024 16:26
@cardoe (Contributor) left a comment

I'm still wondering: why not just use Ansible, with a playbook that manages a flavor and takes the flavor spec as input?

Comment on lines 22 to 25
# create 'flavorsync' user to allow synchronization of the flavors to nova
openstack user create --or-show --domain service --password abcd1234 flavorsync
openstack role create --or-show --domain service flavorsync
openstack role add --user-domain service --user flavorsync flavorsync
@cardoe (Contributor):

We really shouldn't be creating random users with random passwords. We should be creating application credentials for our service account.
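
For reference, the application-credential approach could look roughly like this (a hedged sketch; the credential and role names are illustrative, and an application credential is created by the service user itself while authenticated as that user):

    # while authenticated as the flavorsync service user, create an
    # application credential carrying the flavorsync role instead of
    # relying on a static password
    openstack application credential create --role flavorsync flavorsync-cred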

@skrobul (Collaborator, Author) replied:

Agreed. This is just an example that is supposed to be overridden in the deploy repository (just as it is for the argoworkflow and monitoring users above and below).

Comment on lines +2 to +10

# This section needs to be repeated in child images
ARG APP_PATH=/app
ARG APP_USER=appuser
ARG APP_GROUP=appgroup
ARG APP_USER_UID=1000
ARG APP_GROUP_GID=1000


@cardoe (Contributor):

This just feels really heavy in every container. Can we not just do:

        securityContext:
          allowPrivilegeEscalation: false
          capabilities:
            drop:
            - ALL
          readOnlyRootFilesystem: true
          runAsNonRoot: true

for the pod?

If we really wanted to be slim, we could use distroless: https://github.com/GoogleContainerTools/distroless/blob/main/examples/python3/Dockerfile

@skrobul (Collaborator, Author) commented Nov 21, 2024:

> This just feels really heavy in every container.

If the verbosity is the problem, we can remove some of these by hardcoding the values in the commands. However, keeping them as arguments means no "shotgun surgery" is needed when the UID has to be adjusted or a path changed.

> Can we not just do:
>
>         securityContext:
>           allowPrivilegeEscalation: false
>           capabilities:
>             drop:
>             - ALL
>           readOnlyRootFilesystem: true
>           runAsNonRoot: true
>
> for the pod?

Configuring securityContext in Kubernetes and setting user/group IDs in a Dockerfile are two complementary, but not identical things.

If we don't declare USER in a Dockerfile, the container defaults to uid=0 (root). If you then configure a Pod or container securityContext with runAsNonRoot: true and use such an image, the kubelet will simply refuse to start it.

In theory, you can extend your securityContext with runAsUser: 1001 and the container will start, but that brings another set of problems (see the sketch after this list):

  • Application files inside the container are still owned by root, because you have not configured the USER during the build. This can be especially problematic for files that need to be writable (pids, logs, caches, etc.).
  • Processes running inside the container run with the EUID of a user that does not exist in the system from their point of view.
  • Files owned by the specified UID will not have a corresponding username displayed, causing confusion during debugging or file inspection.
  • Commands like whoami and tools relying on user information might fail or return errors, leading to broken workflows or logging issues.
  • Logs may show only numeric UIDs instead of meaningful usernames, complicating troubleshooting and security audits.
  • If the same image is started in development on a developer's laptop, it will start as root, which could lead to a false sense of things working until they are pushed to Kubernetes.
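
To make the build-time side concrete, here is a minimal Dockerfile sketch of the pattern under discussion, reusing the ARG names from the snippet above (illustrative only; the actual Dockerfiles in this PR may differ):

    FROM python:3.12-alpine

    # This section needs to be repeated in child images
    ARG APP_PATH=/app
    ARG APP_USER=appuser
    ARG APP_GROUP=appgroup
    ARG APP_USER_UID=1000
    ARG APP_GROUP_GID=1000

    # Create the group and user at build time so files get an owner with a
    # real name rather than a bare numeric UID.
    RUN addgroup -g "${APP_GROUP_GID}" "${APP_GROUP}" \
     && adduser -D -u "${APP_USER_UID}" -G "${APP_GROUP}" -h "${APP_PATH}" "${APP_USER}"

    WORKDIR ${APP_PATH}
    COPY --chown=${APP_USER}:${APP_GROUP} . ${APP_PATH}

    # With USER set here, runAsNonRoot: true is satisfied, and the image is
    # also non-root outside Kubernetes (e.g. on a developer's laptop).
    USER ${APP_USER}

The pod-level securityContext then complements this rather than replacing it.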

> If we really wanted to be slim, we could use distroless: https://github.com/GoogleContainerTools/distroless/blob/main/examples/python3/Dockerfile

The distroless images are actually slightly larger than the Alpine ones: python:3.12.2-alpine3.19 is 51.8 MB and gcr.io/distroless/python3 is 52.8 MB. The Alpine image also includes at least some debugging tools and has a shell. Having said that, we can certainly look into this further and maybe use the distroless ones for production, as in theory they provide better security.

@cardoe (Contributor) replied:

So I think I'd be more okay with it if we pinned ourselves to something rather than making it variable and having to carry that around; then we'd do the security piece and runAsUser: 1001.

@skrobul (Collaborator, Author) replied:

I'm happy to work on that change, but it will not be part of this PR, as it requires changes to multiple Dockerfiles that are unrelated to this feature.

@skrobul (Collaborator, Author) replied:

Opened #515 - if it gets merged before this one, I'll adjust the code as necessary

@skrobul (Collaborator, Author) commented Nov 21, 2024

> I'm still wondering: why not just use Ansible, with a playbook that manages a flavor and takes the flavor spec as input?

OK, that's a fair question - let me break it down. At the moment the flavor specifications are presented as a collection of YAML files in a specific structured format.
For each of those files, we have to execute some business logic to make them usable (a sketch follows the list below). Some of those tasks are:

  • parsing the input and validating that a given flavor is defined correctly
  • filtering out the records that are not pertinent to a given environment
  • performing various unit conversions on the data itself. For example, the specs define the memory size in gigabytes, but Nova expects it in mebibytes and the BMC/Redfish on the baremetal node reports it in megabytes.
  • deriving context-dependent names for each flavor. For example, the specification will say nonprod.gp2.small, but in Nova this needs a resource class of resources:CUSTOM_BAREMETAL_GP2SMALL and in Ironic it has to be baremetal.gp2small.
  • deleting and recreating flavors on update, since the Nova API does not support an 'update' operation (unless you are only updating the description). It's best to do that only when actually needed.
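
To make that concrete, here is a rough Python sketch of the conversions and the delete-and-recreate step using openstacksdk (all field names and the name mappings are assumptions for illustration, not the actual implementation in this PR):

    import openstack

    def strip_env(spec_name: str) -> str:
        # "nonprod.gp2.small" -> "gp2.small" (environment prefix assumed)
        return spec_name.split(".", 1)[1]

    def nova_resource_class(spec_name: str) -> str:
        # "nonprod.gp2.small" -> "resources:CUSTOM_BAREMETAL_GP2SMALL"
        return "resources:CUSTOM_BAREMETAL_" + strip_env(spec_name).replace(".", "").upper()

    def gb_to_mib(gigabytes: float) -> int:
        # specs use decimal gigabytes; Nova wants mebibytes
        return int(gigabytes * 1000**3 // 1024**2)

    def flavor_matches(flavor, spec) -> bool:
        # placeholder comparison; the real logic would compare every relevant field
        return (flavor.ram == gb_to_mib(spec["memory_gb"])
                and flavor.vcpus == spec["vcpus"]
                and flavor.disk == spec["disk_gb"])

    def reconcile(conn: openstack.connection.Connection, desired: dict) -> None:
        existing = {f.name: f for f in conn.compute.flavors()}
        for name, spec in desired.items():
            current = existing.get(name)
            if current is not None and flavor_matches(current, spec):
                continue  # already up to date, leave it alone
            if current is not None:
                conn.compute.delete_flavor(current)  # Nova has no real update
            flavor = conn.compute.create_flavor(
                name=name, ram=gb_to_mib(spec["memory_gb"]),
                vcpus=spec["vcpus"], disk=spec["disk_gb"])
            # extra specs (incl. the resource class) go in a separate call
            conn.compute.create_flavor_extra_specs(
                flavor, {nova_resource_class(name): "1"})
        # flavors removed from the repository are intentionally not pruned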

Now, you could of course implement all this logic inside Ansible (which really is just a YAML wrapper around Python), but I am pretty sure it would get messy and untestable pretty quickly. You could write a custom module for Ansible, but that would again be just another Python script.

Alternatively, you could manually write out playbooks with some of this data precomputed. However, this approach would result in duplicated data in various formats, which would make it more error-prone and require more work to maintain.

For the sake of argument, let's say we decided to use Ansible for all of this and you have a role that sets up flavors in Nova. Now you need to make sure that you run this playbook every time someone adds or removes a new flavor. As far as I know, we do not have the machinery to do that at the moment, so it would probably require additional dependencies like Argo to detect repository changes and yet another container image to execute the Ansible.

On top of all that, making such a solution testable would be difficult, and the result quite convoluted.

Based on these considerations, I believe implementing this in a programming language like Python is a more suitable approach than using a markup language.

This reverts commit 5ac057b, which was added in order to resolve the mismatch between the Nova and Ironic flavors that had punctuation in the name. The reverted commit incorrectly changed the resource class name on the Ironic side, while it was the Nova side that was missing the underscores. This was resolved in the commit before this one.
skrobul added a commit that referenced this pull request Nov 25, 2024
Based on the discussion in #499 (comment), this commit removes the repetitive directives that set the application UIDs and GIDs, as well as the user name and home directory. The new, standardised way is to use:
- a user named appuser with UID 1000
- a group named appgroup with GID 1000
- /app as the location of the code