Fix security config race condition #1718

sanders41 · 2022-11-08T18:20:34Z

Closes

Code Changes

Pre-load the config in the __init__.py
Load the security settings before loading the full model
Initialize SecuritySettings directly if the section doesn't exist in the toml config file
Add _FUNCS.clear() to clear Pydantic's validator cache before loading
Do NOT include the security defaults in the fides.toml file

Steps to Confirm

Build the container with nox -s "build(dev)"
Running docker run -it ethyca/fides:local should fail on the missing config items
Running docker run -e FIDES__USER__ANALYTICS_OPT_OUT=True -e FIDES__SECURITY__APP_ENCRYPTION_KEY=kfkdkslaksldkfjdkslaksldkfjskald -e FIDES__SECURITY__OAUTH_ROOT_CLIENT_SECRET=rootclient -e FIDES__SECURITY__OAUTH_ROOT_CLIENT_ID=someclient ethyca/fides:local should pass

Pre-Merge Checklist

All CI Pipelines Succeeded
Documentation Updated:
- documentation complete, or draft/outline provided (tag docs-team to complete/review on this branch)
- documentation issue created (tag docs-team to complete issue separately)
Issue Requirements are Met
Relevant Follow-Up Issues Created
Update CHANGELOG.md

Description Of Changes

Fixes the issue where the security config is not properly loading from environment variables.

The root cause is that Pydantic was delaying the loading of nested models when using environment variables. As a result, an error occurred if a nested value was accessed before it was loaded. The fix is to pre-load the nested model and add it to the parent model at initialization when loading from environment variables.
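The pre-loading idea can be sketched without Pydantic. This is only an illustration using the standard library: the `SecuritySettings` and `Config` dataclasses, the `load_security` and `load_config` helpers, and the rule that environment variables override file values are all assumptions made for the example, not the code in this PR. Only the `FIDES__SECURITY__` variable names come from the steps above.

```python
from dataclasses import dataclass
from typing import Mapping, Optional

ENV_PREFIX = "FIDES__SECURITY__"  # prefix used by the variables in "Steps to Confirm"


@dataclass
class SecuritySettings:
    """Hypothetical stand-in holding only the three fields named in this PR."""

    app_encryption_key: str
    oauth_root_client_id: str
    oauth_root_client_secret: str


@dataclass
class Config:
    """Hypothetical parent model that receives the already-built nested model."""

    security: SecuritySettings


def load_security(
    toml_section: Optional[Mapping[str, str]], env: Mapping[str, str]
) -> SecuritySettings:
    # Start from the TOML section if there is one, otherwise from nothing.
    values = dict(toml_section or {})
    # Assumption for this sketch: environment variables override file values.
    for name, value in env.items():
        if name.startswith(ENV_PREFIX):
            values[name[len(ENV_PREFIX):].lower()] = value
    try:
        return SecuritySettings(**values)
    except TypeError as exc:
        # Raised when a field is missing or an unknown key is present.
        raise ValueError(f"invalid or incomplete security settings: {exc}") from exc


def load_config(toml_data: Mapping[str, Mapping[str, str]], env: Mapping[str, str]) -> Config:
    # Build the nested model first, then hand it to the parent, so the
    # parent never holds a partially loaded security section.
    security = load_security(toml_data.get("security"), env)
    return Config(security=security)
```

Passing `os.environ` as `env` would make this read the real environment; taking a mapping instead keeps the sketch easy to test.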

Pre-loading turned up a new issue: in some environments (CI, for example) Pydantic would throw an error that validators were being reused, while in others (running locally, for example) no error occurred. This seems to be caused by a conflict between the server cache and Pydantic's cache. I tried removing our lru_cache to see if it was involved, but it made no difference. The solution turned out to be clearing Pydantic's cache with _FUNCS.clear() before loading the model. More context for this can be found here.

The other changes in this PR, bumping Pydantic and bringing fideslib code directly in, were attempts to solve this Pydantic issue before the cache-clearing option was found. In the end they made no difference, but we left them in since this change is planned as part of #1572.

sanders41 · 2022-11-08T19:22:38Z

@ThomasLaPiana these tests are passing when I run them locally. Any chance the failures could be related to the recent CI updates?

ThomasLaPiana · 2022-11-09T04:42:01Z

@sanders41 no, the latest main merge shows only 4 tests failing, all known-bad (Shopify errors + the Admin UI Cypress tests)

These are probably from this PR

ThomasLaPiana · 2022-11-09T04:44:06Z

this seems like a pretty sprawling change, simple but touches a ton of code. I'm sure you've already thought about it, but maybe there's a way to add one or two checks somewhere instead of in every function that might need it? This will get tricky to maintain/revert/document

sanders41 · 2022-11-09T06:27:23Z

I tried several other ways to do the checks, but mypy and/or loading was failing any other way. I don’t like the way I did this either, but I couldn’t come up with another way.

Troubleshooting the failures is also going to be difficult since they pass locally and the error is with the database config which didn’t change. The only difference I can think of right now is dev vs production builds, but don’t see why that would matter.

sanders41 · 2022-11-09T23:08:49Z

I'm still struggling to reproduce this error locally. I tried building and testing with the production image and that didn't do it either.

I also tried running with no volumes mounted to be sure it wasn't something in the fides.toml making the difference.

sanders41 · 2022-11-10T13:01:16Z

Possibly related: nazrulworld/fhir.resources#41. If so, it could be a caching issue. It's the same scenario, but in that issue it happened with Lambda rather than GitHub Actions.

NevilleS · 2022-11-10T21:04:13Z

Hey @sanders41, I think you can reproduce this pretty easily by doing the following:

docker run ethyca/fides:local

...and then start layering in minimal ENV variables, like:

docker run -e FIDES__USER__ANALYTICS_OPT_OUT=True -e FIDES__SECURITY__APP_ENCRYPTION_KEY=keygoeshere ethyca/fides:local

sanders41 · 2022-11-10T21:09:53Z

This is what I did, and it was running without error locally. I have it working both locally and in CI now, but the docs build is still failing. I'm running a test now that I hope fixes that as well.

ThomasLaPiana · 2022-11-13T17:25:45Z

I'm testing this in the plus PR and still seeing the security errors... still digging

ThomasLaPiana · 2022-11-13T17:36:58Z

If there is a config file, it will fail to init from the environment variables for the security settings

ThomasLaPiana · 2022-11-13T18:48:54Z

fixed the docs checks, next step is to fix the config tests (they need the required env vars now!)

…s init

src/fides/ctl/core/config/__init__.py

ThomasLaPiana · 2022-11-14T09:37:20Z

@NevilleS @sanders41 I want to call out that now, if a user does a pip install ethyca-fides, they will not be able to do anything until they've set these values, and the failure will happen before we have a chance to tell them

sanders41 · 2022-11-14T11:13:47Z

Would deferring the loading of the config help with this? Adding the load to the __init__.py was one of the first things done in troubleshooting, and it did help, but there have been a lot of other changes since then, so we might be able to get away with removing it.
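A rough sketch of what deferring could look like, using only the standard library. `get_config`, `MissingConfigError`, and the `REQUIRED` tuple are hypothetical names for illustration; treating exactly these three variables as required is an assumption taken from the docker run command in the PR description.

```python
import os
from functools import lru_cache
from typing import Dict

# Assumption: these are the values a user must set before anything works.
REQUIRED = (
    "FIDES__SECURITY__APP_ENCRYPTION_KEY",
    "FIDES__SECURITY__OAUTH_ROOT_CLIENT_ID",
    "FIDES__SECURITY__OAUTH_ROOT_CLIENT_SECRET",
)


class MissingConfigError(RuntimeError):
    """Hypothetical error carrying a message the user can act on."""


@lru_cache(maxsize=1)
def get_config() -> Dict[str, str]:
    # Nothing runs at import time; validation happens on first access.
    missing = [name for name in REQUIRED if name not in os.environ]
    if missing:
        raise MissingConfigError(
            "Set these environment variables before running: " + ", ".join(missing)
        )
    return {name: os.environ[name] for name in REQUIRED}
```

Because the check runs on first access rather than on import, a command that never needs the security settings would not hit it, and one that does gets a message naming the missing variables. `lru_cache` does not cache raised exceptions, so a later call succeeds once the variables are set.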

ThomasLaPiana · 2022-11-14T16:28:43Z

@sanders41 I think we can cross-check this PR and then merge it, and do the UX improvements in another PR, as this one is getting big

TheAndrewJackson · 2022-11-14T17:47:30Z

I think everything is working on my end. It was able to load the config but failed when trying to connect to the db with the provided snippet

docker run -e FIDES__USER__ANALYTICS_OPT_OUT=True -e FIDES__SECURITY__APP_ENCRYPTION_KEY=kfkdkslaksldkfjdkslaksldkfjskald -e FIDES__SECURITY__OAUTH_ROOT_CLIENT_SECRET=rootclient -e FIDES__SECURITY__OAUTH_ROOT_CLIENT_ID=someclient ethyca/fides:local

I think that is expected because no db config is provided so it used default-db. Here is the error for reference

sqlalchemy.exc.OperationalError: (psycopg2.OperationalError) could not translate host name "default-db" to address: Name or service not known

ThomasLaPiana · 2022-11-15T12:56:35Z

I got this branch working with the plus PR, so I'm going to merge this. Will immediately open a follow-up ticket around the user experience

Paul Sanders added 2 commits November 8, 2022 13:19

Fix security config race condition

22df5c9

Fix pylint error

05d9ed5

Paul Sanders added 4 commits November 9, 2022 09:05

Merge remote-tracking branch 'origin/main' into ps-config

d3e06e0

Refactor to allow loading security settings earlier

aace363

Use local database settings

56bc381

Pre-load database settings

f063e1f

Desperate attempt at debugging CI

73f9cad

Paul Sanders added 14 commits November 10, 2022 08:06

Remove unsuccessful debugging attempt

3f07a2c

Bump pydantic to see if it will fix CI issue

850b202

More CI debugging

d3971c1

Fix python version in debugging

a47e718

Fix typo

b65ff5f

Isolate docker build

dfb4181

Remove debugging

68fe613

Pull more fideslib code directly in

0060dd1

Fix install error

4f1abd3

Add missing argument

1bea527

Clear pydantic validator cache

bf67a37

Move cache clearing

0aeafa1

Add required security settings to test file

8f074a7

Add environment vairables to docs

6e444f9

Fix docs environment vairables

16549d5

ssangervasi mentioned this pull request Nov 12, 2022

privacy-center: Introduce fides-consent script package #1756

Merged

8 tasks

ThomasLaPiana added 2 commits November 14, 2022 00:39

code comment cleanup

fd5f833

refactor the new minimal config check to use the existing matrix pattern

35a2ecc

ThomasLaPiana added 3 commits November 14, 2022 01:37

remove the default security settings from the toml

f8d4b81

get the config loading with partial env definition

8e85f7c

fix the docs check

2b23404

fix some of the config tests

ffe09b4

ThomasLaPiana assigned ThomasLaPiana and sanders41 Nov 14, 2022

ThomasLaPiana added 4 commits November 14, 2022 16:40

fix the remaining config test failures

d39dec6

fix mypy errors

00ac243

remove the explicit security setting field from the config file

118f739

fix "check_install", skip validation for the initial security setting…

e70c5ea

…s init

ThomasLaPiana reviewed Nov 14, 2022

src/fides/ctl/core/config/__init__.py

re-add the required vars to the toml, since it is being docker ignored

9433107

Merge branch 'main' into ps-config

8a2db7c

ThomasLaPiana mentioned this pull request Nov 15, 2022

Improve UX for missing required config variables #1787

Closed

ThomasLaPiana self-requested a review November 15, 2022 13:54

ThomasLaPiana approved these changes Nov 15, 2022

ThomasLaPiana merged commit a2c5280 into main Nov 15, 2022

ThomasLaPiana deleted the ps-config branch November 15, 2022 13:54

sanders41 mentioned this pull request Dec 15, 2022

Verify unified fides can run without a fides/ctl/ops.toml file #1125

Closed
