Configuration generates invalid yaml for multiline and object content #179

dee0 · 2024-04-20T00:57:28Z

What happened:

My release failed with
ocm-system dmi-broker 10h False Could not load chart: cannot load values.yaml: error converting YAML to JSON: yaml: line 23: could not find expected ':'

Each of my dictionary and multiline string values were improperly indented. In the screenshot below you can see on the right hand side my file after Configuration was applied. Note that certs.rs.crt is improperly indented. On the left is a copy that I fixed 'by hand' and which is parsable.

The values.yaml in the dir resource in my component version contains

certs:
 intermediate: unset
 root: unset
 ingress:
  crt: unset
  key: unset
 rs:
  intermediate: unset
  root: unset
  crt: unset
  key: unset

My config.yaml in my component version, that is my config for the ocm controller, contains
defaults

configuration:
  defaults:
    certs:
      rs:
        crt: null

rules

  - value: (( certs.rs.crt ))
    file: dmi-broker/values.yaml
    path: certs.rs.crt

Running the command
kubectl get -n broker configmap dmi-broker -o=yaml | yq '.data.config' | yq -C | less -iR
to look at the content of my configmap that is used in my cluster I see the following

Image below shows the 'pipeline' I have defined for the ocm-controller

I believe this is an ocm-controller bug because

- values.yaml in the Localization Snapshot is valid, and
- data in the ConfigMap is valid, but
- values.yaml in the Configuration Snapshot is not valid yaml

What you expected to happen:

I expected the release to work of course :)

How to reproduce it (as minimally and precisely as possible):

I believe you just need to have a rule that is setting a multiline string value or an object value.

Anything else we need to know:

Not that I can think of

Environment:

My ocm-controller was built from this hash of this fork
https://github.com/dee0sap/ocm-controller/tree/228d0be45540590ea51b712d3fb21d2c2082ef72
because I need my quick fix for #68

dee0sap · 2024-04-20T06:49:40Z

I was able to add to the ocm project a test case which I believe reproduced the problem. You can see what I added in this PR open-component-model/ocm#734

dee0sap · 2024-04-20T17:30:38Z

Think this may be a goccy/go-yaml bug. Either that or ocm is using replace incorrectly.

https://go.dev/play/p/rfnbY5rbSk9

morri-son · 2024-04-22T08:18:47Z

adding @Skarlso

Skarlso · 2024-04-22T08:33:50Z

Yeah, we are just using whatever OCM does :)

Skarlso · 2024-04-24T14:48:07Z

Uwe found the issue in the corresponding YAML library. goccy/go-yaml#447

morri-son · 2024-04-25T08:29:28Z

@mandelsoft created goccy/go-yaml#447, but since @Skarlso found that the same issue was already reported two years ago without any fix, we most likely need to find a solution on our own. @mandelsoft mentioned that he may know an older lib which might act as the basis for a fix.

morri-son · 2024-04-26T16:32:56Z

@dee0 confirmed that, when a certain format is used as input, the lib produces the wanted output. We'll close this issue now and open a new one, checking for a lib that can handle things better.

dee0 · 2024-04-27T22:24:01Z

Hey @morri-son
Actually I haven't confirmed anything yet.

I just tried a new ocm-controller build with @Skarlso's 'don't marshal primitives' change and I think the ocm-controller is still unusable for me. This is because object data is not stored correctly by the configuration.

Object data is still yielding invalid yaml. e.g. I get

  orca_env_stable_values:
 certificate_authority_url: http://example1.com 
 deployment: deveaws
 deployment_size: xsmall
 domain: example2.com 
 landscape_region: eu12
 org: deveaws
 service_hostname_suffix: .example3.com 
 service_kubernetes_hostname_suffix: .example4.com

but it needs to be

  orca_env_stable_values:
    certificate_authority_url: http://example1.com 
    deployment: deveaws
    deployment_size: xsmall
    domain: example2.com 
    landscape_region: eu12
    org: deveaws
    service_hostname_suffix: .example3.com 
    service_kubernetes_hostname_suffix: .example4.com

I have not tried the 'formatting the text the right way' trick for the multiline string data yet. And I don't know how I can reasonably do that. In the 'last mile config' we won't know how much indenting is required and in spiff++ I am not seeing a good way to determine and then apply the required amount of indenting.
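To make the 'how much indenting' problem concrete, here is a minimal sketch of what a textual fix would have to do. It assumes the block is inserted as plain text and that nested lines sit two spaces deeper than the parent key. `indentUnder` and `leadingSpaces` are hypothetical helpers for illustration; they are not part of ocm, spiff++ or goccy/go-yaml.

```go
package main

import (
	"fmt"
	"strings"
)

// leadingSpaces counts the spaces before the first non-space character.
func leadingSpaces(line string) int {
	return len(line) - len(strings.TrimLeft(line, " "))
}

// indentUnder re-indents every non-empty line of block so that it nests
// two spaces deeper than keyLine (hypothetical helper, assumed convention).
func indentUnder(keyLine, block string) string {
	pad := strings.Repeat(" ", leadingSpaces(keyLine)+2)
	lines := strings.Split(strings.TrimRight(block, "\n"), "\n")
	for i, l := range lines {
		if l != "" {
			lines[i] = pad + l
		}
	}
	return strings.Join(lines, "\n")
}

func main() {
	key := "  orca_env_stable_values:"
	block := "deployment: deveaws\ndeployment_size: xsmall\n"
	fmt.Println(key)
	fmt.Println(indentUnder(key, block))
}
```

The point of the sketch is that the required padding depends on the target document, which is exactly the information the 'last mile config' does not have.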

What I may be able to do is to use spiff++'s 'asjson' function to double quote the multiline string. I'll try to give that a shot this weekend.
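The idea behind the double-quoting workaround can be sketched in Go. This assumes that 'asjson' encodes a string the way a standard JSON encoder does; that has not been verified here. `quoteForYAML` is a hypothetical name, and the certificate body is a placeholder.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// quoteForYAML encodes s as a JSON string. The result is a single-line,
// double-quoted scalar with newlines escaped as \n, which YAML parsers
// also accept, so no indentation of continuation lines is needed.
func quoteForYAML(s string) (string, error) {
	b, err := json.Marshal(s)
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	// Placeholder content, not a real certificate.
	pem := "-----BEGIN CERTIFICATE-----\nMIIB...\n-----END CERTIFICATE-----\n"
	q, err := quoteForYAML(pem)
	if err != nil {
		panic(err)
	}
	fmt.Println("crt: " + q)
}
```

Because the value ends up on one line, the amount of indenting at the target location no longer matters.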

Skarlso · 2024-04-28T05:13:42Z

Uhh. Probably our best option is to switch to yq lib. But it's going to be disruptive.

dee0 · 2024-04-28T13:54:59Z

Hey @Skarlso

Actually it seems like it is pretty easy. At least if the change is limited to subst.go.

While I have not pushed it up to github yet, I made the change in subst.go last night. While some of the tests in subst_test.go failed it looked like it was just due to formatting differences in the output.

I plan on taking a closer look at the failures today. Hopefully I'll be able to resolve them and I'll push my change up to my fork.

Oh, and I think as part of this I want to make sure the tests are covering

- My object case that is failing ( need to add something for that )
- Updating with yaml sequences, maps, each type of string, at least one non-string scalar and null
- Making multiple updates to the same yaml document

Regarding these test additions my thinking is

- If the first two bullets had been done then we wouldn't be having this problem now.
- In my change SubstitutionTarget now has a yqlib CandidateNode instead of an ast.File. This and the structure of the yqlib API make me think it is necessary to confirm multiple updates are having the desired effect

Skarlso · 2024-04-28T13:59:53Z

I didn't say it's hard. It's more like tedious. Also a completely breaking change. I did implement it using yq. I was able to throw out most of the code. The only missing feature was the struct based thing.

The breaking change is that the path must start with '.' and values need to be quoted. That might break some other things and will definitely break existing tests and demos and component versions.

morri-son · 2024-04-28T15:34:40Z

adding @hilmarf and @fabianburth to follow the discussion. We can plan to switch to another lib, but as Gergely mentioned it will require both changes to the implementation and, since it is a breaking change, changes on the consumer side...

Skarlso · 2024-04-28T19:04:44Z

I even included a simple example on how to use yqlib on the yq repository. mikefarah/yq#2021
You don't necessarily need a CandidateNode.

dee0 · 2024-04-29T07:03:27Z

Here https://github.com/open-component-model/ocm/actions/runs/8873092383/job/24358384683?pr=734
I have pushed the changes for switching to yqlib.

The path doesn't need to start with a '.'. Consider, with goccy the path needed to start with '$.' and the substitution code took care of this under the covers. So in similar fashion the code I pushed is adding the required '.' under the covers. So that isn't a breaking change.
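As a rough illustration of adding the '.' under the covers, a minimal sketch follows. `toYqPath` is a hypothetical name and is not taken from the PR; the actual change in subst.go may handle more cases.

```go
package main

import (
	"fmt"
	"strings"
)

// toYqPath turns a user-supplied path such as "certs.rs.crt" into a yq
// expression by prepending "." when it is missing (hypothetical helper).
func toYqPath(p string) string {
	if strings.HasPrefix(p, ".") {
		return p
	}
	return "." + p
}

func main() {
	fmt.Println(toYqPath("certs.rs.crt"))  // existing configs keep working
	fmt.Println(toYqPath(".certs.rs.crt")) // already a yq path, left as is
}
```

With a helper of this kind, paths written for the goccy-based implementation would not need to change.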

Unless consumers are doing something silly like depending on the precise formatting of the output YAML or JSON code they shouldn't care about this change.

And on that note, I switched the tests to use MatchYAML and MatchJSON instead of Equals on strings. Testing for string equality

- Makes the tests fragile, as the slightest change to the formatting of the yaml or json breaks the test. e.g. { "foo": "bar" } won't match {"foo":"bar"}
- Allows the test to pass when the output is neither valid yaml nor valid json.
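The difference between the two styles of assertion can be sketched with the standard library alone. This is not how Gomega's MatchJSON is implemented; `jsonEqual` is a hypothetical helper that only demonstrates the idea of comparing decoded values instead of raw strings.

```go
package main

import (
	"encoding/json"
	"fmt"
	"reflect"
)

// jsonEqual reports whether a and b decode to the same value. It returns an
// error if either input is not valid JSON, so malformed output cannot pass.
func jsonEqual(a, b string) (bool, error) {
	var va, vb interface{}
	if err := json.Unmarshal([]byte(a), &va); err != nil {
		return false, fmt.Errorf("first document: %w", err)
	}
	if err := json.Unmarshal([]byte(b), &vb); err != nil {
		return false, fmt.Errorf("second document: %w", err)
	}
	return reflect.DeepEqual(va, vb), nil
}

func main() {
	a := `{ "foo": "bar" }`
	b := `{"foo":"bar"}`
	eq, err := jsonEqual(a, b)
	// String equality fails on whitespace alone; the decoded values match.
	fmt.Println(a == b, eq, err) // prints "false true <nil>"
}
```

A semantic comparison of this kind addresses both points: formatting differences no longer break the test, and invalid documents produce an error instead of a false pass.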

Regarding the switch to CandidateNode in the SubstitutionTarget implementation. This is necessary to efficiently handle multiple updates to the same target.

While I have pushed the changes up and all the existing tests are passing, things I would still like to do before considering this ready for handover to anyone are

- Make sure that the test cases I mentioned here are in place
- Fix the lint error I see reported
- Perform a 'sanity check' review ( I pushed the code up as soon as the tests passed but before doing this )

Skarlso · 2024-04-29T07:09:39Z

> The path doesn't need to start with a '.'. Consider, with goccy the path needed to start with '$.' and the substitution code took care of this under the covers. So in similar fashion the code I pushed is adding the required '.' under the covers. So that isn't a breaking change.

Yes, you can do that in the background, but I would rather not do any "fixing of the yaml path" which is invisible to the user. That just opens up a bunch of problems when trying to debug failures. Any hidden modification of the yaml path is a problem. I would rather be up-front about what is required to make things work.

Also regarding your implementation, that's almost the same as I wrote, with the exception that I didn't bother with all the tree parsing. The default all parser can handle all of that and can be re-run as in the example I linked in.

morri-son · 2024-04-29T12:25:42Z

ok folks, then we keep this issue open until we have really figured out which way we plan to go in the future, using goccy or anything else, e.g. yq. Once the discussion has been finalised, we'll spin up a new issue with the exact steps to be done.

Skarlso · 2024-04-29T12:28:05Z

yah, I think this issue is fine. Dan has some great ideas and we can spit-ball in this issue. :)

dee0 · 2024-04-29T14:25:15Z

Thanks :)

As for making debugging easier,

- In the log message(s) make it clear what the source of the configuration value is, what the target file is and what the yq expression used is. Have the message clearly indicate it is a yq expression
- Provide tooling that lets me extract the source and the target file to disk

Then I can run the yq cli myself to validate whatever substitutions are taking place
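A log line along these lines might look like the following sketch. The message layout, the field names and `substitutionLogLine` are all assumptions made for illustration; nothing here reflects the controller's actual logging.

```go
package main

import "fmt"

// substitutionLogLine formats one log message naming the value source, the
// target file and the yq expression (hypothetical helper and field names).
func substitutionLogLine(source, targetFile, yqExpr string) string {
	return fmt.Sprintf("substitution: source=%q target=%q yq-expression=%q",
		source, targetFile, yqExpr)
}

func main() {
	fmt.Println(substitutionLogLine(
		"configuration.defaults", "dmi-broker/values.yaml", ".certs.rs.crt"))
}
```

With the expression logged verbatim, it could be pasted into the yq cli against the extracted target file.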

Fwiw, this is pretty much what I was doing at the start of this, except I was using the Go playground because goccy doesn't appear to have a cli.

Actually, elaborating a bit more on what I was doing

I have an Ingress that I add to my cluster and which exposes the registry. This enabled the next step.
I have a set of shell scripts that use crane to fetch the snapshot contents for a Resource, Localization or Configuration.
After pulling down the images as tgz I would untar the biggest tar file they contained, assuming it contained my helm chart resource
Then I would pass the values.yaml to the yq cli to figure out which file was invalid. I used the yq cli to make sure my paths were correct. ( Adding the required '.' :) )

If I didn't need to maintain my own ingress and extraction scripts, and if there were log messages that told me what I need to extract and what the expression used was, I think that would go a long way to making problems easier to diagnose.

dee0 · 2024-04-30T07:39:06Z

This morning I fixed the lint error however I see there is a test failure. That test doesn't fail locally.

This evening I added all the tests I said I wanted to add.

And I confirmed that an ocm-controller built with the changes in my ocm PR works. That is, the service I was trying to onboard to the ocm+ocm-controller+flux deployment actually was deployed. :)

I hope to have time tomorrow evening to look into that failing test + do a sanity check of the changes in the PR.

Oh, and at least when I am running the tests within vs code yqlib is very chatty. I'll check if that is the case during normal execution and if so look into reducing its verbosity.

Skarlso · 2024-04-30T07:49:30Z

> yqlib is very chatty

There is a logger that it uses that you can set to debug mode at the beginning to display it.

Skarlso · 2024-04-30T08:38:50Z

// GetLogger returns the yq logger instance.
func GetLogger() *logging.Logger {
	return log
}

This is the thing that you need to call and set it to debug.

dee0 · 2024-05-01T06:09:03Z

At least within the context of this ticket, I think I have made all the changes I would like to see in OCM.

This evening I

- Added a test case that validates that if destination is yaml style then any substitution that is added doesn't change that ( and fixed the code since it was failing )
- Reduced the yqlib verbosity
- Addressed the test failure from last night ( by refreshing from main branch )

So from my perspective what is left is

- I need to perform a sanity check of what is in the PR
- Need to handle any feedback + address any bureaucratic things

## Description

Fixes open-component-model/ocm-project#179

## What type of PR is this? (check all applicable)

- [ ] 🍕 Feature
- [x] 🐛 Bug Fix
- [ ] 📝 Documentation Update
- [ ] 🎨 Style
- [ ] 🧑‍💻 Code Refactor
- [ ] 🔥 Performance Improvements
- [x] ✅ Test
- [ ] 🤖 Build
- [ ] 🔁 CI
- [ ] 📦 Chore (Release)
- [ ] ⏩ Revert

## Related Tickets & Documents

- Related Issue # 179
- Closes # (179)
- Fixes # (179)

> Remove if not applicable

## Screenshots

## Added tests?

- [x] 👍 yes
- [ ] 🙅 no, because they aren't needed
- [ ] 🙋 no, because I need help
- [ ] Separate ticket for tests # (issue/pr)

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

## Added to documentation?

- [ ] 📜 README.md
- [x] 🙅 no documentation needed

## Checklist:

- [x] My code follows the style guidelines of this project
- [x] I have performed a self-review of my code
- [x] I have commented my code, particularly in hard-to-understand areas
- [x] I have made corresponding changes to the documentation
- [x] My changes generate no new warnings
- [x] I have added tests that prove my fix is effective or that my feature works
- [x] New and existing unit tests pass locally with my changes
- [x] Any dependent changes have been merged and published in downstream modules

---------

Co-authored-by: Gergely Brautigam <[email protected]>

dee0 added kind/bugfix Bug kind/task small task, normally part of feature or epic labels Apr 20, 2024

github-project-automation bot added this to OCM Backlog Board Apr 20, 2024

github-project-automation bot moved this to 🆕 ToDo in OCM Backlog Board Apr 20, 2024

morri-son added the component/ocm-controllers OCM Controllers label Apr 22, 2024

morri-son moved this from 🆕 ToDo to 🏗 In Progress in OCM Backlog Board Apr 22, 2024

Skarlso self-assigned this Apr 24, 2024

morri-son mentioned this issue Apr 25, 2024

Check how to fix issue with multi-line strings and goccy/go-yaml library #182

Closed

morri-son added component/ocm-core Open Component Model Core aka. go API area/ipcei Important Project of Common European Interest labels Apr 25, 2024

morri-son added this to the 2024-Q2 milestone Apr 25, 2024

Skarlso moved this from 🏗 In Progress to 🔍 Review in OCM Backlog Board Apr 25, 2024

Skarlso mentioned this issue Apr 25, 2024

Fix ocm issue 179, block in config yields invalid yaml open-component-model/ocm#734

Merged

25 tasks

morri-son moved this from 🔍 Review to 🍺 Done in OCM Backlog Board Apr 26, 2024

morri-son moved this from 🍺 Done to 🏗 In Progress in OCM Backlog Board Apr 29, 2024

Skarlso assigned dee0 and unassigned Skarlso Apr 30, 2024

Skarlso closed this as completed in open-component-model/ocm#734 May 3, 2024

github-project-automation bot moved this from 🏗 In Progress to 🍺 Done in OCM Backlog Board May 3, 2024

ocmbot bot moved this from 🍺 Done to 🔒Closed in OCM Backlog Board May 10, 2024
