CT 1998 use google protobuf to enable more flexible dictionaries #7190

gshank · 2023-03-18T20:50:22Z

resolves #6832

Description

To allow more flexible dictionaries in our logging events, we need the protobuf Struct data type, which betterproto does not support. In addition, betterproto supports Optional only in a beta version. To get Structs, we are switching to google protobuf.

There are a number of implementation differences between betterproto and google protobuf:

- In google protobuf the generated "Python" classes are not very Python-like, and the interfaces are odd.
- Betterproto supported a datetime-like timestamp field, which protobuf handles differently.
- Protobuf can instantiate messages from nested dictionaries, and it catches type mismatches that betterproto didn't.
- With betterproto it was necessary to pre-construct nested messages; this can't be done in protobuf.
- The interfaces for serializing are different.

Checklist

I have read the contributing guide and understand what's expected of me
I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have opened an issue to add/update docs, or docs changes are not required/relevant for this PR
I have run changie new to create a changelog entry

jtcohen6

Very cool! Really appreciated the walkthrough & recording :)

I tried this out locally, and I didn't see any obvious changes in JSON-formatted logs. No blocking comments from me.

jtcohen6 · 2023-03-21T12:57:06Z

core/dbt/events/functions.py



 LOG_VERSION = 3
 metadata_vars: Optional[Dict[str, str]] = None

+nofile_codes = ["Z012", "Z013", "Z014", "Z015"]


Since we can no longer do this via class inheritance (NoFile), we need to explicitly state here which types are "no file."

This is fine IMO - we don't have so many of these things - but it does feel trickier to test, now that these are no longer attributes.

(After watching the recorded walkthrough) Could we leave an inline code comment here? These events are specific to dbt clean, and we can't write file logs during clean because the log file might be one of the things that dbt is cleaning (= race condition)

I added a comment. In theory we could still use something like classes or method flags, but we'd have to get the class name from msg.info.name, do getattr(types, class_name), and then check for the existence of another class or execute a class method. So that's still an option if some other flag comes along that makes it worth it.

cool! agree that, in this case, it doesn't feel worth it — hard-coding the codes is a right-sized solution
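
For reference, the hard-coded approach agreed on in this thread can be sketched roughly as follows. This is only an illustration: `should_write_to_file` and `NOFILE_CODES` are hypothetical names (the diff above uses `nofile_codes`), and the real check in `functions.py` may look different. The code list is copied from the diff.

```python
# Event codes that must never be written to the log file. Per the thread,
# these events are specific to `dbt clean`, which may be deleting the log
# file itself, so file logging is skipped for them.
NOFILE_CODES = frozenset({"Z012", "Z013", "Z014", "Z015"})


def should_write_to_file(event_code: str) -> bool:
    """Return True unless the event code is on the hard-coded no-file list.

    Hypothetical helper for illustration only.
    """
    return event_code not in NOFILE_CODES


if __name__ == "__main__":
    # "A001" is just an example of a code that is not on the list.
    for code in ("Z012", "A001"):
        print(code, should_write_to_file(code))
```

A plain membership test like this is easy to unit test by code, which partly addresses the testability concern raised above.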

jtcohen6 · 2023-03-21T12:57:53Z

core/dbt/contracts/graph/nodes.py

@@ -223,10 +219,9 @@ def node_info(self):
            "node_status": str(self._event_status.get("node_status")),
            "node_started_at": self._event_status.get("started_at"),
            "node_finished_at": self._event_status.get("finished_at"),
-            "meta": meta_stringified,
+            "meta": getattr(self, "meta", {}),


jtcohen6 · 2023-03-21T12:58:30Z

core/dbt/contracts/results.py

+            msg_dict["started_at"] = self.started_at.strftime("%Y-%m-%dT%H:%M:%SZ")
+        if self.completed_at:
+            msg_dict["completed_at"] = self.completed_at.strftime("%Y-%m-%dT%H:%M:%SZ")
+        return msg_dict


Is this worth a utility function? It also looks like this is different from timestamp_to_datetime_string - just based on the precision we want ("%H:%M:%S.%f" vs "%Y-%m-%dT%H:%M:%SZ") - for parity with status quo?

I created a datetime_to_json_format_string utility method, and used it here and in base_types.py. The other one (timestamp_to_datetime_string) goes in a different direction, from the timestamp to a string, and the string it produces is shorter.
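
A minimal sketch of what such a utility could look like, assuming it simply wraps the format string shown in the diff above. The function name comes from the comment; the signature and body here are assumptions, not the merged code.

```python
from datetime import datetime, timezone


def datetime_to_json_format_string(dt: datetime) -> str:
    """Format a datetime the way the diff above does for started_at/completed_at.

    Assumption: the trailing "Z" is a literal, so callers are expected to
    pass a UTC datetime; no timezone conversion happens here.
    """
    return dt.strftime("%Y-%m-%dT%H:%M:%SZ")


if __name__ == "__main__":
    started_at = datetime(2023, 3, 21, 12, 57, 6, tzinfo=timezone.utc)
    print(datetime_to_json_format_string(started_at))  # 2023-03-21T12:57:06Z
```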

jtcohen6 · 2023-03-21T16:21:40Z

core/dbt/contracts/results.py

@@ -67,7 +68,7 @@ def __exit__(self, exc_type, exc_value, traceback):
        with TimingProcessor(self.timing_info):
            fire_event(
                TimingInfoCollected(
-                    timing_info=self.timing_info.to_msg(), node_info=get_node_info()
+                    timing_info=self.timing_info.to_msg_dict(), node_info=get_node_info()


In practice, do we expect this change (nested message → nested dict) to be breaking in any way? My sense is that it looks the same, if the event is being serialized to JSON, which is our only official current interface for consuming structured logs

I think it should have the same output. In betterproto we had to provide the pre-constructed sub-messages (which is the way Python works), and with google protobuf you can't.

jtcohen6 · 2023-03-21T16:24:01Z

core/dbt/events/base_types.py

+        except Exception as exc:
+            raise Exception(f"[{class_name}]: Unable to parse dict {kwargs},\n exc: {exc}")


As discussed during the live walkthrough, we want to stop throwing an exception (and stopping the run), and instead just log a warning that we failed to serialize the event:

[CT-2264] Don't raise exception during event serialization failure, just log warning #7113

@gshank made the point that it's a bit weird to fire an event within the code for firing events. I don't have strong opinions here on the implementation — just that, as an end user, I would want dbt to more gracefully handle serialization errors.

The nested fire_event seems to be working okay. Added a test for it.
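
The "warn instead of raise" behaviour requested here can be sketched as below. This is a simplified stand-in, not the code in `base_types.py`: `serialize_event` is a hypothetical name, `json.dumps` stands in for the protobuf parsing step, and a standard-library logger stands in for the nested `fire_event` call.

```python
import json
import logging
from typing import Any, Dict, Optional

logger = logging.getLogger("event_sketch")


def serialize_event(class_name: str, kwargs: Dict[str, Any]) -> Optional[str]:
    """Serialize event arguments; on failure, warn and return None.

    Hypothetical helper: the point is only that a serialization error is
    reported as a warning and does not stop the run.
    """
    try:
        return json.dumps(kwargs)
    except Exception as exc:
        logger.warning(
            "[%s]: Unable to parse dict %r, exc: %s", class_name, kwargs, exc
        )
        return None


if __name__ == "__main__":
    logging.basicConfig(level=logging.WARNING)
    print(serialize_event("ExampleEvent", {"msg": "ok"}))
    # object() is not JSON-serializable, so this logs a warning and prints None.
    print(serialize_event("ExampleEvent", {"msg": object()}))
```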

jtcohen6 · 2023-03-21T16:36:37Z

docs/arch/adr-005-betterproto.md

+Steps taken to mitigate the drawbacks of Google protobuf from above:
+* We are using a wrapping class around the logging events to enable a constructor that looks more like a Python constructor, as long as only keyword arguments are used.
+* The generated file is skipped in the pre-commit config
+* We can live with the awkward interfaces. It's just code.


We can live with the awkward interfaces. It's just code.

:)
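
For readers unfamiliar with the first mitigation in the ADR excerpt above, here is a rough sketch of a wrapper that offers a keyword-only, Python-style constructor. `EventWrapper` and its `fields` attribute are hypothetical; in the real code the keyword arguments would be handed to a generated protobuf message class rather than stored in a dict.

```python
from typing import Any, Dict


class EventWrapper:
    """Hypothetical wrapper with a keyword-only constructor."""

    def __init__(self, *args: Any, **kwargs: Any) -> None:
        if args:
            # Positional arguments are rejected so that every value is
            # explicitly matched to a field name.
            raise TypeError(
                f"{type(self).__name__} accepts keyword arguments only, "
                f"got {len(args)} positional argument(s)"
            )
        # Stand-in for constructing the generated message from kwargs.
        self.fields: Dict[str, Any] = dict(kwargs)

    def __getattr__(self, name: str) -> Any:
        # Only called when normal attribute lookup fails; read the dict via
        # __dict__ to avoid recursing back into __getattr__.
        fields = self.__dict__.get("fields", {})
        try:
            return fields[name]
        except KeyError:
            raise AttributeError(name) from None


if __name__ == "__main__":
    event = EventWrapper(msg="hello", code="A001")
    print(event.msg, event.code)
```

Calling `EventWrapper("hello")` raises `TypeError` in this sketch, which mirrors the "Checks positional args passed to logging events" commit in spirit only.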

gshank added 2 commits March 18, 2023 16:39

Swith from betterproto to google protobuf and enable more flexible meta dictionary

241062e

Changie

cfce5aa

gshank requested a review from a team as a code owner March 18, 2023 20:50

gshank requested a review from a team March 18, 2023 20:50

gshank requested review from a team as code owners March 18, 2023 20:50

gshank requested review from Fleid, aranke and peterallenwebb March 18, 2023 20:50

cla-bot bot added the cla:yes label Mar 18, 2023

gshank requested review from emmyoop and removed request for Fleid and aranke March 18, 2023 22:40

gshank added 9 commits March 18, 2023 19:13

Checks positional args passed to logging events

e30f1a5

Fix some timestamps

42ddcf5

Fix cache filter

03f1c00

add protobuf to setup.py

dbe2253

Remove NoFile and NoStdOut.

b49a1b8

Remove unneeded test_types.py

f177944

fix cache logging

4a7374d

Make LogSnapshotResult not a struct

152d797

Update betterproto ADR

640ba24

emmyoop mentioned this pull request Mar 21, 2023

update workflow to install dev requirements and remove action depreca… #7203

Merged

6 tasks

Merge branch 'main' into ct-1998-use_google_protobuf

ef935c6

jtcohen6 reviewed Mar 21, 2023

gshank added 5 commits March 21, 2023 14:48

Address various comments, utility functions, etc.

cce665e

Merge branch 'main' into ct-1998-use_google_protobuf

49febc6

Fix up after merge

7ce4536

Merge branch 'main' into ct-1998-use_google_protobuf

b18d664

Log inability to parse event arguments, add test

1e00f05

gshank added 3 commits March 22, 2023 12:29

Merge branch 'main' into ct-1998-use_google_protobuf

9babfa3

Fix up after merge

008387f

Tweak test which sometimes has different error due to timing

2ec2472

peterallenwebb approved these changes Mar 22, 2023

gshank merged commit ae485f9 into main Mar 22, 2023

gshank deleted the ct-1998-use_google_protobuf branch March 22, 2023 19:59

jtcohen6 mentioned this pull request Mar 27, 2023

New command: dbt show #7208

Merged

6 tasks

QMalcolm mentioned this pull request Mar 31, 2023

CT-2264, CT-2259, CT-1783: Improved event serialization failure handling #7249

Merged

6 tasks
