[CT-1635] Populate adapter_response for tests + sources #2964

jtcohen6 · 2020-12-17T14:44:25Z

Describe the feature

#2747 requested that dbt log bytes processed for all query types, including tests, source-freshness, and run-operations. This prompted the first part of a solution in #2961: adding an adapter_response dict to record database-specific information about the queries dbt has run. In v0.19.0, that will include bytes_processed for all materializations (model runs, seeds, snapshots).

We'd like to record bytes processed for tests and source snapshots, too. What does this really mean?

dbt test should populate adapter_response in run_results.json
dbt source snapshot-freshness should populate adapter_response in sources.json

(I'm less clear on how we'd manage this in run-operation, since it doesn't write to run_results.)

Eventually, this has the potential to dovetail nicely with:

#2079: add a new log line to the "summary" stdout, with the total bytes processed from the invocation. In an ideal world, users could get a human-friendly rounded number in stdout and the raw numbers in run_results.json for analysis/aggregation.
Configurable node status in log output #2580: the ability to define which result information is populated to message in stdout (maybe via Jinja macro??)

Describe alternatives you've considered

Not doing this

Additional context

Most immediately relevant to BigQuery (bytes processed)
Useful for any database which sends back significant information in its response

Who will this benefit?

Projects with larger datasets, where filtering is important to limit cost of test queries

The text was updated successfully, but these errors were encountered:

github-actions · 2021-11-07T01:48:18Z

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.

github-actions · 2022-05-15T02:13:15Z

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.

jtcohen6 · 2022-06-01T09:22:03Z

I think this may be as simple as using result.adapter_response to populate the RunResult for tests + freshness results, where currently we're setting them to just {}:

https://github.com/dbt-labs/dbt-core/blob/main/core/dbt/include/global_project/macros/materializations/models/table/create_table_as.sql

dbt-core/core/dbt/task/freshness.py

Line 123 in 75f3e8c

adapter_response={},

We can look to the run task for the reference implementation:

dbt-core/core/dbt/task/run.py

Lines 221 to 223 in 75f3e8c

    
           adapter_response = {} 
        
           if isinstance(result.response, dbtClassMixin): 
        
               adapter_response = result.response.to_dict(omit_none=True)

jtcohen6 · 2022-11-15T14:22:28Z

One thing we'd need to change to get this working for dbt source freshness: Currently, the collect_freshness macro returns just its table result. We need it to return the response object as well, which we can then serialize into an adapter_response dictionary.

To illustrate the changes that I think would be needed: a066c1e

sebas-bdr · 2022-12-09T13:58:24Z

hey @jtcohen6, this feature would be great.
I noticed the help_wanted label was recently removed, not sure (from reading the docs) whether this means contributions won't be accepted. But if possible I'd like to contribute.

To illustrate the changes that I think would be needed: a066c1e

Regarding testing, would the following code change suffice (specifically for source freshness)? sebas-bdr@572f143

jamwalla · 2023-01-17T21:37:07Z

I would love to have this feature to see how much my tests are costing!

I was able to get it working pretty easily for tests by returning the adapter response alongside the TestResultData object in TestRunner.execute_test:

dbt-core/core/dbt/task/test.py

Line 94 in 89d111a

def execute_test(self, test: TestNode, manifest: Manifest) -> TestResultData:

Same way that the ModelRunner does it:

dbt-core/core/dbt/task/run.py

Line 219 in 89d111a

adapter_response = result.response.to_dict(omit_none=True)

Then the adapter response can get passed up through TestRunner.execute.

Maybe not the prettiest solution but wanted to advocate for this being very useful + straightforward! thx dbt devs <3

resolves #2964

resolves dbt-labs/dbt-core#2964

jtcohen6 added the enhancement New feature or request label Dec 17, 2020

jtcohen6 added this to the Oh-Twenty [TBD] milestone Dec 17, 2020

jtcohen6 changed the title ~~Record bytes processed for tests~~ Populate adapter_response for tests + sources Dec 22, 2020

jtcohen6 added the artifacts label Dec 22, 2020

jtcohen6 mentioned this issue Dec 22, 2020

Add note about adapter_response in run_results.json, sources.json dbt-labs/docs.getdbt.com#495

Merged

3 tasks

jtcohen6 added the dbt tests Issues related to built-in dbt testing functionality label Dec 31, 2020

jtcohen6 mentioned this issue Mar 10, 2021

[Q1C2] More consistent, configurable tests #3066

Closed

jtcohen6 removed this from the Margaret Mead milestone May 10, 2021

github-actions bot added the stale Issues that have gone stale label Nov 7, 2021

github-actions bot closed this as completed Nov 16, 2021

kwigley removed the stale Issues that have gone stale label Nov 16, 2021

kwigley reopened this Nov 16, 2021

github-actions bot added the stale Issues that have gone stale label May 15, 2022

jtcohen6 added help_wanted Trickier changes, with a clear starting point, good for previous/experienced contributors and removed stale Issues that have gone stale labels Jun 1, 2022

Fleid added Team:Adapters Issues designated for the adapter area of the code jira and removed help_wanted Trickier changes, with a clear starting point, good for previous/experienced contributors labels Dec 8, 2022

github-actions bot changed the title ~~Populate adapter_response for tests + sources~~ [CT-1635] Populate adapter_response for tests + sources Dec 8, 2022

aezomz mentioned this issue Jan 18, 2023

add adapter_response for test #6645

Merged

6 tasks

jtcohen6 mentioned this issue Jan 24, 2023

[CT-1878] [Feature] <Optionally provide job_id for every model that gets executed> dbt-labs/dbt-bigquery#475

Closed

3 tasks

ChenyuLInx closed this as completed in #6645 Jan 24, 2023

ChenyuLInx pushed a commit that referenced this issue Jan 24, 2023

add adapter_response for test (#6645)

17014bf

resolves #2964

hasyimibhar pushed a commit to ridebeam/dbt-core that referenced this issue Mar 5, 2023

add adapter_response for test (#6645)

19e0916

resolves dbt-labs/dbt-core#2964

dbeatty10 mentioned this issue Apr 4, 2024

[Feature] Display the amount of processed data when running tests dbt-labs/dbt-bigquery#1169

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CT-1635] Populate adapter_response for tests + sources #2964

[CT-1635] Populate adapter_response for tests + sources #2964

jtcohen6 commented Dec 17, 2020 •

edited

Loading

github-actions bot commented Nov 7, 2021

github-actions bot commented May 15, 2022

jtcohen6 commented Jun 1, 2022

jtcohen6 commented Nov 15, 2022

sebas-bdr commented Dec 9, 2022

jamwalla commented Jan 17, 2023

[CT-1635] Populate adapter_response for tests + sources #2964

[CT-1635] Populate adapter_response for tests + sources #2964

Comments

jtcohen6 commented Dec 17, 2020 • edited Loading

Describe the feature

Describe alternatives you've considered

Additional context

Who will this benefit?

github-actions bot commented Nov 7, 2021

github-actions bot commented May 15, 2022

jtcohen6 commented Jun 1, 2022

jtcohen6 commented Nov 15, 2022

sebas-bdr commented Dec 9, 2022

jamwalla commented Jan 17, 2023

jtcohen6 commented Dec 17, 2020 •

edited

Loading