Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

explicitly adding docs for destination item size control #2118

Merged
merged 4 commits into from
Dec 13, 2024

Conversation

HulmaNaseer
Copy link
Collaborator

Description

Adding explicit documentation on how to control the destination item sizes. The files created in normalize stage are used for loading as well so controlling normalize file sizes (by setting max file size or enabling rotation) should control destination items sizes too. Currently, the bot does not pick it up as it's not explicit.

Related Issues

  • Fixes #...
  • Closes #...
  • Resolves #...

Additional Context

Copy link

netlify bot commented Dec 4, 2024

Deploy Preview for dlt-hub-docs ready!

Name Link
🔨 Latest commit 30b1185
🔍 Latest deploy log https://app.netlify.com/sites/dlt-hub-docs/deploys/675bf46da531980008401e88
😎 Deploy Preview https://deploy-preview-2118--dlt-hub-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

Copy link
Contributor

@AstrakhantsevaAA AstrakhantsevaAA left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the whole section looks confusing for me, I would make it as a note or info admonition in the Load section.
I would rephrase it as follows:

By enabling file rotation and adjusting the file_max_items and file_max_bytes settings, you can control the size and number of data chunks sent to the destination database. This can be particularly useful when dealing with large tables with millions of records, as it allows you to manage memory constraints on the database server and enhance performance.

@@ -50,6 +50,8 @@ Below, we set files to rotate after 100,000 items written or when the filesize e

<!--@@@DLT_SNIPPET ./performance_snippets/toml-snippets.toml::file_size_toml-->

### Controlling destination items sizes
As mentioned above, `dlt` uses the the same files created between **normalize** and **load** for loading to the destination. You can therefore control the destination items' sizes by controlling the maximum single file size or enabling rotation at the **normalize** stage as demonstrated in the snippet above.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
As mentioned above, `dlt` uses the the same files created between **normalize** and **load** for loading to the destination. You can therefore control the destination items' sizes by controlling the maximum single file size or enabling rotation at the **normalize** stage as demonstrated in the snippet above.
As mentioned above, `dlt` uses the same files created between **normalize** and **load** steps for loading data to the destination. You can therefore control the destination items' sizes by controlling the maximum single file size or enabling rotation at the **normalize** stage as demonstrated in the snippet above.

Copy link
Contributor

@AstrakhantsevaAA AstrakhantsevaAA left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good! Thank you Hulma <3

@AstrakhantsevaAA AstrakhantsevaAA merged commit 80ca474 into devel Dec 13, 2024
49 checks passed
@AstrakhantsevaAA AstrakhantsevaAA deleted the docs/control-destination-item-size branch December 13, 2024 10:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants