master merge for 0.5.4 release #1756

Merged
37 commits merged into master on Aug 28, 2024

rudolfix
Collaborator

Description

master merge for 0.5.4 release

willi-mueller and others added 30 commits August 9, 2024 15:56
* add storage_options to delta table create statement
* add delta table partitioning support

* document delta table partitioning support

* Update docs/website/docs/dlt-ecosystem/destinations/filesystem.md

---------

Co-authored-by: Anton Burnashev <[email protected]>
* displays log counter when new counter is created

* initializes load job counters only when package starts
…tom-auth

rest_api: documents pluggable custom auth
…-empty-page

RESTClient: stops pagination after empty page (Feat/1637)
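The behavior named in this commit title, ending pagination once a page comes back empty, can be illustrated with a minimal sketch. This is not dlt's RESTClient code: `paginate`, `fake_fetch`, and `max_pages` are hypothetical names used only for illustration, and the page source is an in-memory stand-in for an HTTP API.

```python
from typing import Callable, Iterator, List

Page = List[dict]


def paginate(fetch_page: Callable[[int], Page], max_pages: int = 1000) -> Iterator[Page]:
    """Yield pages in order and stop as soon as one is empty.

    `fetch_page` is a hypothetical callable that returns the records
    for a given zero-based page number.
    """
    for page_number in range(max_pages):
        page = fetch_page(page_number)
        if not page:
            # An empty page is treated as the end of the data set,
            # so no further requests are made.
            return
        yield page


def fake_fetch(page_number: int) -> Page:
    """In-memory stand-in for an API: two pages of data, then empty pages."""
    data: List[Page] = [[{"id": 1}, {"id": 2}], [{"id": 3}]]
    return data[page_number] if page_number < len(data) else []


pages = list(paginate(fake_fetch))
```

Here `pages` holds only the two non-empty pages; the third request returns an empty list and ends the loop instead of continuing to `max_pages`.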
* make sorting optional

* enable scd2 record reinsertion by dropping unique constraint

* document scd2 row id uniqueness characteristics

* assert unique constraint is not dropped when users bring their own hash
* black format

* remove code remnant

* add scd2 custom boundary timestamp feature

* add invalid scd2 active record timestamp test

* document scd2 boundary timestamp argument
* make lint fail on black format diff and add diff coloring

* format with black
* collects basic load job metrics in LoadJob

* adds remote uri to filesystem copy jobs metrics

* adds job id to load package info

* adds table name to job metrics

* skips run step when serializing trace

* adds trace shape test with trace schema

* tests job file name too long

* docs running pipelines with the same name for different envs

* extracts step metrics in common, renames followup jobs

* fixes tests

* fixes tests

* tests delta filesystem for remote_uri

* adds exec_info to trace contract test

* tests remote_uri for filesystem copy

* fixes platform test
* uses normalized column names when linking tables in relational

* adds destination capability indicating whether CREATE TABLE IF NOT EXISTS is supported

* generates IF NOT EXISTS for dlt tables

* adds logging for terminal and retry exception in run_managed of load job

* passes schema update to be collected in trace in filesystem

* fixes job log exception message
…ble format (#1742)

* black format

* increase minimum deltalake version dependency

* enable schema evolution for delta table merge

* extract delta table merge logic into separate function

* remove big decimal exclusion due to upstream bugfix

* evolve delta table schema in empty source case

* refactor DeltaLoadFilesystemJob

* uses right table path format in delta lake load job

* allows passing schema name when getting delta tables and computing table counts

* cleans up usage of remote paths and uris in filesystem load jobs

* removes tempfile from file_storage

---------

Co-authored-by: Marcin Rudolf <[email protected]>
rudolfix and others added 5 commits August 27, 2024 18:32
* Expose staging tables truncation to config

* Fix comments, add tests

* Fix tests

* Move implementation from mixin, add tests

* Fix docs grammar
* allows configuring external location and named credential for databricks

* fixes #1703

* normalizes 'value' when wrapping simple objects in relational, fixes #1754

* simplifies fsspec globbing and allows various url formats that are preserved when reconstituting full url, allows abfss databricks format

* adds info on partially loaded packages to docs

* renames remote_uri to remote_url in traces

* fixes delta for abfss

* adds nested tables dlt columns collision test
@rudolfix added the "ci full" label (run the full load tests on pr) on Aug 28, 2024
@rudolfix self-assigned this on Aug 28, 2024

netlify bot commented Aug 28, 2024

Deploy Preview for dlt-hub-docs canceled.

Latest commit: e9c9ecf
Latest deploy log: https://app.netlify.com/sites/dlt-hub-docs/deploys/66cf7e583eb93b00086e89c3

* always truncates staging tables on athena + replace without iceberg

* adds athena staging configs to all staging configs

* updates athena tests for staging destination
@rudolfix merged commit 9857029 into master on Aug 28, 2024
43 of 46 checks passed
Labels
ci full: run the full load tests on pr

8 participants