-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Lake][ETL] DuckDB E2E - Ingestion -> Dashboards #685
Labels
Comments
2 tasks
This was referenced Mar 5, 2024
Discovered some issues related to data fetching on the main branch, because multiple things are rewritten in the ETF flow I leave them here in case they got solved along the way, if not maybe here would be the place to solve them:
|
This was referenced Apr 29, 2024
This was referenced May 6, 2024
Closing this ticket and moving remaining items to some other ticket so i can reconcile outstanding issues. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Motivation
We completed the techspikes around our data infrastructure. As an outcome, we're going to implement our ETL pipeline in DuckDB such that we maintain many of our constraints:
Outline
Our first goal is to take the current ETL workflow and update it end-to-end.
Shelved Deliverables
CLOSED TICKET - Add ETL checkpoint to enforce SLAs, and process data incrementally. #694
Reason: We're going to instead implement a build step that leverages a simple SQL strategy w/ temp tables, such that we can enforce SLAs in a clean manner.
DoD
[First Deliverable - Update Ingestion + Load]
[Core System Updates]
[Update ETL Deliverables]
etl_
view for ETL build steps. Downstream bronze and silver tables require data from bothlive_
andbuild_
tables. #810[ETL CLI Deliverables]
pdr analytics describe, query, validate, resume
CLI command #883[Cleanup Deliverables]
parquet_dir
uselake_dir
instead #770[Ratchet Integration]
[Post-DuckDB Merge - Core Functionality]
[Post-DuckDB - Peripheral Functionality]
These are frozen. Do not start/complete until DuckDB review/work is complete.
fill
becomesinsert
,override
becomesupsert
The text was updated successfully, but these errors were encountered: