Retry of producing a stage output #167

windiana42 · 2024-03-21T22:07:40Z

We can currently retrieve results from cache only for tables that made it through a stage commit (i.e., the schema swapping procedure). For really large tables, however, it would be nice to resume producing a stage output at the point where a previous run crashed. Cache invalidation already happens at table level, so it would be possible to track which tables are still valid. The tricky part is that we would need to validate that everything in the transaction schema is cache valid up to a certain point and then continue from there.
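
To make the idea concrete, here is a minimal sketch in plain Python of what "validate up to one point and continue from there" could look like. It does not use the pipedag API; `TableTask`, `cache_key`, `resume_stage`, and the dictionary standing in for the transaction schema are all hypothetical names made up for illustration, and it assumes the tables of a stage are produced in a fixed order.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class TableTask:
    name: str                    # table name inside the stage
    cache_key: str               # hypothetical hash of task code and inputs
    produce: Callable[[], None]  # writes the table into the transaction schema


def resume_stage(tasks: List[TableTask], transaction_schema: Dict[str, str]) -> List[str]:
    """Re-run only the tables from the first cache-invalid one onwards.

    transaction_schema maps table name -> cache key recorded when that table
    was written during the crashed run (a stand-in for real metadata).
    Returns the names of the tables that were (re)produced.
    """
    # Find the first table that is missing or whose cache key changed.
    first_invalid = len(tasks)
    for i, task in enumerate(tasks):
        if transaction_schema.get(task.name) != task.cache_key:
            first_invalid = i
            break

    # Everything before first_invalid is kept; the rest is produced again.
    produced = []
    for task in tasks[first_invalid:]:
        task.produce()
        transaction_schema[task.name] = task.cache_key
        produced.append(task.name)
    return produced
```

In this sketch, a crashed run that only managed to write table `a` would leave `{"a": <key>}` behind, and calling `resume_stage` again would produce only the remaining tables. Once one table is invalid, all later tables are rerun as well, which is the conservative choice when later tables may depend on earlier ones.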

There is a workaround for this problem: wrap every table in its own nested Stage. This achieves pretty much the desired result described above. However, it would still be nicer if all tables ended up within one schema.
The other downside of the workaround is that it makes the schema swapping of the surrounding Stage useless, since each table has its own stage commit and overwrites its cache one by one.

windiana42 added the enhancement, usability, and refactoring labels Mar 21, 2024

windiana42 mentioned this issue May 7, 2024

Allow running tasks on active schema (Big DANGER mode disclaimer) #192

Open

windiana42 mentioned this issue Jun 11, 2024

Mixed per-user and team-shared pipeline runs #199

Open
