Support dynamically table addition in flink-cdc-base #1152

Closed
Tracked by #1728 ...
PatrickRen opened this issue May 6, 2022 · 20 comments
Labels: 3.1, enhancement (New feature or request), task (The development task)

Comments

@PatrickRen
Contributor

Currently, the flink-cdc-base framework doesn't support discovering and adding tables dynamically. This feature is already implemented in the MySQL CDC connector, so the framework needs to support it before the MySQL CDC connector can be adapted to flink-cdc-base.

@PatrickRen PatrickRen added the enhancement New feature or request label May 6, 2022
@molsionmo
Contributor

Please assign this task to me if no one else has taken it. I'm happy to take it on, @PatrickRen.

@sap1ens
Contributor

sap1ens commented Dec 6, 2022

Hey @PatrickRen @leonardBang, do you know if anyone is actively working on this?

I believe this is needed for #1163 (we'd like to ship incremental support in Postgres with the scan.newly-added-table.enabled feature). I'd love to be assigned; I've already started working on this.

@leonardBang
Contributor

Hello @molsionmo, are you still working on this?

@sap1ens
Contributor

sap1ens commented Dec 21, 2022

Could you reassign this to me? I created a PR for this: #1838

@molsionmo
Contributor

@leonardBang I'm sorry for not replying in time. I developed part of this work earlier but didn't have time to submit a PR. I have now submitted that PR separately, and comparing it with sap1ens's work, many parts are similar. Thank you, @sap1ens, for your excellent work.

My PR only covers support for dynamic table addition in flink-cdc-base. If sap1ens's PR is adopted, I will close my PR and take part in the review and testing work.

@ruanhang1993 ruanhang1993 removed this from the V2.4.0 milestone May 19, 2023
@sap1ens
Contributor

sap1ens commented Jun 16, 2023

Can anyone look at my PR again?

@leonardBang
Contributor

Can anyone look at my PR again?

Hey @sap1ens, Jiabao is helping to review the PR, but we've recently been busy with the 2.4 code freeze, so the review may continue later. The PR is a huge enhancement, and since we're close to the code freeze date, I'd like to move it to the next version. WDYT?

@sap1ens
Contributor

sap1ens commented Jun 16, 2023

Sure, just wanted to remind before the 2.4 release, but it looks like it's too late :) No worries.

@1032851561

I hope this can be merged into version 2.4.

@ruanhang1993 ruanhang1993 mentioned this issue Jun 27, 2023
5 tasks
@sap1ens
Contributor

sap1ens commented Aug 29, 2023

Hi, just wanted to remind you about the PR again, thanks!

@sap1ens
Contributor

sap1ens commented Dec 14, 2023

Could this be considered for 3.1? I can look into rebasing the PR if needed, assuming it'll get the attention.

@leonardBang leonardBang added this to the V3.1.0 milestone Dec 15, 2023
@leonardBang
Contributor

@sap1ens I added this to the 3.1 roadmap. @molsionmo Do you have time to finish this in version 3.1? We can find someone else to finish this task if you are busy with your company's work.

@leonardBang leonardBang added task The development task 3.1 labels Dec 15, 2023
@sap1ens
Contributor

sap1ens commented Dec 15, 2023

I'll take a look at the PR tomorrow and let you know. Thanks!

@leonardBang leonardBang assigned sap1ens and unassigned molsionmo Dec 15, 2023
@sap1ens
Contributor

sap1ens commented Dec 19, 2023

@leonardBang I've updated the PR: #1838. However, a lot of things have changed since December 2022 🙂. I found several PRs with changes for this feature, including a very large one.

What's your guidance here?

Should we copy the latest implementation of the Scan Newly Added Tables feature? It'll probably take me several days to accommodate new changes + there is more testing needed. But it may make sense to do it if you think that the existing implementation in MySQL is significantly better (the non-blocking reads are great).

On the other hand, if the current PR is good enough I can quickly add support for Postgres after that and it's already well-tested in prod (we've been running it in prod for about a year).

@leonardBang
Contributor

Hey @sap1ens, thanks for the update.
I think we should copy the latest implementation, which is better than the previous one. We can target this feature for the 3.1 release; there is enough time to finish it within the 3.1 development cycle. WDYT?

@loserwang1024
Contributor

loserwang1024 commented Dec 19, 2023

@sap1ens, it seems that not blocking the process for newly added tables is the better approach. I'm also interested in PG CDC and have enough time at the moment, so I'd like to collaborate with you; for instance, I can help implement certain functionality or review and give feedback on your pull requests.
By the way, my PR "Add SNAPSHOT_ONLY mode for Incremental CDC Source" may have an influence on this (both will stop the stream split, for different purposes), so I will complete it this week so that it doesn't block this PR.

@sap1ens
Contributor

sap1ens commented Dec 20, 2023

@leonardBang I've attempted to apply the new updates from the PRs I identified but, unfortunately, it's just too much work for me at the moment; I only have a few working days left in the year. I'm also not sure that this list of PRs is complete. Likely it's not, and copying changes requires comparing all relevant files one by one.

But I do believe it's an important change and waiting longer will increase the difference between the connectors even more. So I'd appreciate any help here, FYI @loserwang1024.

Once the cdc-base is updated, I'm happy to contribute Postgres-specific changes and tests.

@loserwang1024
Contributor

@leonardBang, I'd like to do it. @sap1ens, thanks a lot; being able to reference your past work will save me a lot of trouble.

@PatrickRen
Contributor Author

Closing this issue because it was created before version 2.3.0 (2022-11-10). Please try the latest version of Flink CDC to see if the issue has been resolved. If the issue is still valid, kindly report it on Apache Jira under project Flink with component tag Flink CDC. Thank you!

@PatrickRen PatrickRen closed this as not planned Feb 28, 2024
@sap1ens
Contributor

sap1ens commented Feb 28, 2024

Actually, this was implemented in #3024

7 participants