-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disaster Recovery #321
Comments
There is a WAL S3 bucket and defined in the operator configuration. No secrets are supported yet, although WAL-E relies on the instance profile and there is a support for https://github.com/jtblin/kube2iam project in the form of the annotation supplied by the operator for the pods (regulated by the
Backups can be configured in Spilo (there is no need to trigger them from the operator), however, the operator currently doesn't make them configurable, deferring to the Spilo default of once a day for the
The operator will not anything with the backups, this is the task of WAL-E called from Spilo (the operator is just an engine to set environment variables to Spilo and create/maintain Kubernetes objects). WAL-E already removes backups and WAL segments older than NB: as of yesterday, the Spilo behavior has been change to treat BACKUP_NUM_TO_RETAIN to days instead of number of backups. |
As the word backup is also ambiguous, logical backups are not handled currently and we are not of the opinion the operator should be responsible here. We use Kubernetes jobs to do this. At some point though the operator may in fact create the K8S jobs but will not execute the backups or trigger them. |
there is the possibility to use the embedded wal-e for backups/restore as @alexeyklyukin said. you can customize the env variables: #152 |
I don't think we have an ETA for this now, the AWS credentials should imho come from kube2iam or similar solution that could do the setup. Is this not possible in your case? |
Hello, Thank you for the great operator. Do we have any options to configure BACKUP_SCHEDULE and BACKUP_NUM_TO_RETAIN on operator side or still no? With best regards, Oleg |
Hello again, Do you have any updates about following question? |
@excavador have you seen that the support for logical backups has been merged recently --> #442 |
Dear developers, I confused so much. My actual problem/task: What we have right now?
I would be so much happy to understand how to achieve my goals :) Thank you so much! |
We do not cover this use case completely. We currently only support via Spilo to keep N basebackups plus the continuous WAL stream for that time frame. Lets assume for now 7 days. This means PITR and restore from basebackup and WAL within these days is possible. Logical backup serves the purpose of storing compressed smaller dumps for a longer period of time while losing point in time recovery capability beyond the 7 days. Restore from logical is not covered yet. I assume this will be another custom bootstrap or a combination of DB + 1 job deployment in the future. @CyberDem0n can maybe comment on if your usecase of one very old base back (lets say initial one plus WAL) is there and would work. There is a little bit of doubt in my mind when it comes to the fail over behavior and the continuity of WAL. Thats why e.g. Spilo ships a base backup after promote. |
Yes, in theory. Actually, spilo directly choose latest available backup for specified date (in case of cluster "clone" from backup)
Yeah, it described in #568
Ideally, I prefer to see ability to "clone" from logical backup, like we already have for plain backups |
Yeah, for the clone I guess I was mixing spec and implementation. |
@Jan-M so, shortly, I just want to have ability to configure backup frequency using Postgres CRD object manifest, and flexible options to restore (using "clone") with Postgres CRD object manifest. |
Over time the operator will improve in that area exposing more options to the manifest where it makes sense and leaving some options to be determinded on the global level. On some issues we are working and others are always open to be contributed to. |
Hi! Best regards, Oleg |
As far as I understand, the operator boots Spilo containers which have wal-e installed. Therefore, it would be useful to be able to provide wal-e parameters to make it ship periodic backups and wal to an object store like S3.
It would be nice to be able to:
The text was updated successfully, but these errors were encountered: