Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[stardog] Disable backup restore testing #348

Merged
merged 3 commits into from
Sep 6, 2021

Conversation

psy-q
Copy link
Contributor

@psy-q psy-q commented Sep 4, 2021

What this PR does / why we need it:

The current tactic of comparing restored triple counts to backed up triple counts has caused problems with phantom databases remaining in Stardog's memory for months, leading to cluster sync problems. We're now also seeing write lock acquisition failures during the backup jobs, especially when long transactions are happening, which makes it less likely to actually get a working backup at all, let alone a restore. Backups on one large database rarely succeed even after running for 22h.

The restore also puts a lot of additional IOPS/CPU load on all nodes that could be better spent serving production workloads. The nodes not only have to restore the backup but also recompute indexes afterwards.

Thirdly, according to Stardog this measure is no longer necessary since the bug causing partial backups is said to be fixed.

We propose to create a separate, non-clustered Stardog node to test restores against instead of using the cluster. Until that's ready, this PR wants to comment out the restore functionality to see if it fixes some of the cluster sync problems.

Checklist

  • DCO signed
  • Variables are documented in the values.yaml using the format required by Helm-Docs.
  • Title of the PR contains starts with chart name e.g. [chart]

Signed-off by: Ramón Cahenzli [email protected]

Signed-off-by: Ramón Cahenzli <[email protected]>
Signed-off-by: Ramón Cahenzli <[email protected]>
@psy-q psy-q force-pushed the disable_backup_restore_testing branch from 834f920 to 198207e Compare September 4, 2021 13:05
@mhutter
Copy link
Member

mhutter commented Sep 6, 2021

Instead of commenting out stuff, why not simply remove it? This is Git after all....

Signed-off-by: Ramón Cahenzli <[email protected]>
@psy-q psy-q force-pushed the disable_backup_restore_testing branch from 7b14667 to d35e9e6 Compare September 6, 2021 07:46
@psy-q psy-q marked this pull request as ready for review September 6, 2021 08:38
@zugao zugao merged commit ee834f5 into master Sep 6, 2021
@zugao zugao deleted the disable_backup_restore_testing branch September 6, 2021 08:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants