fix(cassandra) ensure single coordinator in migrations #2326

thibaultcha · 2017-04-03T22:02:28Z

Summary

This ensures we respect the proper 'single coordinator' pattern for
migrations when one of them is using the DAO's find_all() method.

Full changelog

ensure find_all() only uses the migrations coordinator.
ensure we wait for schema consensus before doing so, since the
coordinator will have to get responses about the table's content from
its peers.
perf: only wait for schema consensus if we ran some migrations at all.
properly pass C* timeout values set in the Kong configuration to the
driver.
new cassandra_schema_consensus_timeout property.
bump lua-cassandra to 1.2.1 which ensures we update the Nginx time
when testing for a schema consensus timeout.

NOTE: Do NOT merge yet.

This ensures we respect the proper 'single coordinator' pattern for migrations when one of them is using the DAO's `find_all()` method. * ensure `find_all()` only uses the migrations coordinator. * ensure we wait for schema consensus before doing so, since the coordinator will have to get responses about the table's content from its peers. * perf: only wait for schema consensus if we ran some migrations at all.

Implement a new `cassandra_schema_consensus_timeout` property to increase the C* `max_schema_consensus_wait` value. Particularly useful for clusters where the inter-nodes communication seems to be slow and the schema changes during migrations can take more than the default of 10s, and make the migration fail unnecessarily.

Tieske

I don't know how this works, as the magic seems to be in the driver.

Would it be possible to add a test for the behaviour?

Tieske · 2017-04-05T08:42:30Z

kong/dao/db/cassandra.lua

+    -- before performing such a DML query
+    local ok, err = self:wait_for_schema_consensus()
+    if not ok then
+      return nil, "could not wait for schema consensus: " .. err


"failed waiting for schema consensus"

Tieske · 2017-04-05T08:45:31Z

kong/dao/factory.lua

+      local ok, err = self.db:wait_for_schema_consensus()
+      if not ok then
+        return ret_error_string(self.db.name, nil,
+                                "could not wait for schema consensus: " .. err)


"failed waiting for..."

both forms are used intermittently in the codebase

I'd say that only impatient people 'cannot wait' 😄

thibaultcha · 2017-04-05T08:59:40Z

The "do not merge" label is for refused PRs.

Would it be possible to add a test for the behaviour?

Sadly not, or else it would be included.

thibaultcha · 2017-04-05T09:05:09Z

as the magic seems to be in the driver.

Actually the driver has little to do with this all.

Tieske · 2017-04-05T11:37:45Z

The "do not merge" label is for refused PRs.

surely we close those? this label at least is more descriptive than a foot note in the original post.

Tieske

considering updating the error messages optional.

thibaultcha · 2017-04-05T17:53:53Z

surely we close those?

It is sometimes more complicated than that.

I'd say that only impatient people 'cannot wait' 😄

Sounds just like the definition of a timeout!

This ensures we update the Nginx time between schema consensus timeout checks and also adds the ability to manually add and remove C* peers. thibaultcha/lua-cassandra@1.1.1...1.2.1

thibaultcha · 2017-04-07T18:26:52Z

considering updating the error messages optional.

Updated them in the end 😉

thibaultcha added 3 commits April 3, 2017 14:57

fix(cass) properly set C* timeout options

d000d30

thibaultcha added this to the 0.10.2 milestone Apr 4, 2017

Tieske added the pr/status/do not merge label Apr 5, 2017

Tieske reviewed Apr 5, 2017

View reviewed changes

thibaultcha removed the pr/status/do not merge label Apr 5, 2017

Tieske approved these changes Apr 5, 2017

View reviewed changes

chore(deps) bump lua-cassandra to 1.2.1

ddeea29

This ensures we update the Nginx time between schema consensus timeout checks and also adds the ability to manually add and remove C* peers. thibaultcha/lua-cassandra@1.1.1...1.2.1

thibaultcha force-pushed the fix/single-coordinator-migrations branch from 9ec9e79 to ddeea29 Compare April 6, 2017 22:14

thibaultcha merged commit 2d3bb54 into master Apr 7, 2017

thibaultcha deleted the fix/single-coordinator-migrations branch April 7, 2017 18:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(cassandra) ensure single coordinator in migrations #2326

fix(cassandra) ensure single coordinator in migrations #2326

thibaultcha commented Apr 3, 2017 •

edited

Loading

Tieske left a comment

Tieske Apr 5, 2017

Tieske Apr 5, 2017

thibaultcha Apr 5, 2017 •

edited

Loading

Tieske Apr 5, 2017

thibaultcha commented Apr 5, 2017 •

edited

Loading

thibaultcha commented Apr 5, 2017

Tieske commented Apr 5, 2017

Tieske left a comment

thibaultcha commented Apr 5, 2017

thibaultcha commented Apr 7, 2017

fix(cassandra) ensure single coordinator in migrations #2326

fix(cassandra) ensure single coordinator in migrations #2326

Conversation

thibaultcha commented Apr 3, 2017 • edited Loading

Summary

Full changelog

Tieske left a comment

Choose a reason for hiding this comment

Tieske Apr 5, 2017

Choose a reason for hiding this comment

Tieske Apr 5, 2017

Choose a reason for hiding this comment

thibaultcha Apr 5, 2017 • edited Loading

Choose a reason for hiding this comment

Tieske Apr 5, 2017

Choose a reason for hiding this comment

thibaultcha commented Apr 5, 2017 • edited Loading

thibaultcha commented Apr 5, 2017

Tieske commented Apr 5, 2017

Tieske left a comment

Choose a reason for hiding this comment

thibaultcha commented Apr 5, 2017

thibaultcha commented Apr 7, 2017

thibaultcha commented Apr 3, 2017 •

edited

Loading

thibaultcha Apr 5, 2017 •

edited

Loading

thibaultcha commented Apr 5, 2017 •

edited

Loading