Added short-term caching of the first column in the channel manifest #180

billkalter · 2018-06-20T19:54:46Z

Github Issue

None

What Are We Doing Here?

An inefficiency was uncovered when the head of the "manifest" table for queues and databus subscriptions has accrued many tombstones that have not yet been compacted away. Since every poll re-reads the manifest from the beginning, the entire poll is slowed down as Cassandra scans over the tombstoned records. This PR briefly caches the oldest known slab ID in the manifest, minus a 1 minute buffer. Future queries can start at this cached position in the manifest to bypass any tombstones from older, fully read and deleted slabs.
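
As a rough illustration of the idea, here is a minimal sketch, not the code in this PR. It assumes slab IDs can be modeled as millisecond timestamps and that the cache lifetime is a constructor parameter; `ManifestHeadCache`, `startFor`, `recordOldest` and `ttlMillis` are hypothetical names chosen for the example.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch: slab IDs are modeled as millisecond timestamps (an assumption).
final class ManifestHeadCache {
    // 1 minute buffer to tolerate out-of-order writes.
    static final long BUFFER_MILLIS = 60_000L;

    private final long ttlMillis; // assumed short cache lifetime
    // channel -> { position to start reading from, expiry time }
    private final Map<String, long[]> oldestSlab = new ConcurrentHashMap<>();

    ManifestHeadCache(long ttlMillis) {
        this.ttlMillis = ttlMillis;
    }

    // Returns where a poll should start reading the manifest,
    // or Long.MIN_VALUE to read from the very beginning.
    long startFor(String channel, long nowMillis) {
        long[] entry = oldestSlab.get(channel);
        return (entry != null && nowMillis < entry[1]) ? entry[0] : Long.MIN_VALUE;
    }

    // Records the oldest slab seen by a poll unless an unexpired entry already exists.
    void recordOldest(String channel, long oldestSlabMillis, long nowMillis) {
        oldestSlab.compute(channel, (key, entry) ->
                (entry != null && nowMillis < entry[1])
                        ? entry
                        : new long[] {oldestSlabMillis - BUFFER_MILLIS, nowMillis + ttlMillis});
    }
}
```

Once the entry expires, the next poll falls back to reading from the beginning, so a stale cached position can only delay reading older manifest rows until expiry rather than skip them permanently.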

How to Test and Verify

There is no test specifically for this condition. The most important test is regression.

Risk

This is a fairly low-risk update. Even though it is at the heart of databus and queue channels, the caching should serve as an optimization without risking that any manifest data goes completely unread.

Level

Medium

Required Testing

Regression

Code Review Checklist

Tests are included. If not, make sure you leave us a line or two for the reason.
Pulled down the PR and performed verification of at least being able to build and run.
Well documented, including updates to any necessary markdown files. When we inevitably come back to this code it will only take hours to figure out, not days.
Consistent/Clear/Thoughtful? We are better with this code. We also aren't a victim of rampaging consistency, and should be using this course of action. We don't have coding standards out yet for this project, so please make sure to address any feedback regarding STYLE so the codebase remains consistent.
PR has a valid summary, and a good description.

sujithvaddi · 2018-06-20T20:13:19Z

event/src/main/java/com/bazaarvoice/emodb/event/db/astyanax/AstyanaxEventReaderDAO.java

+        try {
+            // Subtract 1 minute from the slab ID to allow for a reasonable window of out-of-order writes while
+            // constraining the number of tombstones read to 1 minute's worth of rows.
+            _oldestSlab.get(channel, () ->


@billkalter should it be put() here instead of get() :?

billkalter

Agreed, this is very confusing and I tried to add a comment to clarify. With the newer Java interfaces like ConcurrentMap you can use computeIfAbsent() or putIfAbsent(). The Guava cache interface doesn't have a similar method, but if you do a get() with a loader this way it does the same thing: it caches the new value only if there is no current unexpired version in the cache. From their docs:

This method provides a simple substitute for the conventional "if cached, return; otherwise create, cache and return" pattern.
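
For readers unfamiliar with the pattern, the following is a small sketch of the ConcurrentMap analogue mentioned above, using only the JDK. It is not the Guava call from the diff and does not model expiration; `GetOrLoadDemo`, `cacheIfAbsent` and `loads` are hypothetical names for illustration.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical demo of "if cached, return; otherwise create, cache and return".
final class GetOrLoadDemo {
    private final ConcurrentMap<String, Long> cache = new ConcurrentHashMap<>();
    // Counts how many times the loader actually ran.
    final AtomicInteger loads = new AtomicInteger();

    // Stores the candidate only if the channel has no value yet;
    // an existing value is returned untouched and the loader is not invoked.
    long cacheIfAbsent(String channel, long candidate) {
        return cache.computeIfAbsent(channel, key -> {
            loads.incrementAndGet();
            return candidate;
        });
    }
}
```

Calling `cacheIfAbsent("q", 100L)` and then `cacheIfAbsent("q", 200L)` returns 100 both times and runs the loader once, which is why a get() with a loader can stand in for a conditional put().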

sujithvaddi

@billkalter looks good.

billkalter added 2 commits June 20, 2018 14:29

Added short-term caching of the first column in the channel manifest

42e5656

Added 1 minute buffer to reduce impact of out-of-order writes

a9b8ecc

sujithvaddi reviewed Jun 20, 2018

sujithvaddi approved these changes Jun 20, 2018

billkalter merged commit 98d6195 into bazaarvoice:master Jun 20, 2018

billkalter deleted the cache-manifest-head branch June 20, 2018 20:39
