
Add stores for read-through and write-through caching #220

Merged: 5 commits merged into twitter:develop on Mar 7, 2014

Conversation

@rubanm (Contributor) commented Feb 26, 2014

Addresses #216

class ReadThroughStore[K, V](backingStore: ReadableStore[K, V], cache: Store[K, V])
    extends ReadableStore[K, V] {

  override def get(k: K): Future[Option[V]] = {
    // ...
A reviewer (Contributor) commented on the snippet above:

Don't we need the 'put if absent' semantics to be atomic to prevent a cache stampede? I was thinking we'd have to take an AsyncMutex and flatMap on it before doing anything. Or is the expectation that the client will ensure the store is used in a thread-safe manner?

@rubanm (author) replied:

Good question. I'm not sure we need "put if absent". I guess the idea is to refresh the cache value on every write (for the write-through case).

Agreed on making this thread-safe with a mutex. It'd be nice to have a more lightweight, lock-free solution here, though I can't think of one. I'll go ahead and add a mutex unless someone has a different way to fix this.
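For concreteness, a minimal sketch of the mutex pattern under discussion, using com.twitter.concurrent.AsyncMutex from Twitter's util library. The mutex field and the shape of getFromBackingStore are illustrative, not necessarily the code that landed:

import com.twitter.concurrent.AsyncMutex
import com.twitter.util.Future

private[this] val mutex = new AsyncMutex

// Serialize backing-store reads so that concurrent misses do not
// stampede the backing store.
private def getFromBackingStore(k: K): Future[Option[V]] =
  mutex.acquire().flatMap { permit =>
    backingStore.get(k)
      .flatMap { v => cache.put((k, v)).map { _ => v } } // refresh the cache
      .ensure { permit.release() } // release even if the read fails
  }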

@rubanm (author) commented Feb 28, 2014

- Added a mutex.
- Made all cache operations best effort.
- Backing-store failures are propagated to the client.
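A minimal sketch of what "best effort" could look like for the cache write, assuming a hypothetical refreshCache helper (not named in the PR): cache failures are swallowed, while backing-store failures still propagate to the caller.

// Write the freshly read value back to the cache, but never let a
// cache failure fail the overall read.
private def refreshCache(k: K, v: Option[V]): Future[Option[V]] =
  cache.put((k, v))
    .map { _ => v }                    // cache write succeeded
    .handle { case _: Exception => v } // best effort: ignore cache failures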

cache.get(k).flatMap { cacheValue =>
  cacheValue match {
    case None => getFromBackingStore(k)
    case some => Future.value(some)
  }
}
The reviewer (Contributor) commented on the snippet above:

This may be premature optimization, so feel free to ignore, but it would be sweet if we only used the mutex when there is a cache miss and we have to go to the backing store. In the case of a cache hit there is no need to lock, since it's a read-only operation. I feel like a cache hit will be the most common scenario, and we should try to make it lock-free if we can.

@rubanm (author) replied:

Makes sense. In database terms, this would give us the "read committed" isolation level, which I think should be good enough here.

override def get(k: K): Future[Option[V]] =
  cache.get(k).flatMap { cacheValue =>
    cacheValue match {
      case None => getFromBackingStore(k)
      case some => Future.value(some)
    }
  }
The reviewer (Contributor) commented on the snippet above:

Hmm, I think there is a bug in this code now. Imagine two threads arriving one after the other in a cache-miss scenario. The first thread does a cache.get, acquires the mutex, and does a get on the backing store. In the meantime the second thread does a cache.get, gets a miss, and blocks in getFromBackingStore until the first thread is done. Then it acquires the lock, does a get on the backing store again, and repopulates the cache. I think you may need to do a cache get inside the getFromBackingStore function, just to check that the cache wasn't already populated by the previous thread.
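A sketch of the double-check being suggested here: re-read the cache after acquiring the mutex, and only hit the backing store if the key is still missing (names as in the earlier sketch; illustrative only):

private def getFromBackingStore(k: K): Future[Option[V]] =
  mutex.acquire().flatMap { permit =>
    cache.get(k).flatMap {
      case some @ Some(_) =>
        Future.value(some) // another caller already populated the cache
      case None =>
        backingStore.get(k)
          .flatMap { v => cache.put((k, v)).map { _ => v } }
    }.ensure { permit.release() }
  }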

@rubanm (author) replied:

I do see a case where threads can queue up trying to get a hot key on a cache miss. Also, a cache get is probably less expensive than hitting the backing store again. But is this expected to happen very often?

Just wondering whether we need the additional cache get in the read path every time to account for this case. What do you think?

The reviewer (Contributor) replied:

I think an additional cache get would be fine.

@rubanm (author) replied:

Thought some more about this. It looks like we'll need to place the cache get inside the mutex block if we want this behavior.

The current code doesn't break any store semantics, since the latest value from the backing store is written to the cache.

So we can either keep the existing behavior and revisit it if it turns out to be a perf issue, or make the mutex block larger. I propose we do the former.

@MansurAshraf @johnynek thoughts?

@johnynek (Collaborator) commented Mar 7, 2014

Let's merge and get it in the next release and polish as needed.

johnynek added a commit referencing this pull request on Mar 7, 2014: Add stores for read-through and write-through caching

@johnynek merged commit d338ba6 into twitter:develop on Mar 7, 2014
@rubanm (author) commented Mar 7, 2014

Cool.

@rubanm deleted the feature/writethrough branch on Mar 12, 2014