Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix NPE when manually flushing a batch #2444

Closed
wants to merge 1 commit into from

Conversation

luciopaiva
Copy link
Contributor

@luciopaiva luciopaiva commented Jul 12, 2023

When manually flushing a batch with a set size, we can potentially run into a NPE if the command queue is empty when flush() is called. This PR is fixing that.

I ran into this problem while working on a code that sends commands to Redis in batches. In my code I defined an interface like this:

@BatchSize(100)
public interface MyCommands extends Commands, BatchExecutor {

    public void set(String key, String value);
}

For each batch, no matter how many commands it has, I want to make sure that it flushes the commands as soon as I finish enqueuing them, like this:

for (int i = 0; i < myBatchSize; i++) {
    api.set(key[i], value[i]);
}
api.flush();

And that's when I ran into a problem: when myBatchSize == 100, after the 100th is enqueued a flush is automatically triggered because of @BatchSize(100), and the queue gets emptied. Right after, api.flush() is called and Lettuce raises a NPE.

A simpler way to reproduce the problem is to just call flush() without ever enqueueing anything, so that's what I did in the test that I wrote.

The fix is really simple and just involves replacing a do..while with a while block that first checks if the queue is empty before trying to poll from it.

The bug is there since at least 5.2 (the earliest version I tested).


Make sure that:

  • You have read the contribution guidelines.
  • You have created a feature request first to discuss your contribution intent. Please reference the feature request ticket number in the pull request.
  • You use the code formatters provided here and have them applied to your changes. Don’t submit any formatting related changes.
  • You submit test cases (unit or integration tests) that back your changes.

RedisCommand<Object, Object, Object> poll = queue.poll();

assert poll != null;
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am removing this assert because it doesn't help with production code. I searched for uses of assert throughout the code and found only this and another one a few lines below, which I'm also removing.

@@ -166,7 +166,6 @@ private List<RedisCommand<Object, Object, Object>> prepareDefaultFlush(int consu

RedisCommand<Object, Object, Object> poll = queue.poll();

assert poll != null;
batch.add(poll);
Copy link
Contributor Author

@luciopaiva luciopaiva Jul 12, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The code here in prepareDefaultFlush() is already checking if the queue is empty before poll()ing from it, so I didn't have to do anything.

@luciopaiva luciopaiva force-pushed the fix-flush-batch-npe branch 2 times, most recently from 0a94832 to 6b05b7c Compare July 12, 2023 18:28
@luciopaiva luciopaiva force-pushed the fix-flush-batch-npe branch from 6b05b7c to 971cfec Compare July 12, 2023 19:12
@mp911de
Copy link
Collaborator

mp911de commented Jul 13, 2023

Care to provide the stack trace showing where the exception originates from?

@luciopaiva
Copy link
Contributor Author

Care to provide the stack trace showing where the exception originates from?

Sure. This is 6.3.0.BUILD-SNAPSHOT when I run the new test without the fix:

java.lang.NullPointerException
	at io.lettuce.core.StatefulRedisConnectionImpl.preProcessCommand(StatefulRedisConnectionImpl.java:201)
	at io.lettuce.core.StatefulRedisConnectionImpl.dispatch(StatefulRedisConnectionImpl.java:159)
	at io.lettuce.core.dynamic.SimpleBatcher.doFlush(SimpleBatcher.java:138)
	at io.lettuce.core.dynamic.SimpleBatcher.flush(SimpleBatcher.java:108)
	at io.lettuce.core.dynamic.SimpleBatcher.flush(SimpleBatcher.java:85)
	at io.lettuce.core.dynamic.BatchExecutableCommandLookupStrategy$1.execute(BatchExecutableCommandLookupStrategy.java:79)
	at io.lettuce.core.dynamic.RedisCommandFactory$CommandFactoryExecutorMethodInterceptor.invoke(RedisCommandFactory.java:235)
	at io.lettuce.core.dynamic.intercept.MethodInterceptorChain$MethodInterceptorContext.proceed(MethodInterceptorChain.java:118)
	at io.lettuce.core.dynamic.intercept.MethodInterceptorChain$PooledMethodInvocation.proceed(MethodInterceptorChain.java:201)
	at io.lettuce.core.dynamic.intercept.DefaultMethodInvokingInterceptor.invoke(DefaultMethodInvokingInterceptor.java:45)
	at io.lettuce.core.dynamic.intercept.MethodInterceptorChain$MethodInterceptorContext.proceed(MethodInterceptorChain.java:118)
	at io.lettuce.core.dynamic.intercept.MethodInterceptorChain.invoke(MethodInterceptorChain.java:80)
	at io.lettuce.core.dynamic.intercept.InvocationProxyFactory$InterceptorChainInvocationHandler.handleInvocation(InvocationProxyFactory.java:102)
	at io.lettuce.core.internal.AbstractInvocationHandler.invoke(AbstractInvocationHandler.java:80)
	at io.lettuce.core.dynamic.$Proxy47.flush(Unknown Source)
	at io.lettuce.core.dynamic.RedisCommandsBatchingIntegrationTests.shouldNotCrashWhenFlushCalledWithEmptyQueue(RedisCommandsBatchingIntegrationTests.java:110)

SimpleBatcher::doFlush() calls connection.dispatch() passing it a list with a null value, and then it eventually crashes inside StatefulRedisConnectionImpl::preProcessCommand() when it tries to read the null value as a command.

When manually flushing a batch with a set size, we could potentially run into a NPE if the command queue is empty when flush() is called.
@luciopaiva luciopaiva force-pushed the fix-flush-batch-npe branch from 971cfec to b7011eb Compare July 13, 2023 09:30
@mp911de mp911de added the type: bug A general bug label Jul 17, 2023
@mp911de mp911de self-assigned this Jul 17, 2023
@mp911de mp911de added this to the 6.2.5.RELEASE milestone Jul 17, 2023
mp911de pushed a commit that referenced this pull request Jul 17, 2023
When manually flushing a batch with a set size, we could potentially run into a NPE if the command queue is empty when flush() is called.
mp911de added a commit that referenced this pull request Jul 17, 2023
Guard code against potential null elements during concurrent flushing.
mp911de pushed a commit that referenced this pull request Jul 17, 2023
When manually flushing a batch with a set size, we could potentially run into a NPE if the command queue is empty when flush() is called.
mp911de added a commit that referenced this pull request Jul 17, 2023
Guard code against potential null elements during concurrent flushing.
@mp911de
Copy link
Collaborator

mp911de commented Jul 17, 2023

Thank you for your contribution. That's merged, polished, and backported now.

@mp911de mp911de closed this Jul 17, 2023
@luciopaiva luciopaiva deleted the fix-flush-batch-npe branch July 17, 2023 10:48
@luciopaiva
Copy link
Contributor Author

@mp911de how is this being backported? Will we see this fix in 5.x, 4.x, etc at some point?

@mp911de
Copy link
Collaborator

mp911de commented Jul 17, 2023

It is backported to Lettuce 6.2.5, which I released today. I maintain only a single bugfix branch at a time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: bug A general bug
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants