Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RetentionLeaseSyncIT.testRetentionLeasesSyncOnRecovery fails reproducibly on master and 6.x #38487

Closed
jkakavas opened this issue Feb 6, 2019 · 2 comments
Assignees
Labels
:Distributed Indexing/Distributed A catch all label for anything in the Distributed Area. Please avoid if you can. >test-failure Triaged test failures from CI

Comments

@jkakavas
Copy link
Member

jkakavas commented Feb 6, 2019

This looks like it was introduced in #38435. I hit it in a PR build, bit it reproduces consistently on master for me and looking through build failures it has been failing for the last few hours:

1:59:11 FAILURE 0.55s J4 | RetentionLeaseSyncIT.testRetentionLeasesSyncOnRecovery <<< FAILURES!
01:59:11    > Throwable #1: java.lang.AssertionError: shard [index][0], node[Ar2fOmwrR6OAWLivoCoGkA], relocating [J7fySsp9S8KPoeJf_8_q0w], [P], s[RELOCATING], a[id=dmm314dRQ0e7iieuOxH2gw, rId=x0Mzax-eStCxDuQdB39dJg] is not a primary shard in primary mode
01:59:11    > 	at __randomizedtesting.SeedInfo.seed([51682F94E41D8601:1B7C4AFD6CF749F6]:0)
01:59:11    > 	at org.elasticsearch.index.shard.IndexShard.assertPrimaryMode(IndexShard.java:1580)
01:59:11    > 	at org.elasticsearch.index.shard.IndexShard.renewRetentionLease(IndexShard.java:1954)
01:59:11    > 	at org.elasticsearch.index.seqno.RetentionLeaseSyncIT.testRetentionLeasesSyncOnRecovery(RetentionLeaseSyncIT.java:247)
01:59:11    > 	at java.lang.Thread.run(Thread.java:748)

Reproduction line:

REPRODUCE WITH: ./gradlew :server:integTest -Dtests.seed=51682F94E41D8601 \ -Dtests.class=org.elasticsearch.index.seqno.RetentionLeaseSyncIT \ -Dtests.method="testRetentionLeasesSyncOnRecovery" -Dtests.security.manager=true \ -Dtests.locale=da -Dtests.timezone=Pacific/Gambier -Dcompiler.java=11 -Druntime.java=8 

I'l be muting this and I'm assigning Jason so that he gets the ping in case he wants to push a quick test fix

@jkakavas jkakavas added >test-failure Triaged test failures from CI :Distributed Indexing/Distributed A catch all label for anything in the Distributed Area. Please avoid if you can. labels Feb 6, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-distributed

@jasontedor
Copy link
Member

I have #38483 opened for this which addresses this by ensuring the index will not be relocated once we start creating retention leases.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed Indexing/Distributed A catch all label for anything in the Distributed Area. Please avoid if you can. >test-failure Triaged test failures from CI
Projects
None yet
Development

No branches or pull requests

3 participants