
[Test Failure]RecoveryIT IndexStatsIT Tests Failure #5766

Closed
Bukhtawar opened this issue Jan 9, 2023 · 8 comments
Labels
bug Something isn't working distributed framework flaky-test Random test failure that succeeds on second run

Comments

@Bukhtawar Bukhtawar added bug Something isn't working untriaged flaky-test Random test failure that succeeds on second run labels Jan 9, 2023
@mch2
Member

mch2 commented Jan 10, 2023

Moving 2.5 specific tests to a separate issue.

@mch2 mch2 removed Severity-Blocker v2.5.0 'Issues and PRs related to version v2.5.0' labels Jan 10, 2023
@kotwanikunal kotwanikunal changed the title [Test Failure]RecoveryIT SegmentReplicationIT IndexStatsIT Tests Failure [Test Failure]RecoveryIT IndexStatsIT Tests Failure Jan 10, 2023
@kotwanikunal
Member

kotwanikunal commented Jan 10, 2023

Segment Replication tests have been muted and are being root-caused separately.
We started seeing these failures after the feature flag merge: #4959

@dreamer-89
Member

dreamer-89 commented Jan 10, 2023

I ran org.opensearch.backwards.IndexingIT.testSeqNoCheckpoints on the latest main. I think a few BWC tests are broken by commit #5282, which introduced a replicated boolean flag in ReplicationTracker. The failure happens because this flag is always sent, without checking the receiver's version.

./gradlew ':qa:mixed-cluster:v2.6.0#mixedClusterTest' --tests "org.opensearch.backwards.IndexingIT.testSeqNoCheckpoints" -Dtests.seed=BE95B870ADDCC4FF -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=en-ZA -Dtests.timezone=Etc/GMT-7 -Druntime.java=17
...
»  Caused by: org.opensearch.transport.RemoteTransportException: [v2.6.0-3][127.0.0.1:57996][internal:index/shard/recovery/start_recovery]
»  Caused by: org.opensearch.transport.RemoteTransportException: [v2.6.0-1][127.0.0.1:59081][internal:index/shard/recovery/handoff_primary_context]
»  Caused by: java.lang.IllegalStateException: unexpected byte [0x6d]
»       at org.opensearch.common.io.stream.StreamInput.readBoolean(StreamInput.java:588) ~[opensearch-2.6.0-SNAPSHOT.jar:2.6.0-SNAPSHOT]
»       at org.opensearch.common.io.stream.StreamInput.readBoolean(StreamInput.java:578) ~[opensearch-2.6.0-SNAPSHOT.jar:2.6.0-SNAPSHOT]
»       at org.opensearch.index.seqno.ReplicationTracker$CheckpointState.<init>(ReplicationTracker.java:713) ~[opensearch-2.6.0-SNAPSHOT.jar:2.6.0-SNAPSHOT]
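
For readers following along, here is a minimal sketch of the version-gating pattern the comment above refers to: a newly added field is written and read only when the negotiated stream version knows about it. This is not the OpenSearch code. It uses plain `java.io` streams, and the names `VersionGatedWireDemo`, `V_OLD`, `V_NEW`, the field layout, and the default of `true` for a missing flag are all assumptions made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

final class VersionGatedWireDemo {
    // Hypothetical wire versions, not OpenSearch's real Version constants.
    static final int V_OLD = 1; // does not know about the replicated flag
    static final int V_NEW = 2; // first version that carries the replicated flag

    // Simplified stand-in for a checkpoint state record (assumed layout).
    static final class CheckpointState {
        final long localCheckpoint;
        final boolean inSync;
        final boolean replicated;

        CheckpointState(long localCheckpoint, boolean inSync, boolean replicated) {
            this.localCheckpoint = localCheckpoint;
            this.inSync = inSync;
            this.replicated = replicated;
        }
    }

    // Serialize for a connection whose negotiated version is streamVersion.
    static byte[] write(CheckpointState state, int streamVersion) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeLong(state.localCheckpoint);
            out.writeBoolean(state.inSync);
            // Gate the new field: an old peer would not expect this extra byte.
            if (streamVersion >= V_NEW) {
                out.writeBoolean(state.replicated);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Deserialize using the same negotiated version as the writer.
    static CheckpointState read(byte[] payload, int streamVersion) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(payload))) {
            long localCheckpoint = in.readLong();
            boolean inSync = in.readBoolean();
            // Mirror the writer's check; fall back to an assumed default otherwise.
            boolean replicated = streamVersion >= V_NEW ? in.readBoolean() : true;
            return new CheckpointState(localCheckpoint, inSync, replicated);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        CheckpointState state = new CheckpointState(42L, true, false);
        System.out.println("old wire bytes: " + write(state, V_OLD).length);
        System.out.println("new wire bytes: " + write(state, V_NEW).length);
    }
}
```

If the writer skips the check and always emits the extra byte, a reader on the older version leaves that byte in the stream, and the next field is decoded from the wrong offset. That is the general kind of mismatch that can surface as an `unexpected byte` error in a later `readBoolean` call.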

@anasalkouz
Member

Should be resolved. Closing the issue.

@saratvemulapalli
Member

Reopening as the flaky tests show up again on main.
#7674 (comment)

@andrross
Member

@saratvemulapalli I'm going to resolve this as I believe there was a version check mismatch fixed in #7672 that caused the failure you linked. Please reopen if that's wrong.

@saratvemulapalli
Member

Thanks @andrross. Rebase fixed it for me.
