Flaky tests in TestPhoenixConnectorTest#testRowLevelDelete: NotServingRegionException: hbase:meta,,1.1588230740 is closing #9778

Closed
joshthoward opened this issue Oct 26, 2021 · 6 comments

Comments

@joshthoward
Member

TestPhoenixConnectorTest#testRowLevelDelete is flaky.

Error:  Tests run: 229, Failures: 1, Errors: 0, Skipped: 54, Time elapsed: 471.792 s <<< FAILURE! - in TestSuite
Error:  io.trino.plugin.phoenix5.TestPhoenixConnectorTest.testRowLevelDelete  Time elapsed: 1.12 s  <<< FAILURE!
java.lang.RuntimeException: 
Failed after attempts=4, exceptions:
2021-10-26T16:57:21.382Z, RpcRetryingCaller{globalStartTime=1635267441379, pause=100, maxAttempts=4}, org.apache.hadoop.hbase.NotServingRegionException: org.apache.hadoop.hbase.NotServingRegionException: hbase:meta,,1.1588230740 is closing
	at org.apache.hadoop.hbase.regionserver.HRegion.startRegionOperation(HRegion.java:8455)
	at org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:2972)
	at org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:2967)
	at org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:2961)
	at org.apache.hadoop.hbase.regionserver.RSRpcServices.get(RSRpcServices.java:2642)
	at org.apache.hadoop.hbase.regionserver.RSRpcServices.get(RSRpcServices.java:2569)
	at org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:42274)
	at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:418)
	at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:133)
	at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:338)
	at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:318)
@joshthoward
Member Author

Hit by #9715

@kokosing
Member

Failed after attempts=4, exceptions:

Can we try to increase the number of attempts?

@losipiuk
Member

Different test method but same cause: https://github.com/trinodb/trino/runs/4328474265?check_suite_focus=true

@ebyhr
Member

ebyhr commented Nov 26, 2021

I guess the number of attempts comes from the value of HBASE_CLIENT_RETRIES_NUMBER (=hbase.client.retries.number) in TestingPhoenixServer. I will send a PR.

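The attempt count seems to be computed as:

(hbase.client.retries.number * hbase.client.serverside.retries.multiplier) + 1, where the 1 is the increment added by ConnectionUtils.retries2Attempts

With the values below, that is (1 * 3) + 1 = 4, which matches attempts=4 in the error.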

  • hbase.client.retries.number is 1 in TestingPhoenixServer
  • hbase.client.serverside.retries.multiplier is 3 by default
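
To make the arithmetic concrete, here is a minimal standalone sketch of that calculation. The class and method names are hypothetical, and `retries2Attempts` below only mirrors what I understand `ConnectionUtils.retries2Attempts` to do (add one for the initial attempt); it does not call HBase itself.

```java
public class HBaseAttemptsSketch {
    // Assumption: approximates ConnectionUtils.retries2Attempts, i.e. the
    // configured retries plus one initial attempt, with a floor of 1.
    static int retries2Attempts(int retries) {
        return Math.max(1, retries == Integer.MAX_VALUE ? Integer.MAX_VALUE : retries + 1);
    }

    // clientRetries      ~ hbase.client.retries.number
    // serversideMultiplier ~ hbase.client.serverside.retries.multiplier
    static int serverSideAttempts(int clientRetries, int serversideMultiplier) {
        return retries2Attempts(clientRetries * serversideMultiplier);
    }

    public static void main(String[] args) {
        // Values from this issue: retries.number = 1, multiplier = 3 (default)
        System.out.println(serverSideAttempts(1, 3)); // prints 4
        // Hypothetical higher setting, to show how the attempt count would grow
        System.out.println(serverSideAttempts(3, 3)); // prints 10
    }
}
```

Under these assumptions, raising hbase.client.retries.number in TestingPhoenixServer should raise maxAttempts roughly threefold per unit, which gives hbase:meta more time to finish closing and reopening before the caller gives up.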

@findepi
Member

findepi commented Jan 26, 2023

Thanks @ebyhr for looking into this!

Can this issue be closed now?

@findepi changed the title from "Flaky tests in TestPhoenixConnectorTest#testRowLevelDelete" to "Flaky tests in TestPhoenixConnectorTest#testRowLevelDelete: NotServingRegionException: hbase:meta,,1.1588230740 is closing" on Jan 26, 2023
@ebyhr
Member

ebyhr commented Jan 26, 2023

@findepi I think so. Thanks for the reminder.

@ebyhr closed this as completed on Jan 26, 2023