You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
we got this situation because two thread dead lock.
(This only occur when whole cluster shutdown, and only happens on leader if its two follower shutdown before it).
Thread 1: (Thread 132 in pstack)
running a rebuild index job ,
and call kvsotre::asyncRemoveRange,
but as its two follower already shutdown,
this thread will run into infinite loop until raft.stop called.
raft.stop() will be called by dtor of NebulaStore
Thread 2: (Thread 1 in pstack)
calling StorageServer::stop() to stop all service.
but we need to stop TaskManager before reset NebulaStore.
how ever, TaskManger need to wait all its task running.
which is the above thread.
Solution:
We may have a solution that we can set raft service stop at the stop() function of NebulaStore,
instead of the dtor of it.
Please check the FAQ documentation before raising an issue
Please check the FAQ documentation and old issues before raising an issue in case someone has asked the same question that you are asking.
Describe the bug (required)
Rebuild edge index running forever and storaged hang on exit.
seems that the index build task is stucked on a baton.wait():
the stucked method:
nebula/src/storage/admin/RebuildIndexTask.cpp
Lines 195 to 214 in a14d7b4
full pstack:
365749.txt
Your Environments (required)
uname -a
g++ --version
orclang++ --version
lscpu
How To Reproduce(required)
Steps to reproduce the behavior:
Expected behavior
A clear and concise description of what you expected to happen.
Additional context
Provide logs and configs, or any other context to trace the problem.
The text was updated successfully, but these errors were encountered: