OutOfMemoryError not killing the node #28967
Comments
Pinging @elastic/es-core-infra
Relates to #21721. /cc @jasontedor @s1monw
@nachogiljaldo Can you attach your complete log (in a gist)? This should not be possible. After the error message
The first one is a bug and I will open a fix soon. The second one I do not believe without sufficiently strong evidence. The third one I am still investigating.
@nachogiljaldo Would you please provide evidence that this one did not result in the node dying?
@nachogiljaldo Do you have any more to this stack trace?
Btw. this is printed to stdout:
@henrikno Is there a stack trace?
No, that's the weird thing. Just that line, several times, with different thread names.
Okay, no worries; I have a gut feeling what is occurring. I will try to reproduce and validate. This will knock out the first two of these. I will still need more information on the third one.
Okay, here is the problem. The change #27482 introduced a bug that causes a security manager exception when we attempt to exit after a fatal error. This was immediately detected and #27518 was introduced to fix it. The bug from #27482 was supposed to never have been released, yet the backport of #27518 to 6.1 was missed; it never happened. This means that #27482 is in 6.1 without the corresponding fix. This bug, which prevents exit in situations like this, occurs in 6.1 only; upgrading to 6.2 will address the issue.
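To make the failure mode above concrete, here is a minimal sketch of how a handler can log "fatal error ... exiting" and still leave the JVM running when the exit call itself throws. This is not Elasticsearch's actual handler: the class name `FatalErrorHandlerSketch`, the injectable `exitAction`, and the exit status `1` are all hypothetical, chosen so the behaviour can be simulated without a real security manager.

```java
import java.util.function.IntConsumer;

/**
 * Hypothetical sketch, NOT the real ElasticsearchUncaughtExceptionHandler.
 * The exit action is injected so that a denied exit can be simulated.
 */
final class FatalErrorHandlerSketch implements Thread.UncaughtExceptionHandler {

    // Assumption: in a real process this would be something like Runtime.getRuntime()::halt.
    private final IntConsumer exitAction;

    FatalErrorHandlerSketch(IntConsumer exitAction) {
        this.exitAction = exitAction;
    }

    /** Treat every java.lang.Error (e.g. OutOfMemoryError) as fatal. */
    static boolean isFatal(Throwable t) {
        return t instanceof Error;
    }

    @Override
    public void uncaughtException(Thread thread, Throwable t) {
        if (isFatal(t)) {
            System.err.println("fatal error in thread [" + thread.getName() + "], exiting: " + t);
            // If this call throws (for example a SecurityException because a
            // security manager denies the exit), the message above has already
            // been logged but the process keeps running.
            exitAction.accept(1);
        } else {
            System.err.println("uncaught exception in thread [" + thread.getName() + "]: " + t);
        }
    }
}
```

In a real process the handler could be installed with `Thread.setDefaultUncaughtExceptionHandler(new FatalErrorHandlerSketch(Runtime.getRuntime()::halt))`. Passing an action that throws `SecurityException` reproduces the symptom reported here: the "exiting" line is printed, but nothing exits.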
I understand the situation with the third exception and will work on a fix.
I opened #28973.
Elasticsearch version (bin/elasticsearch --version): 6.1.3
Plugins installed: [ingest-geoip,ingest-user-agent,repository-s3,x-pack]
JVM version (java -version):
OS version (uname -a if on a Unix-like system): Linux bd9a03495c76 4.4.0-66-generic #87~14.04.1-Ubuntu SMP Fri Mar 3 17:32:36 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
Description of the problem including expected versus actual behavior:
Node suffers an OOM error and the following message is logged:
[2018-03-09T04:18:01,808][ERROR][org.elasticsearch.bootstrap.ElasticsearchUncaughtExceptionHandler] fatal error in thread [elasticsearch[instance-0000000008][bulk][T#739]], exiting java.lang.OutOfMemoryError: Java heap space
However, the instance does not die; it keeps throwing OOMs from time to time and remains totally unusable.
Steps to reproduce:
Provide logs (if relevant):
Lots of similar logs are there in the last 24h:
This is the full stacktrace of one of the errors:
This is the first OOM that was thrown and did not result in the instance dying:
Another, later error I've seen that does not result in the instance dying: