Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Seg Fault on Java 21 JRE #748

Closed
thai-op opened this issue Jan 14, 2024 · 8 comments
Closed

Seg Fault on Java 21 JRE #748

thai-op opened this issue Jan 14, 2024 · 8 comments
Labels
bug This issue is a bug. closed-for-staleness p2 This is a standard priority issue

Comments

@thai-op
Copy link

thai-op commented Jan 14, 2024

Describe the bug

This is an error message from a job that I run in a container in AWS batch. I don't have the core dump and just the message:

#
# A fatal error has been detected by the Java Runtime Environment:
#
# SIGSEGV (0xb) at pc=0x00007f3394e82bdb, pid=1, tid=92
#
 Environment Temurin-21.0.1+12 (21.0.1+12) (build 21.0.1+12-LTS)
 Environment Temurin-21.0.1+12 (21.0.1+12) (build 21.0.1+12-LTS)
# Java VM: OpenJDK 64-Bit Server VM Temurin-21.0.1+12 (21.0.1+12-LTS, mixed mode, sharing, tiered, compressed class ptrs, z gc, linux-amd64)
# Problematic frame:
# C [AWSCRT_9028473956044655129libaws-crt-jni.so+0x8bbdb] Java_software_amazon_awssdk_crt_http_HttpStreamBase_httpStreamBaseIncrementWindow+0x1b
#
# Core dump will be written. Default location: //core.1
#
# An error report file with more information is saved as:
# //hs_err_pid1.log
[144.782s][warning][os] Loading hsdis library failed
#
# If you would like to submit a bug report, please visit:
# https://github.com/adoptium/adoptium-support/issues
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#
[error occurred during error reporting (), id 0xb, SIGSEGV (0xb) at pc=0x00007f38bb2ea898]

The core interaction with AWS involves DynamoDB as a cache, SSM parameter stores, secret values, etc. nothing special there. We heavily use virtual threads for non-blocking network IO with high parallelism. This was working fine using the default Apache Http client but we wanted the newer non-blocking http client so this is disappointing.

JRE Version: Temurin-21.0.1+12 running on Redhat UBI9 image.

Expected Behavior

It should not seg fault.

Current Behavior

It killed the vm.

Reproduction Steps

No way to reproduce thus far.

Possible Solution

No response

Additional Information/Context

No response

aws-crt-java version used

2.22.13

Java version used

21.0.1+12

Operating System and version

Redhat UBI9

@thai-op thai-op added bug This issue is a bug. needs-triage This issue or PR still needs to be triaged. labels Jan 14, 2024
@TingDaoK
Copy link
Contributor

aws/aws-sdk-java-v2#4833 this PR from aws-sdk-java-v2 probably fixes the issue

@TingDaoK TingDaoK added response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. and removed bug This issue is a bug. needs-triage This issue or PR still needs to be triaged. labels Jan 17, 2024
@thai-op
Copy link
Author

thai-op commented Jan 17, 2024

Thanks, if there’s a newer sdk version I’ll try it today to see if things won’t crash again

@github-actions github-actions bot removed the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. label Jan 17, 2024
@jmklix jmklix added the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. label Jan 18, 2024
@ravi-signal
Copy link

Hi, I also hit this (on 2.23.1 initially). I tried upgrading to 2.23.5, which AIUI includes aws/aws-sdk-java-v2#4833 , and I continue to see it:

#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGSEGV (0xb) at pc=0x00007f07fc28c4eb, pid=1, tid=131
#
# JRE version: OpenJDK Runtime Environment Temurin-21.0.1+12 (21.0.1+12) (build 21.0.1+12-LTS)
# Java VM: OpenJDK 64-Bit Server VM Temurin-21.0.1+12 (21.0.1+12-LTS, mixed mode, sharing, tiered, compressed oops, compressed class ptrs, g1 gc, linux-amd64)
# Problematic frame:
# C  [AWSCRT_15418525997656330583libaws-crt-jni.so+0x8c4eb]  Java_software_amazon_awssdk_crt_http_HttpStreamBase_httpStreamBaseIncrementWindow+0x1b
#

@github-actions github-actions bot removed the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. label Jan 18, 2024
@jmklix jmklix added bug This issue is a bug. p2 This is a standard priority issue labels Jan 22, 2024
@jmklix
Copy link
Member

jmklix commented Jan 22, 2024

The most recent merged PR should fix this Seg Fault. Please use the most recent version of the java-v2 sdk and let us know if you run into any problems

@jmklix jmklix added the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. label Jan 22, 2024
@TingDaoK
Copy link
Contributor

To add more detail. aws/aws-sdk-java-v2#4839 PR from java SDK is supposed to fix the issue. They plan to release another version today.

@github-actions github-actions bot removed the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. label Jan 22, 2024
@zoewangg
Copy link
Contributor

Hey all, a fix has been released as part of 2.23.8. Could you try it and let us know if you continue to see the issue?

@zoewangg zoewangg added the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. label Jan 22, 2024
@ravi-signal
Copy link

I have not been able to reproduce the issue on 2.23.8. FWIW, I was able to reproduce the issue fairly quickly with the previous builds.

Thank you for the quick fix/release cycle!

@github-actions github-actions bot removed the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 7 days. label Jan 23, 2024
@TingDaoK TingDaoK added the closing-soon This issue will automatically close in 4 days unless further comments are made. label Feb 1, 2024
@github-actions github-actions bot added closed-for-staleness and removed closing-soon This issue will automatically close in 4 days unless further comments are made. labels Feb 1, 2024
@github-actions github-actions bot closed this as completed Feb 1, 2024
@porwalu
Copy link

porwalu commented Nov 12, 2024

Hello Team,
We are seeing this on version 2.22.0 and our main use case it for accessing S3
However, we are seeing this issue on some of our environments and not all. It occurs after 18 hours - 36 hours on different environments.
While it is good to see this resolved, is there a way to reproduce it at will so that we can test it sooner and confidently rather than waiting for a long interval based on historic crashes?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug This issue is a bug. closed-for-staleness p2 This is a standard priority issue
Projects
None yet
Development

No branches or pull requests

6 participants