Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exception occurred in REST handler: Job X not found #256

Open
liad5h opened this issue Jul 4, 2022 · 3 comments
Open

Exception occurred in REST handler: Job X not found #256

liad5h opened this issue Jul 4, 2022 · 3 comments

Comments

@liad5h
Copy link

liad5h commented Jul 4, 2022

Hey,

I am using the operator in version docker.io/lyft/flinkk8soperator:1355d206b5fb4efd6f6e4ccf24085a87a29443c5.
Running ok aws eks version 1.21.

Sometimes The job manager floods the log with this message and when it starts, I am unable to redeploy the flinkapp without reaching the "DeployFailed" state

log: 2022-07-04 06:03:35,466 ERROR org.apache.flink.runtime.rest.handler.job.JobDetailsHandler [] - Exception occurred in REST handler: Job <HASH> not found

at the same time, task manager does not have any logs in it (makes sense)

in the operator logs I see the below log for multiple flink apps:
{"json":{"app_name":"esp-process-666","ns":"int-streaming","phase":"Running"},"level":"warning","msg":"Failed to reconcile resource <NAMESPACE>/<APP NAME>: GetJobOverview call failed with status 404 Not Found and message ''","ts":"2022-07-04T06:08:35Z"}

is this a known issue?
how do I recover from this without deleting and redeploying the flink app?

@L-LGL
Copy link

L-LGL commented Jun 14, 2023

I have a similar problem and wish I had an answer sooner

@lydian
Copy link

lydian commented Aug 25, 2023

I also have the similar issue.

@liad5h
Copy link
Author

liad5h commented Aug 26, 2023

I tried to fix this by enabling kubernetes HA but then i had other issues with checkpoints.

i ended up replacing this operator with this operator https://github.com/apache/flink-kubernetes-operator.
So far it works much better

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants