azure-spring-cloud-starter-eventhubs stops picking up events and won't start again till service restart #19848

Closed
J4cku opened this issue Mar 15, 2021 · 13 comments
Labels
azure-spring: All azure-spring related issues
azure-spring-eventhubs: Spring event hubs related issues.
bug: This issue requires a change to an existing behavior in the product in order to be resolved.
Client: This issue points to a problem in the data-plane of the library.
customer-reported: Issues that are reported by GitHub users external to the Azure organization.
question: The issue doesn't require a change to the product in order to be resolved. Most issues start as that
Comments

@J4cku

J4cku commented Mar 15, 2021

Hi,
We are seeing a strange problem that is nearly impossible for us to debug.
After working fine for some time, our Event Hubs listener stops picking up new events, and we have no idea why.
The logs look normal, as always, e.g.:

2021-03-15 10:10:52.315  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : Starting load balancer for 06bbbb0e-f94f-476d-b330-d3fc229922dc
2021-03-15 10:10:52.315  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : CheckpointStore returned 2 ownership records
2021-03-15 10:10:52.315  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : Event Hubs service returned 2 partitions
2021-03-15 10:10:52.316  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : Number of active ownership records 2
2021-03-15 10:10:52.316  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : Number of active event processors 1
2021-03-15 10:10:52.316  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : Expected min partitions per event processor = 2, expected number of event processors with additional partition = 0
2021-03-15 10:10:52.316  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : Load is balanced with this event processor owning 2 partitions
2021-03-15 10:10:52.318  INFO [qiaspace-device-registration,,,] 1 --- [     elastic-15] c.a.m.e.PartitionBasedLoadBalancer       : Load balancing completed successfully
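
The figures in the log above (2 partitions, 1 active processor, a minimum of 2 partitions per processor, 0 processors with an additional partition) are consistent with a plain even split. The sketch below reproduces that arithmetic; it is inferred from the log wording only and is not the library's actual code, and the class and method names are hypothetical.

```java
// Hypothetical helper: reproduces the numbers printed by the load balancer log,
// assuming an even split of partitions across processors.
final class PartitionShare {

    // Minimum number of partitions each processor owns under an even split.
    static int minPartitionsPerProcessor(int partitions, int processors) {
        return partitions / processors;
    }

    // Number of processors that take one extra partition to cover the remainder.
    static int processorsWithAdditionalPartition(int partitions, int processors) {
        return partitions % processors;
    }

    public static void main(String[] args) {
        int partitions = 2; // "Event Hubs service returned 2 partitions"
        int processors = 1; // "Number of active event processors 1"
        System.out.println("min partitions per processor = "
                + minPartitionsPerProcessor(partitions, processors));
        System.out.println("processors with additional partition = "
                + processorsWithAdditionalPartition(partitions, processors));
    }
}
```

With the values from the log this yields 2 and 0, so the balancer considers its work done even though no events are being delivered.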

Load balancing runs regularly, but for some reason the service does not pick up new events until I restart it.
I have no idea what we can do to help investigate the problem. The same thing happens with the integration as well as the binder.
It seems to have worked fine earlier when we used @AzureListener (but we moved to webflux, and that approach does not work without spring-web).
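
Since the logs stay clean while the listener is stuck, one way to make the stall visible is a small liveness check that tracks when the last event arrived. The following is a minimal sketch of that idea using only the JDK; the class name, the threshold, and how it would be wired into the event handler are all assumptions, not part of the Azure SDK or the Spring starter.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical stall detector: flags a consumer that has been silent for too long.
final class EventStallDetector {
    private final Duration maxSilence;
    private final AtomicReference<Instant> lastEventAt;

    EventStallDetector(Duration maxSilence, Instant startedAt) {
        this.maxSilence = maxSilence;
        this.lastEventAt = new AtomicReference<>(startedAt);
    }

    // Call from the event handler each time an event is received.
    void recordEvent(Instant receivedAt) {
        lastEventAt.set(receivedAt);
    }

    // True when no event has been recorded for longer than maxSilence.
    boolean isStalled(Instant now) {
        return Duration.between(lastEventAt.get(), now).compareTo(maxSilence) > 0;
    }
}
```

A scheduled task or health check could call `isStalled(Instant.now())` and log a warning or fail the health probe. A genuinely quiet event hub would also trip it, so the threshold has to reflect the expected traffic.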

@ghost added the needs-triage (new issue that needs to be triaged to the appropriate team), customer-reported, and question labels Mar 15, 2021
@J4cku
Author

J4cku commented Mar 15, 2021

I kinda lied. @AzureMessageListener still works with webflux; we are trying that now

@joshfree added the azure-spring, azure-spring-eventhubs, and Client labels Mar 15, 2021
@ghost removed the needs-triage label Mar 15, 2021
@joshfree added this to the [2021] April milestone Mar 15, 2021
@joshfree
Member

@conniey could you please take a look?

@conniey
Member

conniey commented Mar 18, 2021

This looks like it could be related to #18070. The spring starter uses azure-messaging-eventhubs under the hood.

@saragluna self-assigned this Mar 22, 2021
@saragluna
Member

@J4cku
Author

J4cku commented Mar 22, 2021

@saragluna I haven't, as that version is not yet available on Maven Central:
[image]

@saragluna
Member

saragluna commented Mar 22, 2021

@J4cku
Ah, you could try version 2.2.0. We're in the process of releasing 2.3.0 today, but the release is blocked by our engineering pipeline.

@J4cku
Author

J4cku commented Mar 22, 2021

@saragluna I already tried 2.2.0, and it fails there too.

@wujack778
Contributor

@J4cku Could you share a link to your source code?

@saragluna
Member

@J4cku any update on this issue?

@J4cku
Author

J4cku commented Apr 7, 2021

@saragluna we ditched the binder implementation; we now use the @AzureMessageListener implementation instead, since that one does not get stuck.

@saragluna
Member

@J4cku glad the listener approach works, but would you mind sharing the pom file you used with the binder? It will help us investigate what's wrong with the binder.

@conniey
Member

conniey commented Apr 14, 2021

This is probably related to #18070. We released an update to the underlying library this morning.

@chenrujun added the bug label Apr 27, 2021
@saragluna assigned yiliuTo and unassigned backwind1233 and saragluna May 20, 2021
@yiliuTo
Member

yiliuTo commented May 28, 2021

As mentioned by @conniey, this issue has been fixed in com.azure:azure-messaging-eventhubs:5.7.0, so you can use azure-spring-cloud-stream-binder-eventhubs 2.4.0 or later. If the problem persists, please reopen this issue.
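
For a Maven project, picking up the fixed binder might look like the fragment below. This is a sketch only: the artifact name and version come from the comment above, while the `com.azure.spring` groupId is an assumption and should be checked against Maven Central.

```xml
<dependency>
  <!-- groupId is an assumption; the thread names only the artifact and version -->
  <groupId>com.azure.spring</groupId>
  <artifactId>azure-spring-cloud-stream-binder-eventhubs</artifactId>
  <version>2.4.0</version>
</dependency>
```

Running `mvn dependency:tree` afterwards shows which azure-messaging-eventhubs version is actually resolved, which should be 5.7.0 or later for the fix to apply.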

@yiliuTo closed this as completed May 28, 2021
@github-actions bot locked and limited conversation to collaborators Apr 12, 2023