This repository has been archived by the owner on Jun 5, 2018. It is now read-only.

SRV records for stopped instances don't always get removed #21

Open
aaronbuchanan opened this issue Jul 28, 2017 · 3 comments

@aaronbuchanan

We were getting intermittent 502 errors following deployments. Upon investigation, we found that SRV records still existed for tasks that had been stopped. Has anyone seen similar issues with this approach? Is there a best practice for keeping the SRV records healthy?

@jogster

jogster commented Jul 31, 2017

I found the same. This is because the ecssd_agent listens for events on the Docker port to register and deregister the Route 53 SRV records.

Are you running the healthcheck Lambda? That should be able to detect dead SRV record entries and remove them automatically.
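For anyone wondering what this kind of cleanup could look like, here is a minimal sketch. It is not the healthcheck Lambda from this repository. The function names (`parse_srv`, `plan_cleanup`) are hypothetical, and it assumes the caller already has the list of live `(host, port)` endpoints from some other source, such as the currently running tasks.

```python
from typing import Iterable, List, Optional, Tuple

Endpoint = Tuple[str, int]


def parse_srv(value: str) -> Endpoint:
    """Return (target, port) from an SRV value: 'priority weight port target'."""
    _priority, _weight, port, target = value.split()
    # Normalise the target so 'Host-A.internal.' matches 'host-a.internal'.
    return target.rstrip(".").lower(), int(port)


def plan_cleanup(
    name: str,
    ttl: int,
    values: List[str],
    live: Iterable[Endpoint],
) -> Optional[dict]:
    """Build a change batch that drops SRV values with no live endpoint.

    Hypothetical helper: `live` is assumed to come from the caller.
    Returns None when every value still points at a live endpoint.
    """
    live_set = {(host.rstrip(".").lower(), port) for host, port in live}
    keep = [v for v in values if parse_srv(v) in live_set]
    if len(keep) == len(values):
        return None  # nothing stale

    record_set = {"Name": name, "Type": "SRV", "TTL": ttl}
    if keep:
        # Some endpoints are still alive: overwrite the set with the survivors.
        action = "UPSERT"
        record_set["ResourceRecords"] = [{"Value": v} for v in keep]
    else:
        # Nothing is alive: delete the whole record set. A DELETE has to
        # list the values exactly as they currently exist.
        action = "DELETE"
        record_set["ResourceRecords"] = [{"Value": v} for v in values]
    return {"Changes": [{"Action": action, "ResourceRecordSet": record_set}]}
```

The returned dict follows the `ChangeBatch` shape that boto3's Route 53 client takes in `change_resource_record_sets`, so a scheduled function could pass it through along with the hosted zone ID. How the live endpoints are discovered is deliberately left out, since that depends on the setup.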

@wprater

wprater commented Aug 22, 2017

> Are you running the healthcheck Lambda? That should be able to detect dead SRV record entries and remove them automatically.

Can look into this, but I imagine there will be more lag than if the agent could listen to container events as well.

@aaronbuchanan

Hi @jogster, is this healthcheck Lambda configuration defined anywhere? Route 53's health checks don't seem to support SRV records (or any multi-answer DNS lookups).
