chore(api): Do not log healthcheck error if downfile exists #6635

untitaker · 2024-12-10T18:42:15Z

Kubernetes tells the old snuba pods to mark itself as unhealthy, so that
envoy stops sending traffic to it.

Snuba reports itself as unhealthy to Sentry.

Gocd checks Sentry for new errors.

Gocd stalls the deployment because there are new errors.

Kubernetes tells the old snuba pods to mark itself as unhealthy, so that envoy stops sending traffic to it. Snuba reports itself as unhealthy to Sentry. Gocd checks Sentry for new errors. Gocd stalls the deployment because there are new errors.

volokluev · 2024-12-10T18:48:47Z

snuba/utils/health_info.py

+        if not down_file_exists:
+            logger.error(f"Snuba health check failed! Tags: {metric_tags}")


I would just info log it if the down file exists, it's useful log info, just not an error

untitaker · 2024-12-10T19:06:38Z

I found this additional health_envoy check. If we already check the downfile for envoy this way, what other services need to react to the downfile? Could they use the same endpoint instead?

snuba/snuba/web/views.py

Lines 212 to 213 in 120c358

    
           @application.route("/health_envoy") 
        
           def health_envoy() -> Response:

untitaker · 2024-12-10T19:08:44Z

snuba/utils/health_info.py

@@ -107,7 +107,10 @@ def get_health_info(thorough: Union[bool, str]) -> HealthInfo:
    payload = json.dumps(body)
    if status != 200:
        metrics.increment("healthcheck_failed", tags=metric_tags)
-        logger.error(f"Snuba health check failed! Tags: {metric_tags}")
+        if down_file_exists:
+            logger.error("Snuba health check failed! Tags: %s", metric_tags)


Moving away from f-strings to logger-native string formatting. The reason we got a new issue at all is because we were interpolating the string before logging. Now we only get one issue instead of N for every possible tag combination.

This works because then it's not a "new" issue in Sentry, is that correct?

yes that's the goal. now it's grouping by the literal string "Snuba health check failed! Tags: %s"

MeredithAnya · 2024-12-10T19:45:29Z

snuba/utils/health_info.py

+        if down_file_exists:
+            logger.error("Snuba health check failed! Tags: %s", metric_tags)
+        else:
+            logger.info("Snuba health check failed! Tags: %s", metric_tags)

    if status != 200 or down_file_exists:
        logger.info(payload)


@untitaker do we need this log then? if you added the check above?

untitaker requested a review from a team as a code owner December 10, 2024 18:42

volokluev reviewed Dec 10, 2024

View reviewed changes

restore info log, and use different formatter

30282d4

untitaker commented Dec 10, 2024

View reviewed changes

untitaker requested a review from volokluev December 10, 2024 19:31

MeredithAnya reviewed Dec 10, 2024

View reviewed changes

remove extra log

0729bff

untitaker requested a review from MeredithAnya December 11, 2024 20:54

evanh approved these changes Dec 12, 2024

View reviewed changes

untitaker merged commit 5fc3d7d into master Dec 12, 2024
31 checks passed

untitaker deleted the healthcheck-error branch December 12, 2024 17:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(api): Do not log healthcheck error if downfile exists #6635

chore(api): Do not log healthcheck error if downfile exists #6635

untitaker commented Dec 10, 2024

volokluev Dec 10, 2024

untitaker commented Dec 10, 2024

untitaker Dec 10, 2024

evanh Dec 12, 2024

untitaker Dec 12, 2024

MeredithAnya Dec 10, 2024

		if not down_file_exists:
		logger.error(f"Snuba health check failed! Tags: {metric_tags}")

chore(api): Do not log healthcheck error if downfile exists #6635

chore(api): Do not log healthcheck error if downfile exists #6635

Conversation

untitaker commented Dec 10, 2024

volokluev Dec 10, 2024

Choose a reason for hiding this comment

untitaker commented Dec 10, 2024

untitaker Dec 10, 2024

Choose a reason for hiding this comment

evanh Dec 12, 2024

Choose a reason for hiding this comment

untitaker Dec 12, 2024

Choose a reason for hiding this comment

MeredithAnya Dec 10, 2024

Choose a reason for hiding this comment