🔍 Define Alerts for our Modernisation Platform and route them #2235

Closed
6 tasks
Tracked by #1590
julialawrence opened this issue Nov 6, 2023 · 1 comment
Labels
data-platform-apps-and-tools (This issue is owned by Data Platform Apps and Tools); enhancement (enhancing an existing feature); 🧐 Monitoring and Observability (Epic #1590)

Comments

@julialawrence
Contributor

julialawrence commented Nov 6, 2023

User Story

We need to establish a robust alerting system for our Modernisation Platform and set up proper routing for these alerts. The goal is to ensure that we can proactively identify and respond to any potential issues or anomalies in our platform's performance and operation. This issue focuses on defining the criteria for triggering alerts and creating a routing mechanism to direct these alerts to the appropriate teams or individuals for timely resolution.

Definition of Done

  • Alert Criteria Definition: Clearly specify the conditions or events that should trigger alerts. This could include performance
    thresholds, error rates, security breaches, or other critical events.
  • Alert Severity Levels: Determine different severity levels for alerts (e.g., critical, major, minor) to prioritize responses
    accordingly.
  • Alert Routing Mechanism: Create a system that routes alerts to the relevant stakeholders or teams responsible for
    addressing specific types of alerts.
  • Communication and Escalation Plan: Develop a clear plan for how alerts will be communicated, escalated, and resolved,
    ensuring that the right people are informed in a timely manner.
  • Testing and Validation: Ensure that the alerting system is thoroughly tested and validated to avoid false positives or
    negatives, which can disrupt operations.
  • Documentation: Document the entire alerting and routing process as a reference guide for team
    members.
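
To make the severity and routing items above more concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration only: the thresholds, the category names, the team names in `ROUTES`, and the `on-call` escalation target are hypothetical and are not taken from the Modernisation Platform or any existing tooling.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import List, Optional


class Severity(IntEnum):
    """Severity levels from the story; higher value means more urgent."""
    MINOR = 1
    MAJOR = 2
    CRITICAL = 3


@dataclass(frozen=True)
class Alert:
    name: str
    category: str  # hypothetical categories: "performance", "errors", "security"
    severity: Severity


# Hypothetical routing table: alert category -> owning team.
ROUTES = {
    "performance": "platform-team",
    "errors": "apps-and-tools-team",
    "security": "security-team",
}
DEFAULT_ROUTE = "platform-team"  # assumed fallback for unknown categories
ESCALATION_ROUTE = "on-call"     # assumed extra recipient for critical alerts


def classify_error_rate(error_rate: float) -> Optional[Severity]:
    """Map an error rate (0.0 to 1.0) to a severity, or None if no alert is needed.

    The thresholds are placeholders, not agreed values.
    """
    if error_rate >= 0.10:
        return Severity.CRITICAL
    if error_rate >= 0.05:
        return Severity.MAJOR
    if error_rate >= 0.01:
        return Severity.MINOR
    return None


def route(alert: Alert) -> List[str]:
    """Return the recipients for an alert; critical alerts are also escalated."""
    recipients = [ROUTES.get(alert.category, DEFAULT_ROUTE)]
    if alert.severity == Severity.CRITICAL:
        recipients.append(ESCALATION_ROUTE)
    return recipients
```

For example, `route(Alert("login-failures", "security", Severity.CRITICAL))` would notify both the hypothetical `security-team` and `on-call`, while a minor alert in an unknown category would fall back to `platform-team` only. Keeping thresholds and routes in plain data like this is one way to make the criteria easy to review, test, and document.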
@julialawrence
Contributor Author

Nice story, but OBE'd (overtaken by events) :(

julialawrence closed this as not planned on Mar 5, 2024
github-project-automation (bot) moved this from 👀 TODO to 🎉 Done in Analytical Platform on Mar 5, 2024