-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Define support ticket urgency levels and practices for how to deal with them #1154
Comments
I think an important first step is to define an incident, in an objective way that's independent of how Urgent the user thinks something is a problem. An incident is when one of the following is true:
I think when any of these are true, we should 'declare an incident'. https://sre.google/workbook/incident-response/ has some good ideas on what to do when that happens, inspired by actual fire incidents in the literal wild. |
Taking a page out of https://sre.google/workbook/incident-response/#putting-best-practices-into-practice, so here's a very specific but first-draft process recommendation for an incident management workflow. When a ticket comes in, we perform the following test:
If any of these criteria are met, the support steward declares an incident, by doing the following:
The incident commander is responsible for investigating the issue, pulling in people if necessary, and keeping the customer informed via the support ticket. They can also tag out when it is no longer working hours for them - we should engineer process that makes this viable. How does this sound as a start? I can make this into a PR and we can iterate. |
https://response.pagerduty.com/before/different_roles/ is also a good read. |
Couple more vague thoughts:
|
So, we are thinking about having 2 people in support AND one incident commander from the rest of the team? |
I think this is closed by 2i2c-org/team-compass#422 |
I think there are some additional pieces on this merged PR: 2i2c-org/docs#143 @choldgraf, do you want to keep this open for something else? |
Let's say that 2i2c-org/team-compass#422 closes this one, and we can continue iterating in new / more focused issues from there? |
Context
An important part of the support process is understanding how "urgent" the ticket is. For example - some tickets are general requests that we can finish in a few weeks, others are immediate fires that we need to fix ASAP. Having a system to categorize these tickets will help us make better decisions about our operational time, and reduce stress levels associated with not knowing whether we need to drop everything to fix something.
Proposal
We should define:
As a result of this, we may need to do further development work to improve our support practices, such as setting up a PagerDuty-like system, but we'll understand that better once we write out the high-level structure.
Implementation guide
See the parent issue for a lot of references to support processes with urgency levels:
The text was updated successfully, but these errors were encountered: