-
Notifications
You must be signed in to change notification settings - Fork 489
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement a mechanism that to enable a NOS running SAI and SDK, to detect SDK and FW failures #1615
Conversation
Can we please add example usage? |
Intel/ashutosh: no way for nos to subscribe to specific ones they're interested in |
mike: this seems to take away control from NOS. |
better if nos doesn't need to parse string but the info is in a defined format. Please also checkout https://github.com/sonic-net/SONiC/blob/5677242770e3774a5eab0f951dc14642d81422ec/doc/event-alarm-framework/events-producer.md |
Community discussion comments :
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@itaibaz I like this proposal that this SDK failure detection and reporting is generic and gives the control to the vendor to push specific failure logs with related severity that help user to debug the given vendor SDK/HW. I think that if this feature is only severity level based it may resort to being an alternative to syslog. If there is additional categorization for the failures it may be more valuable to the user to monitor certain categories. Certain broad categories can be suggested. Given that using enums for categories seem restrictive and not having to add new enums for additional categories is one of your goals, please explore if we can make it extensible by adding the category as a field just like the description field.
d109ac8
to
256d47b
Compare
256d47b
to
79a75b0
Compare
79a75b0
to
2966cb3
Compare
opened #1777 instead |
Add new health event, for SAI to notify severity, timestamp and description of a SDK/FW health event
Signed-off-by: Andrii Mandiuk [email protected]