-
Notifications
You must be signed in to change notification settings - Fork 84
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whitepaper on cloud-native observability #16
Comments
That sounds like a really good idea to me and I'm happy to contribute to this. |
+1 to this. A good first step may be a landscape, just to have all "players", from open-source and cloud-native, all the way to hosted SaaS options. |
We would need to define the categories for the landscape first. Observability is a too broad and abstract term. |
I am also happy to contribute to this. |
@mhausenblas, @sferlin, @ArthurSens, and @danielkhan met today to agree on a rough outline and schedule. We plan to have the draft ready for review by January 19th and a final version in February. |
I've finished reorganizing the sessions. Several comments were added to the doc with a brief explanation about the newly added sessions. |
It's worth mentioning that the organization of the sessions was inspired by the whitepaper that is under work by the SIG-Security |
"Goals" and "Target Audience" ready for review :) |
"Introduction" ready for review 🙂 |
A couple of thoughts...
|
Thank you for the feedback, Jaana.
I could not participate as it got unexpectedly busy but I can take a stab on this within the next two weeks if no one else is already working on this. |
The coordination and collaboration is happening in Google Doc and Slack, using this issue for high level tracking. |
Hi everyone, I would love to contribute to this. I see the last actions were last taken in April 2021. Where is the Whitepaper at in the process? |
Hi all, hope you are all great! Thanks everybody for help and feedback. It's finally time to claim that v1.0 version of the whitepaper we started long time ago! To do so we have to perform final touches and review. For that, we created this document, which is now open for review. The aim is to have the final version of whitepaper with addressed TODOs done by 1st August. After that we will share it wider, save as official v1 and open WIP version for v1.1, so we can continuously evolve it with more content and updates community can bring next months! (: Final review document holds the latest whitepaper version (copied from this version) available for collaborative review and addressing TODOs. Feel free to add comments & suggestions. For bigger suggestions (e.g. further details or new sections), we might pull them new GH issues for whitepaper v1.1. The doc also outlines all feedback items we got so far to consider. Feel free to help in any of those TODOs and, generally, in final review of this paper 🤗 Thanks! 💪🏽 |
Is "Make your Kubernetes cluster - and the apps running on it - obseravable" mentioned above still a goal of the observability whitepaper? |
Not sure, it feels like a separate tutorial might be useful here (we can then link from whitepaper). Happy to be told otherwise here (: Added #131 to track that particular idea. |
All the points here were either addressed in the final 1.0 review period or added as TODO (help wanted!) and tracked in the individual issues with PR with the changes from the review period to main branch will follow. Closing for now, feel free to keep discussion going, ideally in separate issue so it's easier to track and address 💪🏽 Thanks everybody for epic work on reviewing, suggesting and contributing! |
See: #132 |
Thanks for your ideas and feedback in https://github.com/cncf/tag-observability/issues/16\ Signed-off-by: bwplotka <[email protected]>
…ion period. (#132) * whitepaper: Syncing changes from the 1.0 community review & contribution period. Thanks everyone for amazing feedback! Apologies for a bit short period, but the paper was sitting for 2y without changes, so it made sense to time box 1.0 and allow structured work on further iterations. Still, within [this community review & contribution period](https://docs.google.com/document/d/19am_KCYWU28ebLiIXv_P3ji96edxCTscVb4CzemXV5A/edit) I counted 67 individual contributions (count of comment/suggestion bubbles, excluding my own and not counting individual discussion comments) from 7 new contributors. High level changes (suggested by community, but also clean up by me): * Note on aggregatability and volume of metrics. * Added non goals * Changing example from temperature to memory gauge * Add reasons for metric efficiency * Added info about cardinality (new section) * Mentioned metric data models * Added info about types * Metric time series vs count * Addressing feedback on logs, traces * Adding profile screenshot * Cleaning up, simplifying correlation section * Removing how to setup Prometheus with exemplars * Transitions * Box based monitoring refactor - changing "closed box" traditional * Clarified SLO/SLA * Added image and figure captions * More automatic and non-intrusive instrumentation solutions in OSS * Linking ebay paper * Added gap around streaming API, not enough DBs and standarized query language * Did grammarly pass for typos. ...and more. As mentioned in [the doc](https://docs.google.com/document/d/19am_KCYWU28ebLiIXv_P3ji96edxCTscVb4CzemXV5A/edit) I went through all additional and old feedback. It's now either addressed in this PR or added as todos in [separate issues](https://github.com/cncf/tag-observability/issues?q=is%3Aissue+is%3Aopen+label%3Acn-o11y-whitepaper-v1.1) I admit, it was fun to process that doc! Reminds me of year ago when I was, fully focused, writing my [book](https://www.oreilly.com/library/view/efficient-go/9781098105709/) on sligthly different topic. Signed-off-by: bwplotka <[email protected]> * Added Jaana and Alois as contributors. Thanks for your ideas and feedback in https://github.com/cncf/tag-observability/issues/16\ Signed-off-by: bwplotka <[email protected]> * Apply suggestions from Richi's code review Co-authored-by: RichiH-travel <[email protected]> Signed-off-by: Bartlomiej Plotka <[email protected]> * Apply suggestions from code review Co-authored-by: RichiH-travel <[email protected]> Signed-off-by: Bartlomiej Plotka <[email protected]> * Fixed references, added tip version. Signed-off-by: bwplotka <[email protected]> --------- Signed-off-by: bwplotka <[email protected]> Signed-off-by: Bartlomiej Plotka <[email protected]> Co-authored-by: RichiH-travel <[email protected]>
…ion period. (cncf#132) * whitepaper: Syncing changes from the 1.0 community review & contribution period. Thanks everyone for amazing feedback! Apologies for a bit short period, but the paper was sitting for 2y without changes, so it made sense to time box 1.0 and allow structured work on further iterations. Still, within [this community review & contribution period](https://docs.google.com/document/d/19am_KCYWU28ebLiIXv_P3ji96edxCTscVb4CzemXV5A/edit) I counted 67 individual contributions (count of comment/suggestion bubbles, excluding my own and not counting individual discussion comments) from 7 new contributors. High level changes (suggested by community, but also clean up by me): * Note on aggregatability and volume of metrics. * Added non goals * Changing example from temperature to memory gauge * Add reasons for metric efficiency * Added info about cardinality (new section) * Mentioned metric data models * Added info about types * Metric time series vs count * Addressing feedback on logs, traces * Adding profile screenshot * Cleaning up, simplifying correlation section * Removing how to setup Prometheus with exemplars * Transitions * Box based monitoring refactor - changing "closed box" traditional * Clarified SLO/SLA * Added image and figure captions * More automatic and non-intrusive instrumentation solutions in OSS * Linking ebay paper * Added gap around streaming API, not enough DBs and standarized query language * Did grammarly pass for typos. ...and more. As mentioned in [the doc](https://docs.google.com/document/d/19am_KCYWU28ebLiIXv_P3ji96edxCTscVb4CzemXV5A/edit) I went through all additional and old feedback. It's now either addressed in this PR or added as todos in [separate issues](https://github.com/cncf/tag-observability/issues?q=is%3Aissue+is%3Aopen+label%3Acn-o11y-whitepaper-v1.1) I admit, it was fun to process that doc! Reminds me of year ago when I was, fully focused, writing my [book](https://www.oreilly.com/library/view/efficient-go/9781098105709/) on sligthly different topic. Signed-off-by: bwplotka <[email protected]> * Added Jaana and Alois as contributors. Thanks for your ideas and feedback in https://github.com/cncf/tag-observability/issues/16\ Signed-off-by: bwplotka <[email protected]> * Apply suggestions from Richi's code review Co-authored-by: RichiH-travel <[email protected]> Signed-off-by: Bartlomiej Plotka <[email protected]> * Apply suggestions from code review Co-authored-by: RichiH-travel <[email protected]> Signed-off-by: Bartlomiej Plotka <[email protected]> * Fixed references, added tip version. Signed-off-by: bwplotka <[email protected]> --------- Signed-off-by: bwplotka <[email protected]> Signed-off-by: Bartlomiej Plotka <[email protected]> Co-authored-by: RichiH-travel <[email protected]> Signed-off-by: Roman Nikolaev <[email protected]>
Goal is to support users in implementing observability and monitoring for their cloud native workloads
Target: End users building cloud native applications
Scope: Define basic concepts of data collection and analysis and how CNCF projects can be used for this. Maybe add 1 - 3 real world reference examples
Details:
The text was updated successfully, but these errors were encountered: