[UMBRELLA] Falco collaboration with CNCF `tag-env-sustainability` #2435

incertum · 2023-03-06T20:28:49Z

Motivation

Falco would like to partner with https://github.com/cncf/tag-env-sustainability in order to improve Falco's efficiency (reduce compute overhead and resolve resource constraints limitations). This includes overcoming design challenges with new thinking in order to enable Falco to further extend threat detection capabilities w/ resource utilization budgets in mind.

Additional Context

EDIT Dec 19, 2023

New dedicated repo is up https://github.com/falcosecurity/cncf-green-review-testing/.
Checkout the open issues https://github.com/falcosecurity/cncf-green-review-testing/issues for tracking.

mkorbi · 2023-03-26T16:45:47Z

Hey @incertum, we would like to support you here.
First we will have to define a base line so that in the future you will have a measurable outcome. I opened some days ago the matching request for that method: cncf/tag-env-sustainability#64 (comment)

So we can get started here and then move on. WDTY?

Next steps would be to work out how to define the SCI for falco.

incertum · 2023-03-27T05:55:45Z

Hi @mkorbi, amazing ❤️!

SCI scores and anything related to it is new to me. Eager to learn how we can define the SCI for Falco. Previously, we focused on traditional resource utilization and health metrics (e.g. CPU and memory usage, event or event drop rates ...).

CC @falcosecurity/core-maintainers

incertum · 2023-06-16T21:18:50Z

@mkorbi Falco 0.35.0 is out featuring a new metrics option. By Falco 0.36.0 the metrics feature will transition into a stable state.

Following the discussion in cncf/tag-env-sustainability#64, we have a few questions:

cncf/tag-env-sustainability#64 (comment)
cncf/tag-env-sustainability#64 (comment)
However, it would be great to start collecting data on what we can already measure (CPU, GPU, memory), as @TheFoxAtWork said.

This would benefit use cases like Falco. CPU utilization is directly tied to the rate of events collected, which can be influenced by configurations. However, it is also dependent on the workload's nature, which is beyond Falco's control. Falco now supports measuring CPU utilization, event rates, and eBPF rate of tracepoint invocations natively.

cncf/tag-env-sustainability#64 (comment)
... deliverable to be an initial guide in evaluating resource consumption for projects in a default configuration so that interested projects can receive such an evaluation from this TAG ...

What could the expected deliverables for Falco look like? One idea is to provide adopters with a mathematical equation focused on overall CPU and/or memory utilization. This equation would allow them to calculate an approximate cost and observe how the cost changes when adjusting Falco's monitoring configurations. This would enable adopters to make informed decisions about resource allocation and optimize their usage of Falco.

Adopters can choose between measuring CPU and memory of Falco separately or use Falco's native metrics feature.

In addition, Falco follows a strict badging system across its repositories. Could see benefits to including TAG Environmental Sustainability engagement badge for our project ... WDYT? This badge would recognize our commitment to promoting and incorporating sustainable practices within the Falco community.

leonardpahlke · 2023-06-20T11:03:03Z

Hey @incertum, congrats on the latest release!

As part of TAG ENV, we are establishing a working group that will first investigate and then guide future projects like Falco and other CNCF projects to track their Cloud Native Sustainability footprint from release to release. The WG charter is currently discussed, but as soon as it's up, this group will focus on this issue. cc @guidemetothemoon and @nikimanoledaki

--

Regarding your comments and questions. There are two topics we are mixing in this discussion:

First, we want to make sure we incorporate cloud native sustainability in the development of our software. This is one is focused on maintainers building the open source software. It's about reporting, possible audits at some point, and enhancing the release process (adding a badge to the repo etc…).
Secondly, we would like to enable transparency to users to check on the cloud native sustainability footprint. This is aimed at the end users of the software to best configure the project for their needs and understand the tradeoffs in configuration and overall application.

Both are important, but we should not mix it in discussions. The TAG scope overarches both. Both rely on the same metrics to make assessments. Hearing about your latest release, that features metrics, is great 👍.

The obvious next question is, which metrics we care about. That's a larger topic. And the WG will look into this more detailed. In essence, if we talk just about metrics, we care about energy usage. If the space matures further, we will care about natural resources too, but on a system level, so this would not apply to a project like Falco. Energy usage it is. We also need to investigate energy effectiveness (not just energy efficiency, but being “mindful” of energy “invested”). In most cases, we cannot measure the usage directly and need to use correlations like $ cost or map it with vCPU etc. The more accurately we can measure, the better are our estimates, right.

Let's circle back, if we “test bench” the project (first topic 1. mentioned) we have information on the system underneath. We don't have to go through Falco to measure the energy usage. We just have to record which parameters we adjust (total events, event kinds, etc.) in Falco and map it. For end users, this may not be the case since and user experience also comes into play. We may want to split this scope into two initiatives (1. & 2.) which are both related (would love to hear your thoughts @TheFoxAtWork).

Since this is the first time the TAG is working with a project to assess their cloud native sustainability footprint, I expect that this will be a great learning experience :D. I am excited!

catblade · 2023-06-20T14:25:00Z

Would there be a possibility of presenting FALCO on one of the TAG meetings, so we can learn more?

incertum · 2023-06-20T16:11:40Z

Thank you @leonardpahlke and @catblade! happy to join one of the next TAG meetings.

Meanwhile, you might want to consider exploring this proposal on kernel version testing, which offers additional insights into why a kernel monitoring tool differs from other software. One notable distinction is that resource utilization depends on the actual workload and kernel settings of adopters, both of which are unpredictable factors for Falco developers. Consequently, I agree that enabling ...

@leonardpahlke

"transparency to users to check on the cloud native sustainability footprint. This is aimed at the end users of the software to best configure the project for their needs and understand the tradeoffs in configuration and overall application."

would be particularly beneficial for Falco.

Traditional CPU and memory usages are typically top of mind for SREs. Therefore, if we could derive energy consumption from those measurements, it would be highly appreciated.

That being said, happy to investigate and gather additional or different metrics.

TheFoxAtWork · 2023-06-20T21:02:58Z

There are a few items here worth considering (and indeed Falco is a different sort of cloud native project that makes this tricky but incredibly worthwhile as a first project to explore this with) (apologies if its a bit rambly - both the points, while generally separate, are more interrelated for projects like Falco due to what they do and less on how they do it, but i'd be happy to have this proven otherwise)

This could likely be accomplished by leveraging the testing infrastructure the project has in place and plans to have in place - effectively supporting the right size for their needs. Efficient tracking of the Project in an execution environment with a few types of workloads and common kernel settings would provide good visibility for a baseline. Something like a 2x2 matrix/table to record Low and High interaction workloads and two common kernel settings (evaluated for each) is a good initial start for expressing baseline. Once a baseline is established, next steps may be looking over the ruleset to identify which rulesets are most intensive and which aren't (in testing and when running), then comparing to the value they provide adopters (the latter coming from the Falco team). After which a more concrete discussion on efficient versus valuable rules could be undertaken by the Project and potentially mark rules accordingly for adopters or update the maturity framework to include an "efficient, core-value" set.
Having Falco provide transparency in its utilization for production environments is beneficial and it gives adopters a self-service option. Potential future improvements here could be Falco recommending which rules need tuned by the adopter as they are producing excessive noise and burning utilization above an identified threshold.

Lets look at the information available to us that doesn't details a specific provider or deployment environment if we can (since utilization/consumption measurements are wildly different) and focus on how the project is developed (primarily test infrastructure) and how it is commonly deployed (harder with Falco).

Somethings I expect to have confirmed:

Security tools are going to be computationally intensive due to the kinds of interactions they monitor and the rigor by which they are executed - anything we can do to guide adopters into more eco-conscious decisions without compromising security detections will improve the current state.
There are a limited number of ways to efficiently detect all the things adopters care about and largely will vary use case to use case.

nikimanoledaki · 2023-07-01T10:31:08Z

@catblade
Would there be a possibility of presenting FALCO on one of the TAG meetings, so we can learn more?

@incertum could you open a new issue using the Presentation template to do a short presentation at one of the upcoming regular meets, please? This will mainly be a discussion for TAG contributors to learn about Falco, get up to speed with the initiative discussed here, and discuss next steps.

Upcoming meets with available time include Wednesday 5th July & Wednesday 19th July. Meeting details can be found in the TAG's repo landing page. Thanks, looking forward to it! 🎉

incertum · 2023-07-01T18:10:07Z

Great, thank you! July 19th would be best.

catblade · 2023-07-03T02:52:52Z

I'll make sure to add you into the agenda this week if someone else doesn't get to it first. :-)

incertum · 2023-07-19T17:52:23Z

Updates July 19, 2023:

Here are the meeting notes https://docs.google.com/document/d/1TkmMyXJABC66NfYmivnh7z8Y_vpq9f9foaOuDVQS_Lo/edit#heading=h.5hquk4f1dn95, thanks @catblade!

Action Items on Falco side (ETA before Falco 0.36 release ~Sep 2023):

Create a test matrix, similar to Emily's suggestion [UMBRELLA] Falco collaboration with CNCF tag-env-sustainability #2435 (comment)
Falco project to make executive decisions on what desired benchmark test scenarios for scaling factors should look like, @catblade provided some initial pointers re possible synthetic workloads that could be of interest to us:
- https://github.com/delimitrou/DeathStarBench/blob/master/hotelReservation/README.md
- https://github.com/GoogleCloudPlatform/microservices-demo

Tracking tag-env-sustainability progress:

WG PR: https://github.com/cncf/tag-env-sustainability/pull/151/files, approx. current ETA more August or later in 2023, outcomes will guide currently open questions around guidance for adopters / desired UX to assess the utilization impact of a tool (here Falco) on their specific environments and constraints @guidemetothemoon
Getting CNCF resources on equinox clusters to host testbeds / benchmarks is in flight https://www.cncf.io/community-infrastructure-lab/ @nikimanoledaki
Kepler project (also eBPF powered) was suggested to measure consumption during benchmark tests on the dedicated testbed clusters https://sustainable-computing.io/design/power_estimation/#deployment-scenarios

incertum · 2023-12-19T21:28:58Z

Updates Dec 19, 2023:

New dedicated repo is up https://github.com/falcosecurity/cncf-green-review-testing/.
Checkout the open issues https://github.com/falcosecurity/cncf-green-review-testing/issues for tracking going forward.

Expected ETA for a complete v1 to be "live" by KubeCon EU 2024.

poiana · 2024-03-18T21:49:53Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-03-22T09:46:51Z

/remove-lifecycle stale

poiana · 2024-06-20T09:53:56Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-06-20T10:27:10Z

/remove-lifecycle stale

poiana · 2024-09-18T16:10:48Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-09-20T07:53:28Z

/remove-lifecycle stale

poiana · 2024-12-19T10:12:57Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-12-19T14:46:43Z

/remove-lifecycle stale

incertum added the kind/feature label Mar 6, 2023

leonardpahlke mentioned this issue Jun 20, 2023

[Proposal] Proof of Environmental Sustainability activities and best practices for CNCF projects cncf/tag-env-sustainability#64

Closed

incertum mentioned this issue Jul 1, 2023

[Presentation] Falco and TAG Environmental Sustainability Partnership cncf/tag-env-sustainability#140

Closed

3 tasks

incertum mentioned this issue Jul 21, 2023

The impact of falco on system performance #2683

Closed

incertum mentioned this issue Aug 28, 2023

Improve falco benchmarking, performance, and regression tooling to better track system resources impact #2296

Closed

guidemetothemoon mentioned this issue Aug 28, 2023

[Tracking, Green Reviews WG] Falco as first CNCF project to measure by Green Reviews WG cncf/tag-env-sustainability#183

Closed

Andreagit97 added this to the TBD milestone Aug 31, 2023

incertum mentioned this issue Dec 12, 2023

Create new repo cncf-green-review-testing for CNCF TAG Environmental Sustainability - Green Reviews WG Testing Integration falcosecurity/evolution#345

Closed

poiana added the lifecycle/stale label Mar 18, 2024

poiana removed the lifecycle/stale label Mar 22, 2024

poiana added the lifecycle/stale label Jun 20, 2024

poiana removed the lifecycle/stale label Jun 20, 2024

poiana added the lifecycle/stale label Sep 18, 2024

poiana removed the lifecycle/stale label Sep 20, 2024

poiana added the lifecycle/stale label Dec 19, 2024

poiana removed the lifecycle/stale label Dec 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[UMBRELLA] Falco collaboration with CNCF `tag-env-sustainability` #2435

[UMBRELLA] Falco collaboration with CNCF `tag-env-sustainability` #2435

incertum commented Mar 6, 2023 •

edited

Loading

mkorbi commented Mar 26, 2023

incertum commented Mar 27, 2023

incertum commented Jun 16, 2023

leonardpahlke commented Jun 20, 2023 •

edited

Loading

catblade commented Jun 20, 2023

incertum commented Jun 20, 2023

TheFoxAtWork commented Jun 20, 2023

nikimanoledaki commented Jul 1, 2023

incertum commented Jul 1, 2023

catblade commented Jul 3, 2023

incertum commented Jul 19, 2023

incertum commented Dec 19, 2023

poiana commented Mar 18, 2024

leogr commented Mar 22, 2024

poiana commented Jun 20, 2024

leogr commented Jun 20, 2024

poiana commented Sep 18, 2024

leogr commented Sep 20, 2024

poiana commented Dec 19, 2024

leogr commented Dec 19, 2024

[UMBRELLA] Falco collaboration with CNCF tag-env-sustainability #2435

[UMBRELLA] Falco collaboration with CNCF tag-env-sustainability #2435

Comments

incertum commented Mar 6, 2023 • edited Loading

mkorbi commented Mar 26, 2023

incertum commented Mar 27, 2023

incertum commented Jun 16, 2023

leonardpahlke commented Jun 20, 2023 • edited Loading

catblade commented Jun 20, 2023

incertum commented Jun 20, 2023

TheFoxAtWork commented Jun 20, 2023

nikimanoledaki commented Jul 1, 2023

incertum commented Jul 1, 2023

catblade commented Jul 3, 2023

incertum commented Jul 19, 2023

incertum commented Dec 19, 2023

poiana commented Mar 18, 2024

leogr commented Mar 22, 2024

poiana commented Jun 20, 2024

leogr commented Jun 20, 2024

poiana commented Sep 18, 2024

leogr commented Sep 20, 2024

poiana commented Dec 19, 2024

leogr commented Dec 19, 2024

[UMBRELLA] Falco collaboration with CNCF `tag-env-sustainability` #2435

[UMBRELLA] Falco collaboration with CNCF `tag-env-sustainability` #2435

incertum commented Mar 6, 2023 •

edited

Loading

leonardpahlke commented Jun 20, 2023 •

edited

Loading