Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Project Proposal: Feature Flag Semantic Conventions #2042

Merged
merged 26 commits into from
May 31, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
bebde4f
Add draft feature flag project proposal
dyladan Apr 5, 2024
848a7ca
Git handles don't work in markdown
dyladan Apr 5, 2024
6878459
Mike is a co-founder of openfeature
dyladan Apr 5, 2024
b9f24fd
Spell check
dyladan Apr 5, 2024
84b5278
Add ryan to list
dyladan Apr 7, 2024
0b2d0f5
Apply suggestions from code review
dyladan Apr 24, 2024
b5cc7f9
Code review
dyladan Apr 24, 2024
4992147
Merge branch 'main' into feature-flag-proposal
dyladan Apr 24, 2024
524f464
Add project sponsors
dyladan Apr 26, 2024
a25a5a4
Add shortcomings of existing experimental semconv
dyladan Apr 26, 2024
2b03d11
Spellcheck
dyladan Apr 26, 2024
3ba6f3e
Update projects/feature-flag.md
dyladan May 1, 2024
3384555
Update projects/feature-flag.md
dyladan May 9, 2024
349c23b
Add andre to spelling
dyladan May 9, 2024
1de0ba3
Add lukas and organize project staffing
dyladan May 9, 2024
ae1abe5
Merge branch 'main' into feature-flag-proposal
dyladan May 9, 2024
3a27694
Fix lukas' name spelling
dyladan May 10, 2024
8a3fde5
Add lukas and codecentric to cspell
dyladan May 10, 2024
899b949
Merge remote-tracking branch 'origin/main' into feature-flag-proposal
dyladan May 13, 2024
a32c028
Wording suggestions from review
dyladan May 13, 2024
4c6f66d
Merge branch 'main' into feature-flag-proposal
dyladan May 13, 2024
344fb4b
Merge branch 'main' into feature-flag-proposal
dyladan May 13, 2024
4d2190d
Add andre's employer
dyladan May 21, 2024
61a62bb
Merge remote-tracking branch 'origin/main' into feature-flag-proposal
dyladan May 21, 2024
d946f50
Merge branch 'main' into feature-flag-proposal
dyladan May 23, 2024
c888b1e
Merge branch 'main' into feature-flag-proposal
dyladan May 28, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 10 additions & 0 deletions .cspell.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -39,11 +39,13 @@ words:
- adot
- alolita
- amye
- andré
- aniszczyk
- apac
- anoshin
- armin
- beedgen
- beemer
- bertysentry
- blanco
- bogdan
Expand All @@ -56,6 +58,8 @@ words:
- cicd
- cijo
- cncf
- codeboten
- codecentric
- codecov
- codeowners
- danielgblanco
Expand Down Expand Up @@ -93,10 +97,15 @@ words:
- krzko
- kröhling
- kubecon
- kuisathaverat
- lalitb
- lambdanis
- lexis
- liatrio
- lightstep
- logz
- lucavallin
- lukas
- magnusbaeck
- maintainership
- mancuso
Expand All @@ -118,6 +127,7 @@ words:
- nakamura
- neumann
- neumüller
- nexis
- nirga
- nikimanoledaki
- novotny
Expand Down
145 changes: 145 additions & 0 deletions projects/feature-flag.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,145 @@
# Feature Flag Semantic Conventions

[OpenFeature](https://openfeature.dev/) is a CNCF Incubating project aiming to standardize feature flags.
They currently provide 11 open source SDKs in various languages and support more than 10 feature flagging backends.
OpenFeature has had contributions from over 50 companies and is actively developed by a combination of end users, feature flagging vendors, and platform providers (see the list of [interested parties](https://openfeature.dev/community/interested-parties)).

OpenFeature provides observability data via OpenTelemetry hooks.
A hook is an OpenFeature plugin which may execute logic at various stages of the feature flag evaluation process.
Currently, those hooks use the experimental [feature flagging semantic convention](https://opentelemetry.io/docs/specs/semconv/feature-flags/).

OpenFeature is looking to expand their observability solution either by improving the currently available hooks or by providing native instrumentation within the OpenFeature SDK itself.
In order to support that effort, OpenFeature requires a stable semantic convention to build instrumentation around.
This project aims to develop and stabilize feature flag semantic conventions, using the existing experimental semantic conventions as a starting point.

## Problem 1: Determining Flag Impact

Currently, it is impossible to determine the impact a feature flag has on key metrics for your services.
For example, if a feature flag is enabled for 25% of users, additional data is required to know if that subset of users is experiencing faster, slower, or unchanged response times.
This problem can become exponentially more challenging if multiple feature flags are evaluated over the course of a single trace.

Further, it may also be difficult or impossible to know exactly how many users are being affected by a particular feature flag.
For example, if a feature flag targets users with a particular email domain, it may require additional telemetry data to determine what portion of your users or traffic is receiving a particular experience.

## Problem 2: Flag management separate from telemetry data

Feature flags are typically managed outside your telemetry tooling.
There is no standardized way for a flag management system to notify a telemetry system that a feature flag rule set is changed, or if a rule is enabled or disabled.
This lack of standardization means that even if a flag management system emits a flag change event, the telemetry tool may not be able to correlate that change with the services that it impacts.
It also means that a person responsible for monitoring an application often has to check two separate tools in order to determine if a flag change was the root cause of a change in service metrics or behavior.
Because rule sets and telemetry data are managed in separate tools, it may be difficult to determine exactly which rule set was used and if it has been changed between when it was evaluated and when a person investigating an issue is looking at the telemetry data.

## Target Use Cases

- Analyze the impact a feature flag has on key service metrics such as response time and failure rate.
If a feature variant is enabled for some subset of users, it is necessary to be able to distinguish those traces from other traces in your telemetry backend.
- Associating a flag change with a change in key service metrics.
This is useful additional context during root cause analysis.
One example of when this is useful may be determining the rule set that was evaluated in order to determine the flag variant.
- Determine how many users are experiencing a particular flag variant.
In many feature flagging services it is possible to target a flag variant by some attribute of context such as the user’s email domain, however it may be difficult or impossible to know what percentage of your traffic falls into the cohort.
- Determining why a particular flag evaluation returned the variant that it did.
For example, a rule set may depend on contextual info such as the currently logged in user, their user-agent, or their geographic location in order to determine the flag variant.
- Identify which services are evaluating a specific feature flag.
This is useful for the process of removing flags from your code and feature flagging service, an important part of the flag lifecycle.
- Analyzing the impact of the actual flag evaluation.
Some feature flag libraries and services do async work such as database or remote calls to decide which variant of a feature flag should be returned.

## Experimental Semantic Conventions

Currently, the experimental [feature flagging semantic convention](https://opentelemetry.io/docs/specs/semconv/feature-flags/) defines a minimal set of attributes to track a flag evaluation: key, provider name, and the returned variant, and shows how they can be used in logs and span events.
While this provides a lot of value, there are several shortcomings that we would like to address:

### Shortcoming 1: key may not be unique

Many feature flag providers introduce the concept of a collection of feature flags used by the same service, which we will call projects.
While the flag key uniquely identifies a feature flag within a project or scope, it may not be globally unique across your application or service if your app evaluates flags from multiple projects.

### Shortcoming 2: lack of sufficient context

Most feature flag evaluation services support a concept called flag context.
Context is a set of attributes input into the flag rule set to influence the returned variant.
One example of this may be the current logged in users email domain; a rule set may turn a particular variant on for only users from a particular company.
The current experimental semantic conventions currently do not address this concept.

### Shortcoming 3: lack of metric support

It is common to analyze new features, A/B tests, and experiments using metrics split by feature flag data.
In the current semantic convention, there is no metric advice.
This leaves the user to decide which attributes to use on their own, which may lead to cardinality explosions or imprecise results.

### Shortcoming 4: no flag change events

This is related to key uniqueness in that a flag's rule set or possible variants may change over time.
These changes can have a massive impact on user experience and it is important to be able to view them in context of your observability data for effective root cause analysis.
Further, without a revision ID or similar concept, it may be difficult to determine exactly which rule set was evaluated to return a particular feature flag variant.

## Deliverables

The project deliverable will be a stable feature flag semantic convention for flag evaluations, also called impressions, and feature flag changes.
Both client and server uses should be considered, and prototypes should be created for at least one server and at least one client use case.

While prototypes will be delivered as a part of the development effort of this project, stable instrumentation is out of scope of this project and is expected to come from the OpenFeature project in the form of OpenFeature hooks or native instrumentation in OpenFeature SDKs.
Any use of baggage, including propagating fields to be used in feature flag evaluation context, is also out of scope.

## Staffing
dyladan marked this conversation as resolved.
Show resolved Hide resolved

The project is expected to be mainly staffed by OpenFeature contributors representing a strong subsection of the feature flagging vendor and user community.
This group will provide the domain expertise to ensure the feature flag observability meets the requirements of feature flag vendors and users.

This project also includes Dan Dyla acting as one of the project leads.
Dan is a member of the GC, maintainer of OTel JS, and has experience developing semantic conventions.
Dan will provide the necessary expertise to ensure the proposed semantic conventions meet the standards and expectations of the OpenTelemetry community and semantic conventions SIG.

The project is sponsored by Dan Dyla and Alexander Wert, an OpenTelemetry semantic conventions maintainer.

### Required staffing

**Project Leads and Sponsors:**

- Alexander Wert - Project Sponsor (OTel SemConv, Elastic)
- Daniel Dyla - Project Lead and Sponsor (OTel GC, OTel JS, Dynatrace)
- Michael Beemer - Project Lead (OpenFeature co-founder and GC, Dynatrace)

**Engineers:**

The following is a list of engineers committed to working on the project and write and review prototypes:

- Alexander Wert - (OTel SemConv, Elastic)
- André Silva (OpenFeature dotnet, Lexis Nexis Risk Solutions)
- Daniel Dyla - (OTel GC, OTel JS, Dynatrace)
- Federico Bond (OpenFeature Python, Independent)
- Lukas Reining (OpenFeature TC, Codecentric)
- Michael Beemer - (OpenFeature co-founder and TC, Dynatrace)
- Ryan Lamb (OpenFeature TC, LaunchDarkly)

## Meeting Times

This project expects to meet every 2 weeks until this charter is fulfilled.
The exact time and date of the meeting will be determined after the project is approved.

## Timeline

We expect the project to take about 1 calendar quarter. The OpenFeature project is currently targeting Kubecon NA 2024 to announce stable OpenTelemetry support.

- 2-4 weeks - First experimental semantic convention built on pre-existing feature flag semantic conventions
- 2-4 weeks - Prototype implementation and review
- 2-4 weeks - Incorporate feedback from prototype implementers
- 2-4 weeks - Final review period including the greater spec and maintainer community

## Labels

This project will use the label [`area:feature-flag`](https://github.com/open-telemetry/semantic-conventions/labels/area%3Afeature-flag) to track issues and pull requests in the semantic conventions repository.

## Project Board

Once the project is approved a project board will be created and linked here.
The project lead and all other relevant project members should have edit access to the board.
The board will be pre-populated with at least the following issues to track deliverables and the timeline.

- **signal type for feature flag impressions** - agree on which signal type or types should be used to represent feature flag impressions.
- **signal type for feature flag changes** - agree on which signal type or types should be used to represent feature flag changes.
- **semantic conventions for feature flag impressions** - make any necessary additions to the semantic conventions to support feature flag impressions
- **semantic conventions for feature flag changes** - make any necessary additions to the semantic conventions to support feature flag change events
- **prototype feature flag impressions**
- **prototype feature flag change events** - feature flag change events may not be generated by SDKs or instrumentations, but by the feature flag management tools.
dyladan marked this conversation as resolved.
Show resolved Hide resolved