Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation / deployment walk through for SPIRE on TPM. #5291

Open
edwbuck opened this issue Jul 11, 2024 · 1 comment
Open

Documentation / deployment walk through for SPIRE on TPM. #5291

edwbuck opened this issue Jul 11, 2024 · 1 comment
Labels
help wanted Issues with this label are ready to start work but are in need of someone to do it priority/backlog Issue is approved and in the backlog

Comments

@edwbuck
Copy link
Contributor

edwbuck commented Jul 11, 2024

This issue is to create a deployable, maintainable, scalable, fault-tolerant documented SPIRE reference architecture that relies on TPM as a root of trust for the sole purpose of node attestation. There are two forms that this documentation might take, one using TPM without the presence of Kubernetes, and one with the presence of Kubernetes. This issue only deals with the non-Kubernetes deployment.

Being a hardware based deployment, the helm chart project cannot be a component of one form of this documentation.

The goals of this effort is to provide

  • One example, from the ground up, that establishes a redundant SPIRE cluster using the boxboat TPM node attestor.
  • Procedures around establishing the initial trust for node attestation.
  • Procedures around re-establishing trust for nodes down for a short period of time.
  • Procedures around re-establishing trust for nodes down for extended (certificate expiring) periods of time.
  • Procedures for scaling up to an additional SPIRE server
  • Procedures for scaling down, removing a SPIRE server

As a side effect of this documentation, a number of follow-on issues will likely be drafted to facilitate the ease of deployment.

Where possible, the documentation should focus on fast recovery of outage events. A nonfunctional requirement of the documentation should be to minimize the effort for supporting a 100 or 1000+ node data center.

@kfox1111
Copy link
Contributor

This is a good thing to do, in addition to the other issue. I think they are complimentary. This one is focused on the NodeAttestor part, and the other one is focused on the dependencies of the spire-server with respect to other CA's.

@evan2645 evan2645 added the triage/in-progress Issue triage is in progress label Jul 18, 2024
@amartinezfayo amartinezfayo added priority/backlog Issue is approved and in the backlog unscoped The issue needs more design or understanding in order for the work to progress and removed triage/in-progress Issue triage is in progress labels Jul 24, 2024
@MarcosDY MarcosDY added help wanted Issues with this label are ready to start work but are in need of someone to do it and removed unscoped The issue needs more design or understanding in order for the work to progress labels Jul 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Issues with this label are ready to start work but are in need of someone to do it priority/backlog Issue is approved and in the backlog
Projects
None yet
Development

No branches or pull requests

5 participants