Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Access for data-catalogue github actions #4413

Merged
merged 1 commit into from
May 29, 2024

Conversation

MatMoore
Copy link
Contributor

@MatMoore MatMoore commented May 28, 2024

Pull Request Objective

We want to schedule Datahub DBT ingestions using github actions. (ministryofjustice/data-catalogue#123)

To do this, Github actions needs to be able to assume a role via OIDC and use it to read the bucket containing CaDeT outputs. See https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-idp_oidc.html

We already had defined IRSAs (IAM roles for service accounts) for this, that can be assumed by Datahub itself, but these assume you are running an application in a kubernetes pod on AWS, whereas in this case we are going to run the ingestion from github actions.

Checklist

Note

Each items should be checked. Skipping below checks could delay your PR review!

  • I have reviewed the style guide and ensured that my code complies with it
  • All checks have passed (or override label applied, if I've used the override-static-analysis label, I've explained why)
  • I have self-reviewed my code
  • I have reviewed the checks and can attest they're as expected

@MatMoore MatMoore force-pushed the find-moj-data-access-to-cadet-outputs branch from a7d272a to b768e5f Compare May 28, 2024 10:50
Copy link
Contributor

github-actions bot commented May 28, 2024

Terraform Component 🧱: aws-analytical-platform-data-production-tooling-iam

Checkov 🛂: success

Trivy 🛂: success

Static Analysis Override Label 🏷️: false

Pusher: @MatMoore, Action: pull_request, Working Directory: terraform/aws/analytical-platform-data-production/tooling-iam, Workflow: Terraform, Marker: aws-analytical-platform-data-production-tooling-iam_static_analysis

@MatMoore MatMoore force-pushed the find-moj-data-access-to-cadet-outputs branch 4 times, most recently from 2067a5e to 2bd782d Compare May 28, 2024 11:07
Copy link
Contributor

github-actions bot commented May 28, 2024

Terraform Component 🧱: aws-analytical-platform-data-production-tooling-iam

Terraform Initialization ⚙️: success

Terraform Validation 🤖: success

Terraform Plan 🛠️: success

Pusher: @MatMoore, Action: pull_request, Working Directory: terraform/aws/analytical-platform-data-production/tooling-iam, Workflow: Terraform, Marker: aws-analytical-platform-data-production-tooling-iam_plan

@MatMoore MatMoore marked this pull request as ready for review May 28, 2024 11:10
@MatMoore MatMoore requested a review from a team as a code owner May 28, 2024 11:10
jacobwoffenden
jacobwoffenden previously approved these changes May 28, 2024
Copy link
Member

@jacobwoffenden jacobwoffenden left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

We want to schedule Datahub DBT ingestions using github actions.
(ministryofjustice/data-catalogue#123)

To do this, Github actions needs to be able to assume a role via OIDC,
and use it to access the s3 bucket containing the outputs from DBT.
See https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-idp_oidc.html

We already had IRSAs (IAM roles for service accounts) which can be assumed by Datahub itself,
but these assume you are running an application in a kubernetes pod on
AWS, whereas in this case we are going to run the ingestion from github
actions.
@MatMoore MatMoore force-pushed the find-moj-data-access-to-cadet-outputs branch from 2bd782d to 2e7dc47 Compare May 29, 2024 08:19
Copy link
Member

@jacobwoffenden jacobwoffenden left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@MatMoore MatMoore changed the title Access for find-moj-data github actions Access for data-catalogue github actions May 29, 2024
@MatMoore MatMoore merged commit ce5fdd2 into main May 29, 2024
14 checks passed
@MatMoore MatMoore deleted the find-moj-data-access-to-cadet-outputs branch May 29, 2024 09:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants