Optimize Tekton Pipeline #5452

pritidesai · 2022-09-07T23:46:28Z

Tekton Pipelines has matured since its inception, but the project is still under active development. Many organizations have adopted Tekton Pipelines for various use cases. For a project at this level of maturity and use, reliability must be maintained: users should be able to upgrade to the latest release without running into performance degradation.

We have noticed a couple of reported issues with a similar concern around efficiency: the webhook timing out, or the cluster becoming unresponsive for a pipeline with a large number of tasks.

Today, we have no record of how long a given pipeline takes to execute with the latest release compared to the past N releases.

We have had a couple of PRs in the past trying to introduce some form of performance test:

Let's start writing performance tests to report the execution time. The performance tests can be scheduled to execute every night. We collect the execution time in logs for now until we come up with a better way of storing these numbers.
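A minimal sketch of what "collect the execution time in logs" and "compare against the past N releases" could look like, assuming a JSON-lines log file with one record per run. The function names, the record fields (`release`, `pipeline`, `seconds`), and the median-ratio comparison are all hypothetical choices made for illustration, not an existing Tekton format or tool.

```python
import json
import statistics


def record_timing(log_path, release, pipeline, seconds):
    """Append one timing record to a JSON-lines log file."""
    record = {"release": release, "pipeline": pipeline, "seconds": seconds}
    with open(log_path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record) + "\n")


def load_timings(log_path):
    """Read all timing records back from the JSON-lines log."""
    with open(log_path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def slowdown_vs_baseline(records, pipeline, latest_release, past_releases):
    """Return the latest release's median time divided by the baseline median.

    The baseline is the median over all records from `past_releases`.
    A value above 1.0 means the latest release was slower.
    Returns None when either side has no data.
    """
    latest = [r["seconds"] for r in records
              if r["pipeline"] == pipeline and r["release"] == latest_release]
    past = [r["seconds"] for r in records
            if r["pipeline"] == pipeline and r["release"] in past_releases]
    if not latest or not past:
        return None
    return statistics.median(latest) / statistics.median(past)
```

A nightly job could call `record_timing` after each run and fail, or just warn, when `slowdown_vs_baseline` exceeds a chosen threshold. The median is used here only because it is less sensitive to a single slow run than the mean.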

As a performance measure, we could also avoid validating task/pipeline spec in every iteration - #4562.
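One way to picture "avoid validating in every iteration" is to remember which spec contents have already passed validation and skip them on later reconciles. The sketch below illustrates that idea only: Tekton's controller is written in Go, this is not taken from #4562, and the `ValidationCache` class and its `validate` callback are hypothetical names. It also assumes validation depends on nothing but the spec content.

```python
import hashlib
import json


class ValidationCache:
    """Remember which spec contents have already passed validation."""

    def __init__(self, validate):
        # `validate` is any callable that raises ValueError on an invalid spec.
        self._validate = validate
        self._valid_digests = set()

    @staticmethod
    def _digest(spec):
        # Canonical JSON so that key order does not change the digest.
        canonical = json.dumps(spec, sort_keys=True, separators=(",", ":"))
        return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

    def ensure_valid(self, spec):
        """Validate `spec` unless identical content was validated before.

        Returns True if validation actually ran, False on a cache hit.
        """
        digest = self._digest(spec)
        if digest in self._valid_digests:
            return False
        self._validate(spec)  # an exception here leaves the cache untouched
        self._valid_digests.add(digest)
        return True
```

The set in this sketch grows without bound, so a long-running process would need some eviction policy. Invalid specs are deliberately not cached, which means they are re-validated (and rejected) every time.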

Determine if we can avoid validating specifications every reconcile cycle.
Create a test with a complex pipeline to log time taken to validate.
Write a test to create a PipelineRun with a complex pipeline (multiple tasks with taskRef and taskSpec, along with many when expressions).
Create multiple taskRuns in parallel - something similar to RFC: Basic performance test #4378
Create multiple pipelineRuns in parallel and log timing.
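The "complex pipeline" and "multiple pipelineRuns in parallel" items above could start from generated manifests. Below is a rough sketch that builds them as plain dictionaries and serializes them to JSON, which `kubectl create -f` accepts. The `tekton.dev/v1` API version, the `echo-task` Task name, the `run-mode` parameter, and the `busybox` image are assumptions for illustration; adjust them to whatever the target cluster actually has.

```python
import json


def build_pipeline(name, num_tasks, chain_length=5):
    """Build a Pipeline manifest mixing taskRef, taskSpec, and when expressions."""
    tasks = []
    for i in range(num_tasks):
        task = {
            "name": f"task-{i}",
            # Every task is guarded by a when expression on a pipeline param.
            "when": [{"input": "$(params.run-mode)", "operator": "in", "values": ["full"]}],
        }
        if i % 2 == 0:
            # Assumption: a Task named "echo-task" already exists in the namespace.
            task["taskRef"] = {"name": "echo-task"}
        else:
            task["taskSpec"] = {
                "steps": [{"name": "echo", "image": "busybox", "script": f"echo task {i}"}]
            }
        # Chain tasks in groups of `chain_length`; the groups run in parallel.
        if i % chain_length != 0:
            task["runAfter"] = [f"task-{i - 1}"]
        tasks.append(task)
    return {
        "apiVersion": "tekton.dev/v1",
        "kind": "Pipeline",
        "metadata": {"name": name},
        "spec": {
            "params": [{"name": "run-mode", "type": "string", "default": "full"}],
            "tasks": tasks,
        },
    }


def build_pipeline_runs(pipeline_name, count):
    """Build `count` PipelineRun manifests that all reference the same Pipeline."""
    return [
        {
            "apiVersion": "tekton.dev/v1",
            "kind": "PipelineRun",
            # generateName lets the API server pick a unique name per run.
            "metadata": {"generateName": f"{pipeline_name}-run-"},
            "spec": {"pipelineRef": {"name": pipeline_name}},
        }
        for _ in range(count)
    ]


if __name__ == "__main__":
    print(json.dumps(build_pipeline("perf-complex", 20), indent=2))
```

Because the runs use `generateName`, they have to be submitted with `kubectl create` rather than `kubectl apply`.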

dibyom · 2022-09-14T21:54:51Z

This seems similar to tektoncd/community#602

JeromeJu · 2022-11-21T16:01:43Z

/assign

afrittoli · 2023-01-10T17:45:25Z

From the pipeline WG - it would be good to break this down into smaller items that we can target to milestones.

lbernick · 2023-04-12T17:50:48Z

It seems like this issue is largely scoped to benchmarking. I did a bit recently to test out a feature I was working on and want to share my progress here.

I wrote some scripts that generate N copies of a PipelineRun, wait until all N are complete, and write timing info to a file. If the script is cancelled, it will cancel any currently running PipelineRuns and report on all of them regardless of whether they have completed. This could be a good starting point for anyone who wants to implement benchmarking. Code changes are on the branch https://github.com/lbernick/pipeline/tree/perftest.
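For readers who want a feel for the reporting step, here is a small sketch that summarizes timing from PipelineRun objects, including runs that have not completed. It is not the code on the branch above. It assumes the input is the `items` list from `kubectl get pipelineruns -o json` and relies on the `status.startTime` and `status.completionTime` fields; the function names and the shape of the summary are hypothetical.

```python
import statistics
from datetime import datetime, timezone


def parse_k8s_time(value):
    """Parse a Kubernetes timestamp such as '2023-04-12T17:50:48Z'."""
    return datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def summarize(pipeline_runs):
    """Summarize durations of completed runs and list the incomplete ones."""
    durations = []
    incomplete = []
    for run in pipeline_runs:
        status = run.get("status", {})
        start = status.get("startTime")
        end = status.get("completionTime")
        if start and end:
            elapsed = parse_k8s_time(end) - parse_k8s_time(start)
            durations.append(elapsed.total_seconds())
        else:
            # Still running, cancelled before finishing, or never started.
            incomplete.append(run["metadata"]["name"])
    summary = {"completed": len(durations), "incomplete": incomplete}
    if durations:
        summary["min_seconds"] = min(durations)
        summary["median_seconds"] = statistics.median(durations)
        summary["max_seconds"] = max(durations)
    return summary
```

Reporting incomplete runs by name, rather than dropping them, keeps the output useful when the benchmark is interrupted partway through.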

Some things that still need to be figured out:

Where would we run this? We wouldn't want to generate lots of runs on our CI cluster, and I'm not sure if a kind cluster could handle a large number of PipelineRuns.
What's the best way to output perf data so that it can be stored over time and referred to easily? We might be able to run tests using Tekton and store results using Tekton Results; we could also use Prometheus, but I'm not sure how we'd easily separate out metrics related to benchmarking.
What metrics do we care about specifically? I think this is what Start measuring Tekton Pipelines performance #540 is trying to tackle (also some more detail in https://docs.google.com/document/d/1Rme6UQ0i03W_Fg3pefJ8aJ9G73IBnijUISTL2R9XmzU/ -- thanks @pritidesai!)

Some other tools we can look into:

tekton-robot · 2023-07-11T18:28:08Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale with a justification.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle stale

Send feedback to tektoncd/plumbing.

tekton-robot · 2023-08-10T19:17:48Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten with a justification.
Rotten issues close after an additional 30d of inactivity.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle rotten

Send feedback to tektoncd/plumbing.

vdemeester · 2023-08-10T20:23:59Z

/lifecycle frozen

afrittoli · 2024-04-30T16:57:03Z

@pritidesai - we marked this as "nice to have" for v1 - please let us know if you disagree.

pritidesai added the area/performance label (Issues or PRs that are related to performance aspects) Sep 7, 2022

xchapter7x added this to Tekton Community Roadmap Sep 20, 2022

xchapter7x moved this to Todo in Tekton Community Roadmap Sep 20, 2022

lbernick added this to Pipelines V1 Oct 18, 2022

lbernick moved this to Todo in Pipelines V1 Oct 18, 2022

lbernick added this to the Pipelines v0.42 milestone Nov 7, 2022

jerop modified the milestones: Pipelines v0.42, Pipelines v0.43 Nov 15, 2022

tekton-robot assigned JeromeJu Nov 21, 2022

afrittoli modified the milestones: Pipelines v0.43, Pipelines v0.44 Dec 13, 2022

afrittoli removed this from the Pipelines v0.44 milestone Jan 10, 2023

JeromeJu removed their assignment Jan 17, 2023

lbernick mentioned this issue Jan 24, 2023

Nightly performance tests #6036

Closed

tekton-robot added the lifecycle/stale label (Denotes an issue or PR has remained open with no activity and has become stale) Jul 11, 2023

afrittoli mentioned this issue Jul 25, 2023

Regression testing on a scheduled duration #6969

Open

tekton-robot added the lifecycle/rotten label (Denotes an issue or PR that has aged beyond stale and will be auto-closed) and removed the lifecycle/stale label Aug 10, 2023

tekton-robot added the lifecycle/frozen label (Indicates that an issue or PR should not be auto-closed due to staleness) and removed the lifecycle/rotten label Aug 10, 2023
