-
-
Notifications
You must be signed in to change notification settings - Fork 666
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Patterns for managing expensive spec setup #1108
Comments
hey @nabbas-ca I wanted to pull this out into a separate issue so we can dig into it a bit more. this comes up from time to time - particularly for kubernetes suites with complex setups. If you're up for it I'd love to explore the problem more with a few goals in mind:
Would you be up for digging into things at a deeper level with me? If so I'll write up some thoughts/questions on this issue. (If that sounds really abstract/too-much-work-just-give-me-the-answer-please allow me to clarify: I've bumped into this problem space a few times with k8s e2e users and often times the question is framed in terms of a preferred solution - e.g. "do you have BeforeAll" or "can I have multiple suites in one package" and what i'd like to do is better understand the problem so I can figure out the right thing to suggest and/or build. V2 actually introduces an implementation of |
@onsi I'm up for this. I've designed it with BeforeAll at the end. But would like to discuss if it makes more sense in Gingko. |
Great! Sorry for the length, but here goes! Ginkgo is, currently, tailored towards suites where specs are independent from one another. Independent specs are usually easier to reason about and can, usually, be trivially parallelized which can yield substantial performance gains. For example: Ginkgo's integration suite takes ~4 minutes when I run it serially but only ~45 seconds when I run Of course, when you're building integration suites performance considerations are definitely in play. If the state under test is very expensive to set up (either in terms of time orcomputed resources) it can be hard to write independent specs. There are three common patterns that are in use today (these aren't the only things out there - you can mix and match and get more elaborate - but they provide a good frame of reference to dig into the topic). Per-Spec Setup and TeardownIn some integration contexts you can simply set up the entire world under test before each spec (typically in a Dedicated Resources Per Parallel ProcessIn some integration contexts it can be too expensive to set up and tear down the resources for each spec. But you can afford to give each parallel process its own dedicated resource (setup in a This makes parallelization fairly straightforward. You just need to make sure you correctly reset the resource after each spec (typically done in an I'll sometimes do this if I'm testing code that talks to an external database. I usually have enough compute resources to spin up multiple databases and just point each parallel process as its own dedicated database. Cleanup then is as simple as dropping the entire database between test runs - which can usually be done pretty efficiently. Shared Resources for the SuiteFinally, in some contexts, it isn't feasible to have dedicated resources and so a single shared resource must be used. This is usually set up in a As long as the specs are exercising things that only affect the single tenancy they inhabit - these sorts of specs are trivially parallelizable. However sometimes there are destructive specs that need to run that could affect other specs running in parallel. For these, Ginkgo provides the If you've gotten this far - thanks for hanging in there! When it comes to tests on a system like Kubernetes there are multiple layers of resources. And we can reason about each one separately. There's:
Which kind of approach works for each of these? And what are the tenancy capabilities available for each of them? I don't have a lot of direct k8s experience but I'm pretty confident that a per-spec setup and teardown of a fresh k8s cluster is out of the question. My sense, also, is that k8s clusters are sufficiently large and expensive that most users don't spin up a dedicated cluster per process. Instead, most opt for option 3: a shared k8s cluster for the entire suite + k8s tenancy constructs (namespaces?) to ensure specs can play without influencing each other. I don't know enough about controllers/operators and workloads/CRs to know if it is feasible for them to be set-up/torn-down on a per-spec basis or if they need to be done in a dedicated or shared way. I don't know enough about how tenancy works in that situation. I would assume that k8s is efficient enough that for your case: 1- clean controller, no CRs preloaded that you could simply write specs for 1 that setup and tear down a clean controller with no CRs preloaded in a Same with 2 and 3 - could you preload good/bad CRs into separate namespaces then set up the controllers after they are created - all in a Often times folks will reach for So, some questions:
|
I wanted to chime in that I have exactly the same issue, in the same context of Kubernetes operators. For me, the situation is basically this:
Currently, there seems to be no good way of achieving this other than moving things into different packages, but there's a certain taste to being forced to organize code differently because you can't separate the test setup well enough. One thought that crossed my mind was that it'd be kinda neat if the suite could select only tests that carry some label. We want to label tests using |
hey @Airblader First a quick comment:
I do (personally, subjectively) tend to prefer separating unit tests from integration tests (admittedly the terms can be squishy - I've written some very integrationy unit tests and vice versa). Two reasons for this preference:
Given that I've (again, personally, subjectively) preferred to keep them separate and in Go that means separate packages. That opinion certainly informs Ginkgo's design and default usage. But Ginkgo isn't particularly opinionated about this at all, and has evolved over the years to get pretty flexible. So...
You can do precisely this. And some relatively new stuff that landed in 2.8 makes it even more straightforward. You can use Labels to annotate your specs with metadata and then run // same entry point as always
func TestOperator(t *testing.T) {
RegisterFailHandler(Fail)
RunSpecs(t, "Operator Suite")
}
// pseudocode - but the idea is `client` is how you interact with the cluster
var client *k8s.Client
// we use SynchronizedBeforeSuite to only spin up the e2e cluster on one Ginkgo process
// we then pass the connection details to all processes so they can connect to the cluster
var _ = SynchronizedBeforeSuite(func() []byte {
var connectionDetailsJSON []byte = nil
// this tests to see if the current configured `-label-filter` will select tests labeled integration
// if so we spin up the cluster
if Label("integration").MatchesLabelFilter(GinkgoLabelFilter()) {
cluster := envtest.SpinUpCluster() // pseudo-code
DeferCleanup(cluster.Teardown) // this will clean up once all the tests complete
connectionDetailsJSON, _ = json.Marshal(cluster.ConnectionDetails)
}
return connectionDetailsJSON
}, func(connectionDetailsJSON []byte) {
//connectionDetailsJSON will be non-nil only if the integration tests are enabled
if connectionDetailsJSON != nil {
var connectionDetails k8s.ConnectionDetails
json.Unmarshal(connectionDetailsJSON, &connectionDetails)
//client will be non-nil
client = k8s.NewClient(connectionDetails)
}
})
var namespacedClient *k8s.NamespacedClient //pseudocode, obviously
// each integration test needs its own namespace, we accomplish that with a BeforeEach...
var _ = BeforeEach(func() {
// ...but we only want to set up the namespace if this is an integration test.
// so we check and if the current test has the integration label.
matches, _ := CurrentSpecReport().MatchesLabelFilter("integration");
if matches && client != nil {
namespacedClient = client.CreateRandomNamespace()
DeferCleanup(namespacedClient.Teardown)
}
}, OncePerOrdered) // we want the namespace to exist throughout the duration of an Ordered container if we use any
var _ = Describe("When a resource is requested", func() {
Context("these are integration tests", Label("integration"), func() {
It("makes a resource", func() {
client.MakeResource(...)
Eventually(http.Get).WithArguments("http://localhost:8080/resource").Should(HaveField("StatusCode", http.StatusOK))
})
It("lists existing resources", func() {
client.MakeResource(...)
Eventually(client.ListResources).Should(HaveLen(1))
})
})
Context("these are unit tests", func() {
It("handles the case where the resource request is malformed", func() {
Expect(operator.New(...)).To(MatchError(operator.SYNTAX_ERROR))
})
It("uses sane defaults when not specified", func() {
o := operater.New(...)
Expect(o.WidgetCount).To(BeNumerically("<", 50))
})
})
It("this is a standalone integration tests", Label("integration") func() {
//...
})
}) Now you can run all the tests (in parallel) with: ginkgo -p You can run just the unit tests with: ginkgo -p -label-filter="\!integration" and all the integration tests with: ginkgo -p -label-filter="integration" (a simple makefile or some other scripting solution will help you not have to retype all those things every time). The cluster and namespace will only be made when the integration tests will run. |
@onsi I like where this is going. However, I just want to have multiple suites running with different setups:
All of these residing in the same package as BeforeSuite (or SynchronizeBeforeSuite) and each spec , at any level of the node , to specify which suite to use as setup. default to Suite1 for example. Can that be possible somehow? |
I've been trying to create a public repo to demonstrate this, not yet done. |
hey @nabbas-ca - can i ask you to step back and frame this less in terms of a potential solution and more in terms of the problem you are trying to solve. (eg "i'd like to have the same tests point at different kinds of k8s clusters so i can validate that things work regardless of environment") that might help me brainstorm potential solutions for you. |
(i'm just concerned that i'm misunderstanding you and want to check the actual underlying problem statement) |
Yeah, I like to write unit tests for the operator (trying to shift the testing as left as possible, since we can mock a lot in golang and ginkgo) Here is how I would like to structure my unit tests in a sample operator (initialized and scaffolded by operator-sdk): 1- in package api/v1alpha1, webhooks are defined and created(operator-sdk scaffolding). Test Strategy:
2- in package controllers, controllers are defined. Test Strategy:
All of this, while structuring the unit test files to be adjacent to the code being unit tested. I like this because the code that runs the unit tests is adjacent, and can serve as code sample for usage of these methods at the same time. |
I apologize for the misformatting, i don't know how it got that way. |
OK - that's a fair bit. I'll try to help but first I need to better understand k8's
What I'm left wondering is: can we do all this, in parallel, with one cluster. Or do we need to have multiple clusters, one for each class of test. This is where my lack of knowledge of k8s isolation semantics comes into play. Let me be super concrete and specific: Can you register a controller/webhook with a particular k8s namespace? Can different namespaces with different configurations of controller/webhook exist on the same cluster? And will k8s ensure they are isolated? Moreover, is the registering of a controller/webhook reasonably efficient? If the answer to all this is "basically, yes" then I would structure the Ginkgo integration tests such that:
If the answer to these questions is "the isolation works, but setting up controllers is expensive" then I would do something like:
If the answer to these questions is "the isolation will not work" then I suspect the simplest solution will be to have multiple Ginkgo suites, each one spinning up its own We can dig into the rest of it further - but lets pause there and stress test some of these answers. What are your thoughts? |
also - if it would help - i'd be happy to find time for a zoom call since there's a lot of complexity here! |
I could definitely setup a zoom call for this (or webex if it is ok). I'm in Toronto, which is 2 hours ahead of you probably. How does your calendar look? I can answer some of the k8s questions here as well:
I like the idea of multiple envtests in the setup. I think that would solve all my problems. I could have a wrapper around each envTest as a test Suite. |
sweet - this is starting to make more sense to me. based on what you’re saying it does sound like you can get pretty far with one (or perhaps a couple of well-defined) shared cluster(s) and then have a few tests that need their own cluster on demand.
yes, give it a try and see how far you can push it!
I’m not following what this means, though. I’m wondering if what I mean by “test suite” (which is a very specific notion in Ginkgo) and what you mean might be different. Would be happy to do zoom/webex. I’m in Denver so, yeah - 2 hours behind you. I’m free much of tomorrow between 9 and 2 Denver time (so 11 and 4 your time). i’m on both the kubernetes and gophers slack instances so you can dm me there (handle is |
hey @Airblader - in case you ended up building off of that example. I had forgotten that you can do: CurrentSpecReport().MatchesLabelFilter("integration") I've updated the example with that. @nabbas-ca let me know if you'd like to connect or if you've been able to build of some of the idea we discussed. |
In case there's interest, please take a look at the new proposal for managing shared resources and having finer-grained control over parallelism here: #1292 |
On #130 @nabbas-ca wrote:
I want to write k8s controller/operator tests within 2 suites, one with envtest controlller manager initialized and listening, and one without. So I can have finegrained negative path testing. For example, I want to to be able to setup a mock k8s api with bad CRs(before the webhook is alive or registered), and see how my controller reacts to correct it , as unit tests.
This way, I can setup a set of suites, within the same package like this:
1- clean controller, no CRs preloaded
2- preloaded good CRs, setup controller after the CRs are created
3- preloaded bad CRs, setup controller after the CRs are created
.....
Is that possible? remember that envTest is expensive to start. making a BeforeEach at top level of tests that includes manager start, could be possiblle, but it means starting and stopping manager for every test.
The text was updated successfully, but these errors were encountered: