Catalogue cache improvements #329

domcleal · 2015-10-22T09:44:35Z

Fixes two issues in the catalogue cache:

1. Clones the facts hash to stop the _timestamp fact (dynamically generated by Puppet 3.x) from being part of the cache key. This was preventing the catalogue cache from having any effect on adjacent examples on 3.x. Puppet 4 no longer adds this fact.
2. Limits the size of the cache, preventing memory exhaustion when many different sets of facts and parameters are tested.

Originally reported at #215 (comment).

I'm not sure about the choice of 16 catalogues. I was tempted to only cache a single catalogue, which would allow adjacent examples (with identical parameters) to get the benefit of caching, but I suppose this might allow for some caching in more complex specs with non-linear layouts of examples (e.g. testing parameter A, then parameter B, then parameter A again).

In Puppet 3, the Puppet::Node::Facts class adds a _timestamp fact to the facts hash, causing constant catalogue cache invalidation between spec examples. This was removed in Puppet 4 via PUP-3130. By calling #dup on the facts hash, the facts in the cache key are no longer modified and the catalogue cache works as expected.
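
To illustrate the commit message above, here is a minimal sketch of why a mutated facts hash defeats the cache and why passing a copy avoids that. `FakeCompiler` and `cached_build` are hypothetical names invented for this example, not rspec-puppet or Puppet APIs, and a build counter stands in for the real `_timestamp` value so the result is deterministic.

```ruby
# Hypothetical stand-in for Puppet 3's behaviour: building a catalogue
# mutates the facts hash it receives by adding a _timestamp entry.
class FakeCompiler
  attr_reader :builds

  def initialize
    @builds = 0
  end

  def build(facts)
    @builds += 1
    facts['_timestamp'] = @builds # deterministic stand-in for a timestamp
    "catalogue for #{facts['osfamily']}"
  end
end

# Hypothetical cache wrapper. The facts hash is part of the cache key, so if
# the builder mutates that same object, the stored key gains a _timestamp
# entry and a later lookup with fresh, equal facts misses.
def cached_build(cache, compiler, facts, protect_key:)
  key = [facts]
  cache[key] ||= compiler.build(protect_key ? facts.dup : facts)
end

unprotected = FakeCompiler.new
cache = {}
2.times { cached_build(cache, unprotected, { 'osfamily' => 'Debian' }, protect_key: false) }
puts "without dup: #{unprotected.builds} builds"

guarded = FakeCompiler.new
cache = {}
2.times { cached_build(cache, guarded, { 'osfamily' => 'Debian' }, protect_key: true) }
puts "with dup: #{guarded.builds} builds"
```

Without the copy, both calls compile a catalogue; with it, the second call is served from the cache.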

DavidS · 2015-10-22T09:47:56Z

spec/classes/catalogue_cache_spec.rb

+      end
+    end
+
+    (16..20).each do |i|


shouldn't that be (5..20) to cover all bases?

It should be (9..20) or even (9..20) + (1..4), as the 1..20 will have invalidated the first four, then 1..4 will have invalidated the next four. What's left should be the original 9..20 (since the first eight are now invalid), plus 1..4.
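
The arithmetic in the comment above can be checked with a small simulation. This is only a sketch: `BoundedCache` is a hypothetical class written for this illustration, assuming a 16-entry cache that evicts the oldest-added key first; it is not the code under review.

```ruby
# Hypothetical 16-entry cache that drops the oldest-added key on overflow.
class BoundedCache
  def initialize(limit)
    @limit = limit
    @store = {} # Ruby hashes preserve insertion order
  end

  # Return the cached value, or build it with the block and store it.
  def fetch(key)
    return @store[key] if @store.key?(key)

    @store[key] = yield
    @store.shift while @store.size > @limit # evict oldest entries
    @store[key]
  end

  def keys
    @store.keys
  end
end

cache = BoundedCache.new(16)
(1..20).each { |i| cache.fetch(i) { "catalogue #{i}" } } # evicts 1..4
(1..4).each  { |i| cache.fetch(i) { "catalogue #{i}" } } # misses, evicts 5..8
p cache.keys
```

Under these assumptions the surviving keys are 9 through 20 followed by 1 through 4, which matches the range suggested in the comment.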

DavidS · 2015-10-22T09:50:08Z

lib/rspec-puppet/support.rb

-      @@cache[args] ||= self.build_catalog_without_cache(*args)
+      unless @@cache.has_key? args
+        # Keep only the most recently added 16 entries to prevent high memory consumption
+        expire_cache(@@cache, @@cache_lra, 15)


It makes me sad that the API here requires the caller to do math in their head.

Ah, the off-by-one error?

Yes.

PS: "The two big problems in computer science: Cache Invalidation, Naming, and Off-by-one Errors."

I've moved the line down so that it's below the new addition to the cache, meaning 16 can be passed into expire_cache. Since it's the most recently added, there's no danger of it being immediately expired again.
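
A rough sketch of the two orderings discussed in this thread follows. The body of `expire_cache` here is an assumption modelled only on the call shown in the diff (a cache hash, an array of keys in insertion order, and a limit); the gem's actual implementation may differ, and `expire_then_add` and `add_then_expire` are hypothetical wrappers.

```ruby
# Assumed shape of the helper: drop the oldest keys until at most
# `limit` entries remain.
def expire_cache(cache, lra, limit)
  cache.delete(lra.shift) while lra.size > limit
end

# Original ordering: expire first, so the caller must pass 15 to end up
# with 16 entries once the new one has been added.
def expire_then_add(cache, lra, key, value)
  expire_cache(cache, lra, 15)
  cache[key] = value
  lra << key
end

# Revised ordering: add first, then expire down to the real limit of 16.
# The new key is the most recently added, so it is never the one removed.
def add_then_expire(cache, lra, key, value)
  cache[key] = value
  lra << key
  expire_cache(cache, lra, 16)
end

before = {}
before_order = []
(1..20).each { |i| expire_then_add(before, before_order, i, "catalogue #{i}") }

after = {}
after_order = []
(1..20).each { |i| add_then_expire(after, after_order, i, "catalogue #{i}") }

puts before.size
puts after.size
```

Both orderings keep the same 16 entries; the second simply lets the call site state the limit it actually means.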

Heh, two of three in one PR's not bad going!

we could bikeshed the name :P

The catalogue cache, based on facts, code etc, previously kept all known catalogues in a class variable without expiration. Large specs with many permutations (e.g. multiple OSes, different class or Hiera parameters) caused many catalogues to remain in memory and the test process grew to a large size. This limits the cache to keep only the last 16 entries, which should allow for a bit of catalog reuse within a single spec file.

DavidS · 2015-10-22T11:02:33Z

I like the code change, and I'll try it out tonight. Then merge it.

domcleal · 2015-10-22T12:09:29Z

Thanks for reviewing, David. My own unscientific test on Travis CI for one of our modules just completed with interesting results: most runs are significantly quicker, and a couple of runs that I think were being OOM-killed are now passing.

Before: https://travis-ci.org/theforeman/puppet-foreman/builds/85269582
After: https://travis-ci.org/domcleal/puppet-foreman/builds/86802997

igalic · 2015-10-22T12:44:13Z

@domcleal why on earth are your tests running for 5 hours?!

domcleal · 2015-10-22T12:46:29Z

@igalic I think that's totalling the runtime from all parts of the matrix. They're quite large test suites which use rspec-puppet-facts to run examples on each supported OS. The wall time was closer to 1-2 hours.

DavidS · 2015-10-22T15:04:51Z

This improved the local runtime of the puppetlabs-apache spec tests from 11 minutes 32 seconds to slightly over 8 minutes.

Excellent catch, @domcleal, thanks!

Catalogue cache improvements

DavidS reviewed Oct 22, 2015
domcleal mentioned this pull request Oct 22, 2015

Cached catalogs + hiera-puppet-helper #215

Closed

DavidS reviewed Oct 22, 2015
domcleal force-pushed the cache branch from ceb24b4 to 5c54210 Compare October 22, 2015 10:07

domcleal mentioned this pull request Oct 22, 2015

Rspec helper functions theforeman/foreman-installer-modulesync#17

Merged

DavidS added a commit that referenced this pull request Oct 22, 2015

Merge pull request #329 from domcleal/cache

7659172

Catalogue cache improvements

DavidS merged commit 7659172 into rodjek:master Oct 22, 2015

mmoll mentioned this pull request Nov 25, 2015

new release? #332

Closed

domcleal mentioned this pull request Nov 28, 2015

Remove catalogue cache instance variables after each example #333

Merged

alexjfisher mentioned this pull request Jul 11, 2020

Disable RSpec/MultipleExpectations voxpupuli/modulesync_config#658

Merged
