Improve test reproducibility + Decaffeinate #45
Merged
While working on the tests, our goal was to not break any more tests than were already failing, previously stated as 21 failed tests for Package Testing and 51 failed tests for Editor Testing.
But reproducing those numbers proved unreliable. After seeing that none of our currently open PRs matched these numbers in any meaningful way, I started some testing.
All testing below was done on the `master` branch, on Node.js 16.x (16.8.0 on Linux, 16.16.0 on Windows). Every run was performed immediately after the previous one, with no changes to the code in between.
Below are the results (editor tests only):
Linux:
Windows:
As you can see above, there is significant variability in the results of these tests. While trying to find out why, one simple change to the test runner itself produced much more stable results, at least on Windows: decaffeinating the test runner. With just this change, consecutive Windows runs varied only between 33 and 34 failed tests, across more than 15 runs.
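For reference, the conversion itself is done with the `decaffeinate` CLI, which rewrites a `.coffee` file to an equivalent `.js` file next to it. A minimal sketch (the file name here is illustrative, not the actual path of our test runner):

```shell
# Create a tiny CoffeeScript file standing in for the test runner
# (illustrative only; the real runner lives in the repo).
printf 'add = (a, b) -> a + b\nconsole.log add 1, 2\n' > runner.coffee

# decaffeinate writes runner.js alongside the input file.
npx decaffeinate runner.coffee

# The converted file runs under plain Node.js.
node runner.js
```

Once the converted output is verified, the original `.coffee` file can be deleted and any references to it updated.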
With this single change we can improve the reliability of the tests. Ideally, we can then set exact numbers for how many tests are allowed to fail, or alternatively block PRs until all tests pass, making the results simple to rely on.
(Although for the latter option we would also need to sort out what stops many GitHub Actions runs from installing successfully.)