Added test timeout #211

MeirShpilraien · 2023-12-10T14:55:00Z

The PR adds a new option --test-timeout that allows set a test timeout (in seconds) after which the test will be considered as failed. The timeout works as follow:

Open a watcher thread that sleep for the given timeout
Once wake up, check if the test that is been watched finished, if not:
- Send a SIGUSR1 to the main processes thread that will cause it to print the current trace and enter a deep sleep.
- Shutdown the environment using SIGSEGV to make sure we will get all the needed information from the shards
- Print verbose information if needed (in case --verbose-information-on-failure was used).
- Exit the processes using os._exit(1). Notice that it is important to exit using os._exit(1), if we exit in any other way, python might wait for active connections or thread to be close. We are killing the processes in the middle of its execution and we have no idea in which state it hang, so we prefer to wait for nothing.

For backward compatibility, the default timeout is 0 which implies no timeout.

Notice, when choosing the best way to trigger a timeout, 2 approaches was tested:

Using a background watcher thread.
Using signal.setitimer.

Eventually, the first approach was chosen, mainly because if we use signal.setitimer, the test itself might also use it and override our timer and callback. I believe the thread approach is safer and more reliable.

Extra additions/fixes:

Progress bar indicating how many tests finished and how much is still left to run. Progress bar can be turned off using --no-progress. Progress bar will automatically turned off if --no-output-catch was used or if the stdout in not a terminal (output was redirected).
Fix issue where on some cases the same test would have appear multiple time in the summary report.

Technical Low Level Details on Progressbar

Till today, when running with parallelism on more than one. Each processes reported its own progress. The PR changes this approach in way that only the main processes reports the progress and each sub-processes reports to the main processes. This gives us 2 main adventages:

Avoid print collisions between processes
Ability to report progress

To achieve this, each sub-processes introspects its own stdout and send the tests output to the main processes on a new channel called results. When the main processes gets a message on the results channel, it prints its connect to the stdout and increase the progress bar.

When running without parallelism, the output is printed to the stdout right away.

To avoid code duplication with the parallel and the none parallel flow, we extracted the code that runs a single tests to its own function, run_single_test, and we call it from the 2 different flows: run_jobs_main_thread, run_jobs.

codecov-commenter · 2023-12-10T15:48:17Z

Codecov Report

Attention: 211 lines in your changes are missing coverage. Please review.

Comparison is base (409e17c) 34.16% compared to head (2eb4f1b) 32.28%.

Files	Patch %	Lines
RLTest/__main__.py	0.00%	183 Missing ⚠️
RLTest/redis_std.py	10.71%	25 Missing ⚠️
RLTest/redis_cluster.py	33.33%	2 Missing ⚠️
RLTest/env.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #211      +/-   ##
==========================================
- Coverage   34.16%   32.28%   -1.89%     
==========================================
  Files          17       17              
  Lines        2409     2565     +156     
==========================================
+ Hits          823      828       +5     
- Misses       1586     1737     +151

Flag	Coverage Δ
unittests	`32.28% <2.31%> (-1.89%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

GuyAv46 · 2023-12-11T09:30:16Z

RLTest/__main__.py

+        for i in range(num_elements):
+            if bar:
+                bar.update(i)
+            yield i
+        if bar:
+            bar.update(num_elements)


shouldn't this be indented? and maybe give up the if bar: then if ProgressBar cannot return None

GuyAv46 · 2023-12-11T10:01:04Z

RLTest/__main__.py

@@ -558,15 +627,12 @@ def addFailure(self, name, failures=None):
            failures = [failures]
        if not failures:
            failures = []
-        self.testsFailed.append([name, failures])
+        self.testsFailed.setdefault(name, []).extend(failures)

    def getTotalFailureCount(self):


rename functions

GuyAv46 · 2023-12-11T10:33:33Z

RLTest/__main__.py

                currPort += 30 # safe distance for cluster and replicas
                processes.append(p)
                p.start()
+            for _ in self.progressbar(n_jobs):
+            # for _ in range(n_jobs):


GuyAv46 · 2023-12-11T10:36:06Z

RLTest/__main__.py

+                    except Exception as e:
+                        if not has_live_processor:
+                            raise Exception('Failed to get job result and no more processors is alive')
+                _ = res['test_name']


GuyAv46 · 2023-12-11T10:37:49Z

RLTest/__main__.py

+                for test_name, failures in res['failures'].items():
+                    self.testsFailed[test_name] = failures


Suggested change

for test_name, failures in res['failures'].items():

self.testsFailed[test_name] = failures

self.testsFailed.update(res['failures'])

Added test timeout

ba4e0f8

MeirShpilraien requested review from alonre24 and GuyAv46 December 10, 2023 14:55

Update poetry.lock

16905b4

MeirShpilraien mentioned this pull request Dec 10, 2023

MOD-6023: Print running tests #198

Closed

MeirShpilraien added 3 commits December 10, 2023 17:19

Notify the watcher thread when test ends.

06b6cc1

Only enable progressbar when stdout is terminal

de15edb

Avoid progress bar update if not exists

2eb4f1b

GuyAv46 reviewed Dec 11, 2023

View reviewed changes

Review fixes

76511b5

MeirShpilraien requested a review from GuyAv46 December 11, 2023 12:05

GuyAv46 approved these changes Dec 11, 2023

View reviewed changes

MeirShpilraien merged commit b5295ee into master Dec 11, 2023
23 checks passed

MeirShpilraien deleted the test_timeout branch December 11, 2023 14:41

tezc mentioned this pull request Dec 17, 2023

Use fixed version of RLTest v0.7.5 RedisTimeSeries/RedisTimeSeries#1552

Merged

MeirShpilraien mentioned this pull request Dec 18, 2023

Disallow combining parallel execution with --no-output-catch #213

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added test timeout #211

Added test timeout #211

MeirShpilraien commented Dec 10, 2023 •

edited

Loading

codecov-commenter commented Dec 10, 2023 •

edited

Loading

GuyAv46 Dec 11, 2023

GuyAv46 Dec 11, 2023

GuyAv46 Dec 11, 2023

GuyAv46 Dec 11, 2023

GuyAv46 Dec 11, 2023

		for test_name, failures in res['failures'].items():
		self.testsFailed[test_name] = failures

	for test_name, failures in res['failures'].items():
	self.testsFailed[test_name] = failures
	self.testsFailed.update(res['failures'])

Added test timeout #211

Added test timeout #211

Conversation

MeirShpilraien commented Dec 10, 2023 • edited Loading

Technical Low Level Details on Progressbar

codecov-commenter commented Dec 10, 2023 • edited Loading

Codecov Report

GuyAv46 Dec 11, 2023

Choose a reason for hiding this comment

GuyAv46 Dec 11, 2023

Choose a reason for hiding this comment

GuyAv46 Dec 11, 2023

Choose a reason for hiding this comment

GuyAv46 Dec 11, 2023

Choose a reason for hiding this comment

GuyAv46 Dec 11, 2023

Choose a reason for hiding this comment

MeirShpilraien commented Dec 10, 2023 •

edited

Loading

codecov-commenter commented Dec 10, 2023 •

edited

Loading