Fix test_logging_to_driver and test_not_logging_to_driver #5462

raulchen · 2019-08-16T04:17:29Z

Why are these changes needed?

These 2 tests are flaky on CI. Because sometimes the previous autoscaler tests will start background threads and print the following errors to stderr.

Traceback (most recent call last):
  File "/home/travis/miniconda/lib/python2.7/threading.py", line 801, in __bootstrap_inner
    self.run()
  File "/home/travis/build/ray-project/ray/python/ray/autoscaler/updater.py", line 151, in run
    raise e
AssertionError: Unable to SSH to node

The purpose of these tests should be to verify that the logs are redirected (or not redirected) to driver stdout. So there's no need to check stderr. However, we should also fix the issue of not stopping autoscaler background threads in a different PR.

What do these changes do?

Related issue number

Linter

I've run scripts/format.sh to lint the changes in this PR.

AmplabJenkins · 2019-08-16T08:36:54Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16326/
Test FAILed.

robertnishihara · 2019-08-16T18:19:49Z

@raulchen @ericl how hard would it be to make sure the autoscaler test shuts down properly? That seems like the right fix.

robertnishihara · 2019-08-16T18:21:49Z

python/ray/tests/test_basic.py

@@ -2649,8 +2649,6 @@ def f():
    output_lines = captured["out"]
    for i in range(200):
        assert str(i) in output_lines
-    error_lines = captured["err"]
-    assert len(error_lines) == 0


This makes me realize that this line should have been

assert len(error_lines) == 0, error_lines

so that we can what the stderr was in the case of error

robertnishihara · 2019-08-16T18:23:21Z

python/ray/tests/test_basic.py

@@ -2649,8 +2649,6 @@ def f():
    output_lines = captured["out"]
    for i in range(200):
        assert str(i) in output_lines
-    error_lines = captured["err"]


If we remove this check, then we should include a comment that explains what goes wrong if we do check it. Since people (myself included) will be very tempted to bring back this check.

robertnishihara · 2019-08-16T18:23:38Z

@raulchen @ericl how hard would it be to make sure the autoscaler test shuts down properly? That seems like the right fix.

robertnishihara · 2019-08-16T18:24:40Z

python/ray/tests/test_basic.py

@@ -2649,8 +2649,6 @@ def f():
    output_lines = captured["out"]
    for i in range(200):
        assert str(i) in output_lines


Unrelated to the test failure, but in this test we should really be checking that we don't get any unintended log messages (or duplicates). Especially since @stephanie-wang saw some duplicates recently.

raulchen · 2019-08-17T07:06:25Z

@raulchen @ericl how hard would it be to make sure the autoscaler test shuts down properly? That seems like the right fix.

I agree that autoscaler issue should be fixed. But I'm not familiar with autoscaler and don't know how to fix that.
Looking at this test_logging_to_driver test, I think its purpose is to verify that the worker logs will be sent to driver stdout. So I think this test doesn't need to care about stderr. And a more accurate way to test this is to mock the print_logs functions, instead of just checking stdout output.
But for now, I think this PR is enough for unblocking the CI first.

robertnishihara

@raulchen this looks good to me. I pushed an additional comment. Does that look good to you?

raulchen · 2019-08-17T07:44:23Z

@robertnishihara thanks. looks good

AmplabJenkins · 2019-08-17T11:48:30Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16363/
Test PASSed.

Fix test_logging_to_driver and test_not_logging_to_driver

ad952a4

raulchen requested a review from robertnishihara August 16, 2019 04:17

robertnishihara reviewed Aug 16, 2019

View reviewed changes

Update test_basic.py

8323190

robertnishihara approved these changes Aug 17, 2019

View reviewed changes

raulchen merged commit 03d05c8 into ray-project:master Aug 17, 2019

raulchen deleted the fix_logging_tests branch August 17, 2019 10:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix test_logging_to_driver and test_not_logging_to_driver #5462

Fix test_logging_to_driver and test_not_logging_to_driver #5462

raulchen commented Aug 16, 2019 •

edited

Loading

AmplabJenkins commented Aug 16, 2019

robertnishihara commented Aug 16, 2019

robertnishihara Aug 16, 2019

robertnishihara Aug 16, 2019

robertnishihara commented Aug 16, 2019

robertnishihara Aug 16, 2019

raulchen commented Aug 17, 2019

robertnishihara left a comment

raulchen commented Aug 17, 2019

AmplabJenkins commented Aug 17, 2019

Fix test_logging_to_driver and test_not_logging_to_driver #5462

Fix test_logging_to_driver and test_not_logging_to_driver #5462

Conversation

raulchen commented Aug 16, 2019 • edited Loading

Why are these changes needed?

What do these changes do?

Related issue number

Linter

AmplabJenkins commented Aug 16, 2019

robertnishihara commented Aug 16, 2019

robertnishihara Aug 16, 2019

Choose a reason for hiding this comment

robertnishihara Aug 16, 2019

Choose a reason for hiding this comment

robertnishihara commented Aug 16, 2019

robertnishihara Aug 16, 2019

Choose a reason for hiding this comment

raulchen commented Aug 17, 2019

robertnishihara left a comment

Choose a reason for hiding this comment

raulchen commented Aug 17, 2019

AmplabJenkins commented Aug 17, 2019

raulchen commented Aug 16, 2019 •

edited

Loading