[Core] Fix post teardown error #2005

Merged 1 commit on Jun 1, 2023

Collaborator

@Michaelvll Michaelvll commented Jun 1, 2023

The previous PR #1595 introduced a bug: the post-teardown step is executed even when the cluster has already been terminated. This causes issues for failover when the VM is actually launched.

To reproduce: sky launch --use-spot --cloud aws --gpus A100:8

Tested (run the relevant ones):

  • Any manual or new tests for this PR (please specify below)
    • sky launch --cloud aws --gpus A100:8 --use-spot
  • All smoke tests: pytest tests/test_smoke.py
  • Relevant individual smoke tests: pytest tests/test_smoke.py::test_fill_in_the_name
  • Backward compatibility tests: bash tests/backward_comaptibility_tests.sh

@Michaelvll Michaelvll added the P0 label Jun 1, 2023
@Michaelvll Michaelvll changed the title from "Fix post teardown error" to "[Core] Fix post teardown error" Jun 1, 2023
@Michaelvll Michaelvll requested a review from romilbhardwaj June 1, 2023 01:56
@Michaelvll Michaelvll merged commit fe11b8e into master Jun 1, 2023
@Michaelvll Michaelvll deleted the fix-post-teardown-error branch June 1, 2023 02:00
concretevitamin pushed a commit that referenced this pull request Jun 3, 2023
concretevitamin pushed a commit that referenced this pull request Jun 4, 2023
2 participants