Commit

Remove mentioning increase instance count case
kthui committed Jul 11, 2023
1 parent 7a2b967 commit 5af5584
Showing 1 changed file with 0 additions and 3 deletions.
3 changes: 0 additions & 3 deletions docs/user_guide/model_management.md
@@ -229,9 +229,6 @@ the model file), Triton does not guarentee any remaining request(s) from the
in-flight sequence(s) will be routed to the same model instance for processing.
It is currently the responsibility of the user to ensure any in-flight
sequence(s) are completed before reloading a sequence model.
-* If a sequence model is *updated* (i.e. increasing/decreasing the instance
-count), Triton will wait until the in-flight sequence is completed (or
-timed-out) before the instance behind the sequence is removed.

## Concurrently Loading Models
