Follow up to autocomplete pr #317 #320

oandreeva-nv · 2023-11-03T22:27:14Z

This is a follow up PR to : #317 . Unfortunately, I missed essential part.

After we retrieved updated model configuration from core during the call:

Line 1864 in 0f12211

RETURN_IF_ERROR((*state)->SetModelConfig());

decoupled_ is not updated on the python backend site, since it is a part of ModelState.

I've added PropagateAutoCompletedConfig() that does exactly that, i.e. looks for model_transaction_policy in the model config (already filled by core), and if decoupled is found in model_transaction_policy , we update decoupled_ property.

Actual functionality is tested with vllm backend on this PR: triton-inference-server/vllm_backend#20

src/python_be.h

dyastremsky

Great work!

oandreeva-nv added 3 commits November 3, 2023 15:04

Fllow up with error msg

4ea37f6

Setting decoupled after autocomplete in ModelState:Create

da1266a

Refactor

c83ab69

oandreeva-nv requested review from krishung5, dyastremsky, tanmayv25 and rmccorm4 November 3, 2023 22:27

Refactor according to Tanmay discussion

ed20a3d

oandreeva-nv commented Nov 3, 2023

View reviewed changes

src/python_be.h Show resolved Hide resolved

dyastremsky approved these changes Nov 6, 2023

View reviewed changes

krishung5 approved these changes Nov 6, 2023

View reviewed changes

tanmayv25 approved these changes Nov 8, 2023

View reviewed changes

tanmayv25 merged commit 60a9091 into main Nov 8, 2023

tanmayv25 deleted the oandreeva_autocomplete_followup branch November 8, 2023 04:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Follow up to autocomplete pr #317 #320

Follow up to autocomplete pr #317 #320

oandreeva-nv commented Nov 3, 2023

dyastremsky left a comment

Follow up to autocomplete pr #317 #320

Follow up to autocomplete pr #317 #320

Conversation

oandreeva-nv commented Nov 3, 2023

dyastremsky left a comment

Choose a reason for hiding this comment