fix: Models should filter outputs based on requested outputs #366

Merged
merged 3 commits into from
Jun 12, 2024

Conversation

kthui
Contributor

@kthui kthui commented Jun 10, 2024

What does the PR do?

Models on the Python backend should return only the outputs that the client requested.
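As a rough illustration of the intended behavior, the sketch below prunes a list of produced outputs down to the names a client asked for. It is plain Python and not the code from this PR (the reviewed files, such as src/response_sender.cc, are C++). The `Tensor` class and the `prune_outputs` helper are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Tensor:
    """Hypothetical stand-in for an output tensor: a name plus its data."""
    name: str
    data: list


def prune_outputs(outputs: Iterable[Tensor], requested_names: Iterable[str]) -> List[Tensor]:
    """Keep only the outputs whose names were requested, preserving order."""
    requested = set(requested_names)
    return [t for t in outputs if t.name in requested]


# The model produced two outputs, but the client asked only for OUTPUT1.
produced = [Tensor("OUTPUT0", [1, 2]), Tensor("OUTPUT1", [3, 4])]
kept = prune_outputs(produced, ["OUTPUT1"])
print([t.name for t in kept])  # ['OUTPUT1']
```

Filtering by name keeps the model code unchanged: the model can still produce every output, and the pruning step decides what goes back to the client.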

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging ref.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

Check the conventional commit type box here and add the label to the GitHub PR.

  • build
  • ci
  • docs
  • feat
  • fix
  • perf
  • refactor
  • revert
  • style
  • test

Related PRs:

triton-inference-server/server#7338

Where should the reviewer start?

N/A

Test plan:

Add tests to L0_backend_python that confirm both non-decoupled and decoupled models filter their outputs down to the requested outputs.

  • CI Pipeline ID:
    Refer to the related server PR.
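The kind of check such a test might make can be sketched as follows. This is not the L0_backend_python test itself; it is a self-contained illustration in which each response is modeled as a plain dict from output name to data, and `unrequested_outputs` is a hypothetical helper. It covers a single response (non-decoupled) and a stream of responses (decoupled).

```python
from typing import Dict, Iterable, List


def unrequested_outputs(
    responses: Iterable[Dict[str, list]], requested_names: Iterable[str]
) -> List[str]:
    """Return names of outputs present in any response but not requested."""
    requested = set(requested_names)
    extra: List[str] = []
    for response in responses:
        for name in response:
            if name not in requested and name not in extra:
                extra.append(name)
    return extra


# Non-decoupled case: exactly one response for the request.
single = [{"OUTPUT0": [1, 2]}]
# Decoupled case: several responses for one request; the last one leaks OUTPUT1.
stream = [{"OUTPUT0": [1]}, {"OUTPUT0": [2]}, {"OUTPUT0": [3], "OUTPUT1": [9]}]

print(unrequested_outputs(single, ["OUTPUT0"]))  # []
print(unrequested_outputs(stream, ["OUTPUT0"]))  # ['OUTPUT1']
```

A test along these lines would expect an empty list in both cases once filtering is in place.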

Caveats:

N/A

Background

The non-decoupled data pipeline previously filtered outputs based on the requested outputs, so this PR essentially restores that behavior after non-decoupled models were switched to the decoupled data pipeline.

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

N/A

@kthui kthui added the bug Something isn't working label Jun 10, 2024
@kthui kthui marked this pull request as ready for review June 11, 2024 16:11
@kthui kthui requested review from Tabrizian, rmccorm4 and GuanLuo June 11, 2024 16:15
src/response_sender.cc (outdated, resolved)
src/infer_request.cc (outdated, resolved)
src/infer_request.cc (outdated, resolved)
@kthui kthui changed the title from "fix: Non-decoupled models should filter outputs based on requested outputs" to "fix: Models should filter outputs based on requested outputs" Jun 11, 2024
@kthui kthui merged commit bfabfdb into main Jun 12, 2024
3 checks passed
@kthui kthui deleted the jacky-py-req-output branch June 12, 2024 16:35
rmccorm4 pushed a commit that referenced this pull request Jun 14, 2024
* Prune non requested outputs from non-decoupled models

* Prune non requested outputs from decoupled models

* [chore] Remove redundant copy
nv-kmcgill53 pushed a commit that referenced this pull request Jun 17, 2024
…367)

* Prune non requested outputs from non-decoupled models

* Prune non requested outputs from decoupled models

* [chore] Remove redundant copy
Labels
bug Something isn't working
4 participants