Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix decoupled batch statistics to account for implicit batch size #361

Merged

Conversation

kthui
Copy link
Contributor

@kthui kthui commented May 22, 2024

Previous PRs:

The decoupled metrics reporting should consider implicit batch size when reporting batch statistics. Previously, the batch statistics uses the number of requests as batch size, which does not account for batched request(s). Now, it is modified to take the number of requests multiplied by its respective implicit batch size.

Next PRs:

@kthui
Copy link
Contributor Author

kthui commented May 31, 2024

@kthui kthui force-pushed the jacky-res-sender-fix-decouple-batch-stat branch from 89512e6 to d5e12b9 Compare May 31, 2024 22:16
@kthui kthui merged commit 4551e04 into jacky-res-sender-main May 31, 2024
3 checks passed
@kthui kthui deleted the jacky-res-sender-fix-decouple-batch-stat branch May 31, 2024 22:22
kthui added a commit that referenced this pull request Jun 6, 2024
* Add response sender to non-decoupled models and unify data pipelines (#360)

* Add response sender to non-decoupled model and unify data pipelines

* Rename variable and class name

* Fix decoupled batch statistics to account for implicit batch size (#361)

* Fix decoupled gpu output error handling (#362)

* Fix decoupled gpu output error handling

* Return full error string upon exception from model

* Response sender to check for improper non-decoupled model usage (#363)

* Response sender to check for improper non-decoupled model usage

* Force close response sender on exception

* Rename functions
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

2 participants