Add TGI additional options #402

yongfengdu · 2024-09-05T09:25:45Z

Add user configurable shm_size support.
Add interface for additional TGI cli parameters.
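
For context, a minimal sketch of how a configurable shared-memory size and extra TGI launcher arguments typically surface in a Kubernetes pod spec. This is an assumption about the general mechanism, not a copy of the chart's template: the pod name, image tag, model, and sizes below are illustrative placeholders.

```yaml
# Illustrative pod spec only; not the actual chart template from this PR.
apiVersion: v1
kind: Pod
metadata:
  name: tgi-example                # hypothetical name
spec:
  containers:
    - name: tgi
      image: ghcr.io/huggingface/text-generation-inference:latest  # tag is an assumption
      args:
        - --model-id
        - bigscience/bloom-560m    # placeholder model
        # Additional launcher parameters passed through by the user:
        - --dtype
        - bfloat16
      volumeMounts:
        - name: shm
          mountPath: /dev/shm      # shared memory seen by the container
  volumes:
    - name: shm
      emptyDir:
        medium: Memory             # memory-backed volume
        sizeLimit: 1Gi             # the user-configurable SHM size (placeholder value)
```

A memory-backed `emptyDir` mounted at `/dev/shm` is the usual way to raise a container's shared-memory limit above the runtime default.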

Description

The summary of the proposed changes as well as the relevant motivation and context.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

helm-charts/common/tgi/values.yaml

Add user configurable shm_size support. Add interface for additional TGI cli parameters. Signed-off-by: Dolpher Du <[email protected]>

daisy-ycguo

lgtm

eero-t · 2024-09-06T08:19:14Z

It seems that this is needed only when a model requires more memory than fits on a given device, i.e. when it needs to be sharded over multiple devices using something like DeepSpeed?

This means that it would need to go in the relevant top-level DEVICE-values.yaml files, where the model and device allocations are specified. That is, the model, the number of devices (>1) allocated for it, the matching SHM size, and the TGI sharding options all need to go hand in hand.
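
To illustrate the point above, here is a hypothetical device-specific values override in which the model, device count, SHM size, and sharding options are set together. The key names (`LLM_MODEL_ID`, `shmSize`, `extraCmdArgs`), the model ID, and all sizes are assumptions for illustration and should be checked against the merged chart; `--sharded` and `--num-shard` are TGI launcher options.

```yaml
# Hypothetical DEVICE-values.yaml fragment; key names and values are assumptions.
tgi:
  LLM_MODEL_ID: your-org/large-model   # placeholder for a model too large for one device
  resources:
    limits:
      habana.ai/gaudi: 4               # number of devices allocated (>1)
  shmSize: 8Gi                         # placeholder; size to match the sharded workload
  extraCmdArgs:
    - --sharded
    - "true"
    - --num-shard
    - "4"                              # should match the device count above
```

Keeping these four settings in one file makes it harder for the shard count and the device allocation to drift apart.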

yongfengdu requested review from zhlsunshine, KfreeZ and lianhao as code owners September 5, 2024 09:25

lianhao approved these changes Sep 5, 2024


eero-t reviewed Sep 5, 2024


helm-charts/common/tgi/values.yaml (outdated, resolved)

Add TGI additional options

f06a322

Add user configurable shm_size support. Add interface for additional TGI cli parameters. Signed-off-by: Dolpher Du <[email protected]>

yongfengdu force-pushed the tgifix branch from 885b87d to f06a322 Compare September 6, 2024 01:08

yongfengdu requested a review from eero-t September 6, 2024 07:35

daisy-ycguo approved these changes Sep 6, 2024


daisy-ycguo merged commit bf10bdd into opea-project:main Sep 6, 2024
12 checks passed

yongfengdu deleted the tgifix branch September 14, 2024 05:44

lianhao mentioned this pull request Nov 25, 2024

[ci-auto] Find common tgi user settings and make them configurable by different values #297

Closed
