Is your feature request related to a problem? Please describe.
Currently, workers that run inference must load the ML model into memory in order to pass it to the host. This limits the number of models we can run, as 32-bit Wasm has a 4 GB memory limit.
Describe the solution you'd like
The WASI-NN proposal recently introduced the "Named Models" feature. This feature allows the host to preload a set of models and expose them to modules under a label. A module no longer needs to load a model into memory; it only needs to reference the model by the right label.
`wws` can take advantage of this feature by adding new configuration parameters to the `feature.wasi_nn` object. Then, `wws` will preload those models and expose them to the workers.
The `wasmtime-wasi-nn` and `wasi-nn` (see example) crates already support this feature.
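As a rough illustration of the configuration side, a preloaded-models section in the worker's TOML config could look like the sketch below. All key names (`preload_models`, `label`, `encoding`, `target`, `path`) and the model paths are hypothetical, not a proposed final syntax:

```toml
# Hypothetical sketch: key names and paths are assumptions, not final wws syntax.
[features.wasi_nn]
# Models the host preloads at startup and exposes to workers by label.
preload_models = [
  { label = "squeezenet", encoding = "openvino", target = "cpu", path = "models/squeezenet" },
  { label = "mobilenet", encoding = "openvino", target = "cpu", path = "models/mobilenet" },
]
```

A worker would then reference a model by its label (e.g. `"squeezenet"`) through the WASI-NN load-by-name call, instead of reading the model bytes into its own linear memory.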
Describe alternatives you've considered
No response
Additional context
No response