Keep support for model v1 within the rust client #593

Open
yaniv5678 opened this issue Jul 24, 2024 · 9 comments
Labels
rust

Comments

@yaniv5678

Hi,
Is there any possibility of adding support for running the v1 version of the model? Maybe behind a Cargo.toml feature.

Thanks!

@invernizzi added the rust label on Aug 8, 2024
@reyammer
Collaborator

Hello,

What would be the use case for this? Ensuring backward compatibility? Or did you notice specific issues with v2? Additional context would help!

@yaniv5678
Author

As far as I've seen, v2 is slower than v1 since it is a larger model, so we prefer using v1 when working with data at scale :)

@reyammer
Collaborator

Thanks for the feedback, this is very useful! /cc @ia0 @invernizzi

We do have a smaller version of our v2 models (https://github.com/google/magika/tree/main/assets/models/fast_v2_1). It can easily be used by the python module (by pointing model_dir to it), but it's not yet integrated with the rust codebase (and doing so is currently not trivial).

We'll discuss internally how to approach this. In the meantime, please let us know if you have additional context to share. For example, it seems you are integrating the magika rust CLI within an existing pipeline... in which language is this pipeline written? If, for example, the pipeline is written in python, the python module would be the way to go: most of rust's performance improvement comes from avoiding the one-off startup time, and once everything is loaded in python, the inference time of rust vs. python should be roughly the same.

@reyammer changed the title from "Model v1" to "Keep support for model v1 within the rust client" on Sep 19, 2024
@era

era commented Jan 8, 2025

Hi @reyammer, I also noticed the performance difference between v1 and v2. I enabled fast_v2_1 by changing the symlink and making the hardcoded values in model.rs match config.min.json.

It seems to work fine :).

So I'm guessing the code changes needed to support fast_v2_1 are done?

Anyway, I'm commenting here in the hope that this helps people who want a faster model than v2.

@reyammer
Collaborator

So I'm guessing the code changes needed to support fast_v2_1 are done?

Yes, the rust client supports all v2 models, and what you did is the way to use the fast model.

We are thinking about how to allow the rust client to pick which model to use, and how to prioritize this against other features.

In addition, we may soon have another model that is significantly faster with pretty much the same accuracy (still WIP / in testing at the moment), so maybe this problem will solve itself soon.

@yaniv5678
Author

Yeah @era, that's what I've also done; it works well indeed :)

@reyammer note that on Windows machines the symlink does not work well; I think it's related to how git handles symlinks.
On a Windows machine, the checked-out link file contains the symlink's relative target path as plain text, instead of being a proper Windows link.

And regarding the new model - sounds really cool that it's going to be even faster! Looking forward to it.

@ia0
Member

ia0 commented Jan 11, 2025

note that on Windows machines the symlink does not work well

Indeed, we don't support development on Windows; only the published library and binary are cross-platform. However, this can easily be fixed by duplicating the model as rust/lib/src/model.onnx (with a CI script to make sure it stays in sync) and avoiding rust/gen/model entirely by using a constant in the code.

@reyammer do we want to support development on Windows? I can make a PR.
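
For illustration, such a constant could be as simple as the sketch below (the constant name and file layout are made up here, not the actual repo structure); `include_bytes!` resolves its path relative to the containing source file, so no symlink is involved:

```rust
// Sketch: embed the ONNX model at compile time instead of resolving a
// symlink under rust/gen/model. The path is relative to this source file,
// so a CI check only needs to keep the duplicated file in sync.
pub(crate) const MODEL: &[u8] = include_bytes!("model.onnx");
```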

@era

era commented Jan 13, 2025

However, this can easily be fixed by duplicating the model as rust/lib/src/model.onnx (with a CI script to make sure it stays in sync) and avoiding rust/gen/model entirely by using a constant in the code.

@reyammer @ia0 maybe a similar idea could be used to support different models? Gate the different models behind a crate feature, together with the needed code changes. Here is a quick and dirty example of what I mean: https://github.com/google/magika/compare/main...era:magika:main?expand=1 (in case the link does not work, the change is split across two commits on that branch)

The only catch is that features in Rust must be additive, so, although unlikely, the code would need to support people who compile it with multiple models enabled: https://doc.rust-lang.org/cargo/reference/features.html#feature-unification
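
A minimal sketch of what such feature gating could look like (feature names, file paths, and the helper function are all invented for illustration, not the actual crate layout):

```rust
// One cargo feature per bundled model, e.g. in Cargo.toml:
//   [features]
//   default = ["model-standard-v2"]
//   model-standard-v2 = []
//   model-fast-v2 = []

#[cfg(not(any(feature = "model-standard-v2", feature = "model-fast-v2")))]
compile_error!("enable at least one model feature");

#[cfg(feature = "model-standard-v2")]
pub const STANDARD_V2: &[u8] = include_bytes!("standard_v2.onnx");

#[cfg(feature = "model-fast-v2")]
pub const FAST_V2: &[u8] = include_bytes!("fast_v2.onnx");

/// Cargo unifies features across the dependency graph, so both models may
/// be compiled in at once; the default stays well-defined in that case by
/// preferring the standard model.
pub fn default_model() -> &'static [u8] {
    #[cfg(feature = "model-standard-v2")]
    return STANDARD_V2;
    #[cfg(all(feature = "model-fast-v2", not(feature = "model-standard-v2")))]
    return FAST_V2;
}
```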

Anyway, thank you all for the hard work and looking forward to the faster model 🚀

@ia0
Member

ia0 commented Jan 13, 2025

Gate the different models behind a crate feature

Yes, that's one option, but we may want to go further (depending on user needs). The current design I have in mind is the following (a rough sketch follows the list):

  • Make the library generic over the actual model.
  • Support "static dispatch": The library ships with a set of models backed in the source. You have compile-time guarantees on which content types the model can return.
  • Support "dynamic dispatch": You can load a model from a file. It doesn't need to be shipped with the library.
  • Gate which models end up in the static set with a cargo feature for each model. (For those who care about code size.)
  • Gate the dynamic dispatch behind a cargo feature. (Same rationale.)

(Note that cargo features are additive with this option.)
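
To make this concrete, here is a rough sketch of what that design could look like (all names and paths are invented for illustration, not the actual magika API):

```rust
/// Sketch: the library is generic over the model.
pub trait Model {
    /// The set of content types this model can return. For models baked
    /// into the source this is a concrete enum, so callers get
    /// compile-time guarantees on the label set.
    type ContentType;

    fn weights(&self) -> &[u8];
    fn label(&self, index: usize) -> Option<Self::ContentType>;
}

/// Static dispatch: a model shipped in the source, gated by its own cargo
/// feature so users who don't need it don't pay for it in code size.
#[cfg(feature = "model-standard-v2")]
pub mod standard_v2 {
    use super::Model;

    /// Hypothetical label set, known at compile time.
    #[derive(Clone, Copy, Debug)]
    pub enum Label {
        Jpeg,
        Pdf,
        Elf,
    }

    pub struct StandardV2;

    impl Model for StandardV2 {
        type ContentType = Label;
        fn weights(&self) -> &[u8] {
            include_bytes!("standard_v2.onnx")
        }
        fn label(&self, index: usize) -> Option<Label> {
            [Label::Jpeg, Label::Pdf, Label::Elf].get(index).copied()
        }
    }
}

/// Dynamic dispatch: a model loaded from disk at runtime; it does not need
/// to be shipped with the library. Also feature-gated for code size.
#[cfg(feature = "dynamic-models")]
pub struct DynamicModel {
    pub weights: Vec<u8>,
    pub labels: Vec<String>,
}

#[cfg(feature = "dynamic-models")]
impl Model for DynamicModel {
    /// The label set is only known at runtime here.
    type ContentType = String;
    fn weights(&self) -> &[u8] {
        &self.weights
    }
    fn label(&self, index: usize) -> Option<String> {
        self.labels.get(index).cloned()
    }
}
```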
