-
Notifications
You must be signed in to change notification settings - Fork 433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Switching to the Fast Model and Performance Considerations #849
Comments
/cc @ia0 Here are some answers:
Hope this helps! |
Thank you for your earlier response! I conducted a benchmark using a
Thanks again for your insights. |
Hello, thanks for these tests and for your notes! Some answers/comments: About 1: yes, the fast model should be ~4x than standard one. But the inference times you are getting are rather high! On my machine, I get ~6.2ms for the standard_v2_1 model. About 2: interesting. I didn't know about this potential performance boost based on the ort version. @ia0 can we look into this? which version are we using right now? About 3: correct, fast is faster but with lower accuracy. That being said, I just merged a new |
About 2: This is most probably "Tensor We currently use rc9 since #821 (November 2024) but this is not published yet. The latest published version uses rc8. I would suggest publishing after each change, we don't modify the code so often that it makes sense to wait for multiple changes. This will also make it easier for publishing Python since there's no need to wait to publish Rust first, since it's always in a published state. |
We just released a new -rc version, which ships the latest rust client, the last model (which should be significantly faster), and a also a pure python wheel to install magika on non-super-common platforms. https://pypi.org/project/magika/0.6.1rc0/ You should be able to install and test with Please let us know if you run into any issues, thanks! |
How can I enable the "fast model"? Does it prioritize speed over accuracy, and what impact might this have on tasks like file type identification?
https://github.com/google/magika/tree/main/assets/models/fast_v2_1
Is there a way to control or restrict the file types supported by the model? If so, would narrowing down the list of supported file types improve overall performance?
From a performance standpoint, can the model achieve file type identification in less than 5ms per file buffer?
Thank you
The text was updated successfully, but these errors were encountered: