Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: use webgpu for FlowerCaptioner #135

Draft
wants to merge 22 commits into
base: master
Choose a base branch
from
Draft

Conversation

cristianglezm
Copy link
Owner

@cristianglezm cristianglezm commented Oct 19, 2024

TODO

  • cache manager (to delete models files)
  • make model options component for settings
  • refactor settings - split into components
    • AppOptions
    • CreationParams
    • mutationsRates
    • AppActions
    • make new layout for settings
  • change xenova/transformers to huggingface/transformers
  • test all models
    • fp32 works on CPU(it takes ~13s), GPU (it takes ~1m30s)
    • fp16 gives exception
    • q8 works on CPU (Firefox, Chrome, Edge)
    • q8 gives garbled output on Chrome and Edge (GPU)
    • int8 gives error on session creation (Can't create a session. ERROR_CODE: 9, ERROR_MESSAGE: Could not find an implementation for ConvInteger(10) node with name '/embeddings/patch_embeddings/projection/Conv_quant')
    • uint8 works on CPU
    • q4 works on CPU (it takes ~24s)
    • q4f16 gives exception.
    • bnb4 - works on CPU (it takes ~24s)
  • fix not working on Chrome or Edge GPU (Garbled descriptions)
  • Firefox webGPU not supported yet. (wait for it)

@cristianglezm cristianglezm self-assigned this Oct 19, 2024
@cristianglezm cristianglezm added enhancement New feature or request javascript Pull requests that update Javascript code labels Oct 19, 2024
@cristianglezm cristianglezm force-pushed the hf-transformers-v3 branch 2 times, most recently from 86b532a to 79f3f59 Compare November 15, 2024 16:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request javascript Pull requests that update Javascript code
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant