You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Experimental support of AVX512 in manually vectorized functions: this code path is not enabled by default but can be enabled by setting the environment variable CT2_FORCE_CPU_ISA=AVX512
Add Transformers converter option copy_files to copy any files from the Hugging Face model to the converted model directory
Expose some Whisper parameters:
max_initial_timestamp_index
suppress_blank
suppress_tokens
Fixes and improvements
Reduce conversion time for large models by skipping some weights comparisons
Reduce maximum memory usage when converting Transformers models with --quantization float16
Set FP32 compute type for FP16 convolutions to match the PyTorch behavior and accuracy