make compress_pos_emb float

Originally, compress_pos_emb had a slider gradation of 0.01. However, webui now uses gr.Number instead of gr.Slider to represent the value (to avoid having to repeatedly update max value limit) and the precision was set to 0. Setting precision to 0.01 to return to previous behaviour.
oobabooga · Jul 26, 2024 · 221ec15 · 221ec15
1 parent b80d590
commit 221ec15
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/modules/ui_model_menu.py b/modules/ui_model_menu.py
@@ -105,7 +105,7 @@ def create_ui():
                             with gr.Blocks():
                                 shared.gradio['alpha_value'] = gr.Number(label='alpha_value', value=shared.args.alpha_value, precision=2, info='Positional embeddings alpha factor for NTK RoPE scaling. Recommended values (NTKv1): 1.75 for 1.5x context, 2.5 for 2x context. Use either this or compress_pos_emb, not both.')
                                 shared.gradio['rope_freq_base'] = gr.Number(label='rope_freq_base', value=shared.args.rope_freq_base, precision=0, info='Positional embeddings frequency base for NTK RoPE scaling. Related to alpha_value by rope_freq_base = 10000 * alpha_value ^ (64 / 63). 0 = from model.')
-                                shared.gradio['compress_pos_emb'] = gr.Number(label='compress_pos_emb', value=shared.args.compress_pos_emb, precision=0, info='Positional embeddings compression factor. Should be set to (context length) / (model\'s original context length). Equal to 1/rope_freq_scale.')
+                                shared.gradio['compress_pos_emb'] = gr.Number(label='compress_pos_emb', value=shared.args.compress_pos_emb, precision=2, info='Positional embeddings compression factor. Should be set to (context length) / (model\'s original context length). Equal to 1/rope_freq_scale.')
 
                             shared.gradio['autogptq_info'] = gr.Markdown('ExLlamav2_HF is recommended over AutoGPTQ for models derived from Llama.')