[Bug] Mirostat samplers don't work properly with parallel generation #3537

KerfuffleV2 · 2023-10-08T00:03:46Z

This is because llama_sample_token in common.cpp uses a static for mirostat1 and 2 mu. Because of this, different sequences will affect each other (including ones that were already deleted).

The fix for this doesn't really seem that simple. I don't think it can be done only inside llama_sample_token. I think llama_sample_token is going to have to get changed to take something like a sequence-specific sampler state structure where stuff like that sequence's mu could get stored. Then it would be up to the app to reset mu when appropriate (like the sequence ends and the slot will be reused).

The text was updated successfully, but these errors were encountered:

ggerganov · 2023-10-08T09:13:19Z

Yup, these statics should be removed. If the change for adding a sampling state is too big, we should probably disable mirostat sampling to avoid confusion, until the issue is resolved

KerfuffleV2 · 2023-10-08T10:05:45Z

I submitted #3543, ~~but personally I don't really like that approach. (Pretty simple changes though.)~~

KerfuffleV2 added bug Something isn't working generation quality Quality of model output labels Oct 8, 2023

KerfuffleV2 mentioned this issue Oct 8, 2023

Fix mirostat state when using multiple sequences #3543

Merged

ggerganov closed this as completed in #3543 Oct 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Mirostat samplers don't work properly with parallel generation #3537

[Bug] Mirostat samplers don't work properly with parallel generation #3537

KerfuffleV2 commented Oct 8, 2023

ggerganov commented Oct 8, 2023

KerfuffleV2 commented Oct 8, 2023 •

edited

Loading

[Bug] Mirostat samplers don't work properly with parallel generation #3537

[Bug] Mirostat samplers don't work properly with parallel generation #3537

Comments

KerfuffleV2 commented Oct 8, 2023

ggerganov commented Oct 8, 2023

KerfuffleV2 commented Oct 8, 2023 • edited Loading

KerfuffleV2 commented Oct 8, 2023 •

edited

Loading