
Never exit the main loop in interactive mode. #297

Closed · wants to merge 6 commits

Conversation

tjohnman (Contributor)

If the end-of-stream token is found while in interactive mode, ask for user input instead of exiting the main loop.

In case of running out of token budget, reset it and ask for user input.

With these changes, embd can end up empty and cause a crash on the next iteration of the loop, so we check its size as well.
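
Roughly, the handling in the main.cpp generation loop ends up looking like this (a sketch rather than the exact diff; embd, remaining_tokens, is_interacting and params are the existing main-loop variables, and token id 2 is the end-of-text token):

    if (params.interactive) {
        // End-of-text: print a marker and hand control back to the user
        // instead of breaking out of the main loop.
        if (embd.size() && embd.back() == 2) {   // embd.size() guards the empty-vector case
            fprintf(stderr, " [end of text]\n");
            is_interacting = true;
        }
        // Token budget exhausted: refill it and ask for user input instead of exiting.
        if (remaining_tokens <= 0) {
            remaining_tokens = params.n_predict;
            is_interacting = true;
        }
    } else if (embd.size() && embd.back() == 2) {
        // Non-interactive mode keeps the original behavior and stops generating.
        fprintf(stderr, " [end of text]\n");
        break;
    }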

Johnman added 6 commits March 19, 2023 17:10
If the end of stream token mark is found, when in interactive mode, ask
for user input instead of exiting the main loop.

In case of running out of token budget, reset it and ask for user input.

With these changes, embd can end up empty and cause a crash in the next
iteration of the loop, so we check for its size as well.
rabidcopy (Contributor) commented Mar 19, 2023

So if I'm understanding this right, this PR handles both the end-of-text token AND the exit that occurs when you run out of tokens/context? Where exactly does that leave the "memory" afterwards? Does it reset and behave as if you had just started with your initial prompt again, with no user input/response history? Sorry if this is obvious. Edit: Very useful nonetheless. Seems to work. Beats having to reload the model and have the initial prompt parsed again.

tjohnman (Contributor, Author)

> So if I'm understanding this right, this PR handles both the end-of-text token AND the exit that occurs when you run out of tokens/context? Where exactly does that leave the "memory" afterwards? Does it reset and behave as if you had just started with your initial prompt again, with no user input/response history? Sorry if this is obvious. Edit: Very useful nonetheless. Seems to work. Beats having to reload the model and have the initial prompt parsed again.

It shouldn’t affect the memory. It just resets the counter that holds how many tokens it can generate before reaching the maximum specified in the parameters.
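
Concretely, the only thing touched when the budget runs out is the counter itself (a sketch using the main.cpp variable names):

    remaining_tokens = params.n_predict; // refill the generation budget
    // last_n_tokens, n_past and the rest of the context are left as they are,
    // so the conversation history is preserved.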

However, I made a mess with the commits. Perhaps this pull request should be rejected and redone properly. I don’t know if the mess can be fixed manually now.

I have little experience with pull requests, sorry.

tjohnman (Contributor, Author)

Yep. I messed up and included changes from other stuff I was working on.

tjohnman closed this Mar 19, 2023
rabidcopy (Contributor) commented Mar 19, 2023

This is a bit sloppy and hacked together, and the embd.back() = 13; last_n_tokens.back() = 13; part may be unnecessary, or even harmful. (For reference: token id 2 is the end-of-text token and 13 is the newline token in the LLaMA vocab.) Mostly just a personal tweak to ignore end-of-text tokens and running out of tokens while continuing generation. Posting for myself in the future.

        if (params.interactive) {
            if (embd.size() && embd.back() == 2) {
                fprintf(stderr, " [end of text]\n");
//                is_interacting = true;
//                embd.back() = 13;
//                last_n_tokens.back() = 13;
            }
            if (remaining_tokens == 0) {
                fprintf(stderr, " [0 tokens remaining]\n");
                remaining_tokens = params.n_predict;
//                is_interacting = true;
//                embd.back() = 13;
//                last_n_tokens.back() = 13;
            }
        } else {
            // end of text token
            if (embd.size() && embd.back() == 2) {
                fprintf(stderr, " [end of text]\n");
                break;
            }
        }

tjohnman (Contributor, Author)

Made a proper pull request with just the necessary changes.
#298

tjohnman deleted the eternal-interactive-mode branch March 20, 2023 15:36
Deadsg pushed a commit to Deadsg/llama.cpp that referenced this pull request Dec 19, 2023