
Bug: --chat-template seems to be broken now, no way to truly chat from the llama-cli #8053

Closed
Deputation opened this issue Jun 21, 2024 · 3 comments · Fixed by #8068
Labels: bug-unconfirmed, low severity (used to report low severity bugs in llama.cpp, e.g. cosmetic issues, non-critical UI glitches)

Comments

@Deputation

What happened?

As per discussions:

#7837
#8009

It seems to be impossible to chat with llama3 8b properly. I have not tested this on 70b models, but even in the server UI the model just starts making notes to itself and outputs garbage / training data about how it should converse instead of actually conversing. Has something happened to the --chat-template chatml parameter? Even when the CLI is set to output special tokens, I do not see the ChatML tokens coming out.
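
For context, this is the general ChatML layout that --chat-template chatml is expected to wrap messages in, and whose special tokens the report says are not appearing (the system/user text below is purely illustrative, not output from this run):

```
<|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
Hello!<|im_end|>
<|im_start|>assistant
```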

Name and Version

version: 3158 (5239925)

What operating system are you seeing the problem on?

Linux

Relevant log output

No response

@Deputation added the bug-unconfirmed and low severity labels on Jun 21, 2024

ericonr commented Jun 21, 2024

I'm configuring the prompt as suggested by this comment #6747 (comment), and it's worked pretty well.

dspasyuk (Contributor) commented Jun 21, 2024

@Deputation I agree, I see the same issue on my end with llama3-instruct 8b. I have been told to use the "right" prompt style, but even with the Llama 3 prompt style it gives an OK response maybe once and then just produces random garbage in llama-cli. The same issue occurs with llama-server. Version b3080 works fine; after that, no luck. Also, when the context size is exceeded over multiple questions, the model just stops generating altogether. I am still waiting on @ggerganov and the team for this issue. (Attachment: Screencast from 2024-06-21 11:10:00 AM.webm)

dspasyuk (Contributor) commented Jun 22, 2024

@Deputation try running llama-cli like this:

../llama.cpp/llama-cli --model models/meta-llama-3-8b-instruct_q5_k_s.gguf --n-gpu-layers 35 -cnv  --interactive-first  --simple-io  -b 512 -n -1 --ctx_size 0 --temp 0.3 --top_k 10 --multiline-input  --repeat_penalty 1.12 -t 6 -r "\n>" --log-disable  -p 'Role and Purpose: You are Alice, a large language model. Your purpose is to assist users by providing information, answering questions, and engaging in meaningful conversations based on the data you were trained on.
Behavior and Tone:  Be informative, engaging, and respectful. Maintain a neutral and unbiased tone. Ensure that responses are clear and concise. Capabilities: Use your training data to provide accurate and relevant information. Explain complex concepts in an easy-to-understand manner. Provide sources when referencing specific information or data.
Output Formatting: Use this formatting for code: ```language\n```' 

And then chat like so:

<|im_start|>user
Answer the following questions:

  1. The day before two days after the day before tomorrow is Saturday. What day is it today?
  2. What is the square root of 169?
  3. Solve the equation 3y = 6y + 11 and find y.
  4. There are two ducks in front of a duck, two ducks behind a duck, and a duck in the middle. How many ducks are there?
  5. How many days does it take to travel from New York City to London by plane, assuming non-stop flights and average speeds?
  6. What are the products of the chemical reaction between salicylic acid and acetic anhydride?
  7. If five cats can catch five mice in five minutes, how long will it take one cat to catch one mouse?
  8. Create a JS program that prints the first 100 Fibonacci numbers. <|im_end|>
    <|im_start|>assistant

I do not know why this works, but it does; setting --chat-template to llama3 and using the correct reverse prompt for it does not work as expected in my hands. :(
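
For reference, a rough sketch of the kind of invocation described above, using the built-in llama3 template instead of a hand-written system prompt (the reverse-prompt string and the other flag values here are assumptions carried over from the earlier command, not a confirmed working setup):

```sh
# Sketch only: model path, GPU layer count, and the -r reverse prompt are assumptions.
../llama.cpp/llama-cli \
    --model models/meta-llama-3-8b-instruct_q5_k_s.gguf \
    --n-gpu-layers 35 -cnv --interactive-first \
    --chat-template llama3 \
    -r '<|eot_id|>'
```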

You can try this out with the llama.cui test branch: https://github.com/dspasyuk/llama.cui/tree/test
