Chat format: Recognize specified language and offloaded lexguessing to every newline #81
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
#71 (comment)
#71 (comment)
I found some time :) Now it's detecting the specified language (I like to finetune my models to always do that) and if not, it only lexguesses each newline.
The PR is nothing too big, but I thought this might help nonetheless.
Here is the prompt I found most effective for Llama2-7B-chat 4.0bpw to specify the language:
-sp "You are a helpful coding assistant. Always answer as helpfully as possible. Specify the language after starting a codeblock like: ```python\nprint('hello')\n"
#71 (comment)
You said you wanted to improve the lexguesser, but still, if I can somehow help or if you want to spend your time on other problems, please let me know and I'll try to take care of it.