
Async generation not implemented for this LLM. #2

Open
akashAD98 opened this issue Oct 18, 2023 · 1 comment

Comments

@akashAD98

I tried Mistral and Llama-2-7B via CTransformers and hit this issue. Is there any way to add support for this?
How can we implement it with a WebSocket?

    # Mistral 7B (GGUF) via CTransformers
    streaming_llm = CTransformers(
        model='TheBloke/Mistral-7B-v0.1-GGUF',
        model_file='mistral-7b-v0.1.Q4_K_M.gguf',
        model_type='mistral',
    )

    # Llama 2 7B (GGML) via CTransformers
    streaming_llm = CTransformers(
        model='llama-2-7b.ggmlv3.q5_0.bin',
        model_type='llama',
        config={'max_new_tokens': 128, 'temperature': 0.01},
    )
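Until the LLM wrapper gains native async support, one common workaround is to push the blocking call onto a worker thread with `asyncio.to_thread`, which keeps the event loop (and any WebSocket handlers) responsive. A minimal sketch, assuming nothing about the project's actual code: `blocking_generate` below is a hypothetical stand-in for the synchronous `streaming_llm(prompt)` call.

```python
import asyncio

# Hypothetical stand-in for the synchronous CTransformers call;
# in a real app this would be `streaming_llm(prompt)`.
def blocking_generate(prompt: str) -> str:
    return f"echo: {prompt}"

async def agenerate(prompt: str) -> str:
    # Run the blocking call in a worker thread so the event loop
    # can keep serving other coroutines (e.g. WebSocket handlers).
    return await asyncio.to_thread(blocking_generate, prompt)

async def main() -> None:
    result = await agenerate("hello")
    print(result)  # echo: hello

if __name__ == "__main__":
    asyncio.run(main())
```

This only makes the sync call awaitable; it does not stream tokens, but it is usually enough to stop an "async generation not implemented" error path from blocking the server.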

Can we use TGI (Text Generation Inference) here instead? If we pass its URL, will that be supported?

@janfilips

Mistral 7B rulez

IMHO it's the outdated LangChain version in this project that's causing the issue.

Did you find a workaround or a different solution? I also need to proxy SSE over to a WebSocket in my app.
