
feat: add local model support and examples #147

Closed
wants to merge 4 commits

Conversation

vacekj

@vacekj vacekj commented Dec 10, 2024

This PR adds a client implementation for calling local models, such as Ollama and LM Studio. It also duplicates some of the existing examples (simple chat, embeddings, and tool calls) for the local provider, and adds a collaboration example in which local models work together to find a good prompt for themselves.

I have verified that all of the examples work for me on an M1 Max with the latest Ollama.
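For readers following along, here is a rough sketch of what the duplicated simple-chat example could look like against a local server. The providers::local module path and constructor below are assumptions based on the PR description, not the final API:

use rig::{completion::Prompt, providers};

#[tokio::main]
async fn main() -> Result<(), anyhow::Error> {
    // Hypothetical: the module path `providers::local` and this constructor
    // mirror the PR description; the merged API may differ.
    let client = providers::local::Client::from_url("ollama", "http://localhost:11434/v1");

    // Build a simple agent backed by a locally served model.
    let agent = client
        .agent("llama3.2:latest")
        .preamble("You are a helpful assistant running entirely on local hardware.")
        .build();

    let response = agent.prompt("Say hello from a local model!").await?;
    println!("{}", response);

    Ok(())
}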

@cvauclair
Contributor

Hey @vacekj thank you for your contribution and interest in Rig, this is seriously awesome to see! We'll review this ASAP!

In the meantime, I've enabled CI on this PR so if you could address the couple linting/styling issues that might pop up that would be awesome. Cheers!

@vacekj
Author

vacekj commented Dec 10, 2024

I believe my last commit should address the single lint issue. Please feel free to re-run the CI to see if everything's alright.

@cvauclair cvauclair linked an issue Dec 10, 2024 that may be closed by this pull request
@cvauclair cvauclair requested a review from 0xMochan December 10, 2024 19:33
@cvauclair
Contributor

FYI @vacekj, the currently failing tests are known to fail when a PR comes from an outside contributor, so there's no need to worry about those.

@akashicMarga

@vacekj @cvauclair instead of a single local.rs in providers, could we have ollama.rs, lm_studio.rs, etc.? There are other servers that run locally too, like llama.cpp and LLMEdge.

@vacekj
Author

vacekj commented Dec 11, 2024

@akashicMarga All of the projects you mentioned use the standard OpenAI API. I don't think it's necessary to duplicate the client, as it would be the same interface for LM Studio, Ollama, and LLMEdge.
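As a quick illustration of that point (a sketch, not tested against every server): with an OpenAI-compatible client, switching between these backends should mostly come down to the base URL. The ports below are the usual defaults, and the /v1 suffix is an assumption about where each server exposes its OpenAI-compatible routes:

use rig::providers::openai;

fn main() {
    // Same client type, different base URLs. Ollama's default port is 11434
    // and LM Studio's local server defaults to 1234; adjust for your setup.
    let _ollama = openai::Client::from_url("ollama", "http://localhost:11434/v1");
    let _lm_studio = openai::Client::from_url("lm-studio", "http://localhost:1234/v1");
}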

@akashicMarga

> @akashicMarga All of the projects you mentioned use the standard OpenAI API. I don't think it's necessary to duplicate the client, as it would be the same interface for LM Studio, Ollama, and LLMEdge.

Thought so. I was just thinking about the case where, some day, a local server follows some other format and needs to be integrated. So maybe local_openai_compatible.rs, or something along those lines?

@cvauclair
Contributor

@akashicMarga @vacekj I think it's fine to have a local.rs for local OpenAI-compatible servers. However, if it turns out that, say, using the Ollama base API (and not the OpenAI-compatible one) works better, then we could make a dedicated client for it in ollama.rs.

@cvauclair
Contributor

@vacekj are there differences between your local.rs provider client and the openai.rs one? The latter allows the user to set the URL with from_url, which you could use to point the client to a local server, so perhaps a lot of code can be reused 🤔

@0xMochan
Contributor

> @vacekj are there differences between your local.rs provider client and the openai.rs one? The latter allows the user to set the URL with from_url, which you could use to point the client to a local server, so perhaps a lot of code can be reused 🤔

Yeah, the OpenAI compatibility server from Ollama works fine using just the OpenAI client, which is also what the Ollama documentation recommends when using the official openai PyPI library:

use rig::{completion::Prompt, providers};

#[tokio::main]
async fn main() -> Result<(), anyhow::Error> {
    // Create an Ollama client using the OpenAI compatibility layer
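    // Note (added for clarity): depending on the Rig version and server, the
    // OpenAI-compatible routes may live under a `/v1` prefix, e.g.
    // "http://localhost:11434/v1"; adjust the base URL if requests 404.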
    let client = providers::openai::Client::from_url("ollama", "http://localhost:11434");

    // Create agent with a single context prompt
    let comedian_agent = client
        .agent("llama3.2:latest")
        .preamble("You are a comedian here to entertain the user using humour and jokes.")
        .build();

    // Prompt the agent and print the response
    let response = comedian_agent.prompt("Entertain me!").await?;
    println!("{}", response);

    Ok(())
}

There is also the native Ollama API, which would be worth creating an ollama.rs file for. For this PR to continue, it would make sense to go with a direct Ollama integration so those specific features can be used!

@vacekj
Author

vacekj commented Dec 13, 2024

Great, then I will implement the API Ollama documents here: https://github.com/ollama/ollama/blob/main/docs/api.md#generate-a-chat-completion

That could unlock some cool new capabilities, like an agent pulling a new model and spawning it on demand. Agents spawning other agents, like recruiting for an army hehe.
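For reference, the native endpoint is a plain JSON POST to /api/chat, so a dedicated ollama.rs client mostly needs request/response types around calls like the following. This is a rough sketch using reqwest and serde_json directly rather than Rig's abstractions; the field names follow the linked Ollama docs:

use serde_json::json;

#[tokio::main]
async fn main() -> Result<(), anyhow::Error> {
    // Call Ollama's native chat endpoint (not the OpenAI-compatible one).
    let body = json!({
        "model": "llama3.2:latest",
        "messages": [{ "role": "user", "content": "Entertain me!" }],
        "stream": false
    });

    let response: serde_json::Value = reqwest::Client::new()
        .post("http://localhost:11434/api/chat")
        .json(&body)
        .send()
        .await?
        .json()
        .await?;

    // Non-streaming responses put the reply under message.content.
    println!("{}", response["message"]["content"].as_str().unwrap_or(""));

    Ok(())
}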

@cvauclair
Contributor

@vacekj I'll close this PR for now to reduce clutter, and because we're talking about a whole new client implementation here. We can re-open this one later or create a new one once the new integration is ready for review.

As always, thanks for contributing :)

@cvauclair cvauclair closed this Dec 16, 2024
@0xMochan 0xMochan mentioned this pull request Dec 28, 2024
Development

Successfully merging this pull request may close these issues.

feat: Local Models Support
4 participants