fix: add providerId, maxTokens and topk to ai spec #545

samirtahir91 · 2024-10-23T19:24:59Z

Closes # #535

📑 Description

We need to set maxTokens, providerId and topk for backends like google gemini and googlevertexai, else they fail with related errors.

This PR adds new environment variables for K8SGPT_MAX_TOKENS (default 2048) and K8SGPT_TOP_K (default 50) into the k8sgpt deployment spec.

It optionally adds the env var K8SGPT_PROVIDER_ID to the deployment spec

The new K8SGPT CRD specs introduced for this are:

ai:
  providerId
  maxTokens
  topk

I have tested with vertex ai in GKE with a custom image of k8sgpt based off this PR k8sgpt-ai/k8sgpt#1280 with the example:

apiVersion: core.k8sgpt.ai/v1alpha1
kind: K8sGPT
metadata:
  name: k8sgpt-sample-vertexai
  namespace: k8sgpt-operator-system
spec:
  ai:
    model: gemini-1.5-pro-002
    backend: googlevertexai
    providerId: <my gcp project ID>
    enabled: true
    anonymized: true
    maxTokens: "2048"
    topk: "40"
  version: latest
  repository: samirtahir91076/k8sgpt
  noCache: false

✅ Checks

My pull request adheres to the code style of this project
My code requires changes to the documentation
I have updated the documentation as required
All the tests have passed

ℹ Additional Information

This PR depends on k8sgpt-ai/k8sgpt#1280 for K8SGPT_MAX_TOKENS env var to configure k8sgpt server config.

Signed-off-by: samir-tahir <[email protected]>

samirtahir91 · 2024-11-13T11:06:21Z

@AlexsJones Could you review this please?

samirtahir91 · 2024-11-27T13:22:19Z

@AlexsJones - Can this be reviewed pls?

samirtahir91 requested review from a team as code owners October 23, 2024 19:25

samirtahir91 mentioned this pull request Oct 23, 2024

fix: add maxTokens to serve mode k8sgpt-ai/k8sgpt#1280

Merged

4 tasks

fix: add providerId, maxTokens and topk to ai spec

8811f2b

Signed-off-by: samir-tahir <[email protected]>

samirtahir91 force-pushed the fix/add-provider-and-max-tokens branch from 9610901 to 8811f2b Compare October 23, 2024 20:01

Merge branch 'main' into fix/add-provider-and-max-tokens

e691a58

Merge branch 'main' into fix/add-provider-and-max-tokens

aa513c5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add providerId, maxTokens and topk to ai spec #545

fix: add providerId, maxTokens and topk to ai spec #545

samirtahir91 commented Oct 23, 2024

samirtahir91 commented Nov 13, 2024

samirtahir91 commented Nov 27, 2024

fix: add providerId, maxTokens and topk to ai spec #545

Are you sure you want to change the base?

fix: add providerId, maxTokens and topk to ai spec #545

Conversation

samirtahir91 commented Oct 23, 2024

📑 Description

✅ Checks

ℹ Additional Information

samirtahir91 commented Nov 13, 2024

samirtahir91 commented Nov 27, 2024