evals

Here are 8 public repositories matching this topic...

AgentOps-AI / agentops

Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen.

agent ai openai evaluation-metrics mistral cost-estimation autogen groq agentops llm langchain anthropic evals ollama crewai

Updated Dec 1, 2024
Python

superlinear-ai / raglite

🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite

Updated Dec 1, 2024
Python

AIAnytime / rag-evaluator

A library for evaluating Retrieval-Augmented Generation (RAG) systems (the traditional way).

eval rag evals

Updated Aug 10, 2024
Python

dustalov / evalica

Evalica, your favourite evaluation toolkit

Updated Dec 1, 2024
Python

openlayer-ai / templates

Our curated collection of templates. Use these patterns to set up your AI projects for evaluation with Openlayer.

ai examples evals

Updated Sep 25, 2024
Python

The-Swarm-Corporation / StatisticalModelEvaluator

An implementation of Anthropic's paper and essay "A statistical approach to model evaluations".

ai ml multiagent agents llms evals llm-evals agent-evals multi-agent-eval

Updated Nov 25, 2024
Python

noah-art3mis / crucible

Develop better LLM apps by testing different models and prompts in bulk.

ai llm prompt-engineering evals

Updated Jul 29, 2024
Python

modelmetry / modelmetry-sdk-python

The Modelmetry Python SDK allows developers to easily integrate Modelmetry’s advanced guardrails and monitoring capabilities into their LLM-powered applications.

monitoring openai observability guardrails ai-observability large-language-models llm llmops evals llm-evaluation

Updated Aug 24, 2024
Python
