
My own set of general evaluations to be shared between projects


aengusl/tasks-goose

 
 


Tasks

A ground-up evaluation library that works with any model inference function that accepts tokens and returns logits. It is useful for flexibly evaluating many kinds of interventions on a model.
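To make the "tokens in, logits out" contract concrete, here is a minimal sketch in plain Python. It is not this library's API: `ModelFn`, `next_token_accuracy`, `toy_model`, and `ablated` are hypothetical names invented for illustration, and nested lists stand in for whatever tensor type a real model would return.

```python
from typing import Callable, List, Sequence

# Hypothetical type aliases (assumptions, not taken from the library).
Tokens = List[List[int]]            # batch x sequence of token ids
Logits = List[List[List[float]]]    # batch x sequence x vocabulary
ModelFn = Callable[[Tokens], Logits]

VOCAB = 5  # toy vocabulary size


def next_token_accuracy(model_fn: ModelFn, tokens: Tokens, targets: Sequence[int]) -> float:
    """Fraction of sequences whose argmax logit at the last position equals the target."""
    logits = model_fn(tokens)
    correct = 0
    for seq_logits, target in zip(logits, targets):
        last = seq_logits[-1]
        # Argmax over the vocabulary; ties resolve to the lowest index.
        pred = max(range(len(last)), key=last.__getitem__)
        correct += int(pred == target)
    return correct / len(targets)


def toy_model(tokens: Tokens) -> Logits:
    """Toy stand-in for a model: favours (token + 1) % VOCAB at every position."""
    return [
        [[1.0 if v == (t + 1) % VOCAB else 0.0 for v in range(VOCAB)] for t in seq]
        for seq in tokens
    ]


def ablated(model_fn: ModelFn) -> ModelFn:
    """Hypothetical intervention: wrap a model function and zero out all of its logits."""
    def wrapped(tokens: Tokens) -> Logits:
        return [[[0.0 for _ in pos] for pos in seq] for seq in model_fn(tokens)]
    return wrapped


batch = [[0, 1], [2, 3]]
targets = [2, 4]
baseline_acc = next_token_accuracy(toy_model, batch, targets)
ablated_acc = next_token_accuracy(ablated(toy_model), batch, targets)
```

Because the evaluation only sees a callable, an intervention can be expressed as a wrapper around the original function and scored with the same code as the baseline.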

Design

Task


Languages

  • Jupyter Notebook 84.9%
  • Python 15.1%