Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

Unreleased

Added

Updated code that records the fine-grained perplexity metrics per subdomain to also include perplexity over words, characters, bytes, and also bits per byte
Added option to track avg logit per token type
Added script that uses the tango steps as functions, and bypasses the tango caching mechanism, for simpler execution
minimal example of how to run Paloma from HF hub as well as step to output results in jsonl.gz format
Added OLMES-v0.1 evaluation standard
Add deprecation note and link to OLMES repo

Fixed

Fixed incorrect paths in readme
Fixed model names written to gsheet by run_lm_eval.py
Fixed hf_olmo module and function name that has changed

Changed

Updated default image in tango-in-beaker.yml