Eval pipeline #152

dhall1995 · 2024-02-02T11:42:14Z

dhall1995
Feb 2, 2024

Hi there,

Thanks for this piece of work & the associated codebase. It's fantastic!

I was wondering whether you guys have an internal eval pipeline you use during training? Does this just consist of the evaluation scripts included in trainer.py?

What i'm imagining is similar to the notebook tutorials you guys provide but perhaps configurable and outputting some metrics on some standard tasks. It would, for example, be great to recreate the plots you guys provide in the paper which demonstrate model capability scaling with number of cells used in pre-training. In the future these kind of standardised tasks to evaluate on could help decide which extensions are useful or not.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eval pipeline #152

{{title}}

Replies: 0 comments

Select a reply

Eval pipeline #152

dhall1995 Feb 2, 2024

Replies: 0 comments

dhall1995
Feb 2, 2024