Unicorn-Test

This repository compiles results of the unicorn test for various open-weights models.

This is not a good test. It is not even meant to be a good test.

However: Nobody is gaming it deliberately, because it is too silly. So it might be a better test than alternatives.

Rules

We only care about whether it can draw TikZ.
Consequently, import errors don't count, we give the model a good faith attempt to fix its import problems
We don't care what the model says to us that isn't TikZ code
We always use exactly the prompt "Draw a unicorn in TikZ".
Number of trials is totally variable. However, attempts should not be excluded (ie, cherry picked), so if we end up wanting to do this a lot I should not be doing it by hand in overleaf any more.

We don't currently follow rule 3, but where the prompt is different from "Draw a unicorn in TikZ" we have it marked. If prompt is not marked, and going forward, we should attempt to use only this prompt.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
Unicorn_tests.pdf		Unicorn_tests.pdf
Unicorn_tests.tex		Unicorn_tests.tex

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unicorn-Test

Rules

About

Releases

Packages

Contributors 2

Languages

segyges/Unicorn-Test

Folders and files

Latest commit

History

Repository files navigation

Unicorn-Test

Rules

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages