Evaluate prompts for topic classification #8

Open
7 of 8 tasks
InMatrix opened this issue Sep 8, 2024 · 0 comments

InMatrix (Owner) commented Sep 8, 2024

  • Create an eval prototype with dummy data (tag)
  • Create an evaluation dataset
    • Use the metadata of the 50 papers retrieved from 2024-09-20
    • Update eval config file with the actual topics of interest
    • Create golden responses by manually selecting the papers that match the topics of interest
    • Save the golden results
  • Use the promptfoo eval tool to compute prediction metrics such as recall, precision, and F1
  • Align the prompt in production with the prompt developed through the eval process
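
For reference, the metrics named in the task list can be sanity-checked by hand against the golden results. The snippet below is a minimal standalone sketch, not promptfoo's own implementation: it assumes the golden results and the model's predictions are each a list of paper IDs, and the function name and the IDs are hypothetical.

```python
from typing import Iterable


def classification_metrics(
    predicted: Iterable[str], golden: Iterable[str]
) -> dict[str, float]:
    """Precision, recall, and F1 for a set of selected paper IDs."""
    predicted_set, golden_set = set(predicted), set(golden)
    # Papers the prompt selected that are also in the golden set.
    true_positives = len(predicted_set & golden_set)
    precision = true_positives / len(predicted_set) if predicted_set else 0.0
    recall = true_positives / len(golden_set) if golden_set else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical paper IDs, for illustration only (not real eval data).
golden_ids = ["p01", "p07", "p12", "p30"]
predicted_ids = ["p01", "p07", "p15"]
metrics = classification_metrics(predicted_ids, golden_ids)
print(metrics)
```

Treating the selection as a set keeps the check independent of the order in which papers are returned. If the two prompts (eval and production) are run through the same function, their scores can be compared directly.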