
Regarding evaluation format #19

Open
anandsubramanian123 opened this issue Oct 23, 2023 · 1 comment


@anandsubramanian123

Hi there! I have a question about the evaluation of PMC-LLaMA on the MCQ datasets used in the paper. I am trying to evaluate models such as PMC-LLaMA on other datasets, and I am curious about the format in which each question is presented to the model. Would it be possible to get a sample of how a MedQA/MedMCQA question is formatted before it is passed to the model?

Additionally, could you confirm whether models like MedAlpaca and ChatDoctor were evaluated with questions provided in the same format as PMC-LLaMA?

Thank you

@SuperBruceJia

Please check our code for reference:
https://github.com/vkola-lab/medpodgpt/blob/main/utils/benchmark_utils.py
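For a quick sense of what such MCQ prompt formatting typically looks like, here is a minimal sketch in Python. The field names (`question`, `options`) mirror the MedQA/MedMCQA JSON layout, and the `Question:`/`Answer:` template is an assumption for illustration only; the authoritative prompt construction is in the `benchmark_utils.py` file linked above.

```python
# Hypothetical sketch of MCQ prompt formatting -- NOT the repository's
# actual template; see benchmark_utils.py (linked above) for that.

def format_mcq(sample: dict) -> str:
    """Render one multiple-choice sample as a plain-text prompt."""
    lines = [f"Question: {sample['question']}"]
    # Emit options in label order (A, B, C, ...).
    for label in sorted(sample["options"]):
        lines.append(f"{label}. {sample['options'][label]}")
    # The model is expected to continue the prompt with the option label.
    lines.append("Answer:")
    return "\n".join(lines)

example = {
    "question": "Which vitamin deficiency causes scurvy?",
    "options": {"A": "Vitamin A", "B": "Vitamin B12",
                "C": "Vitamin C", "D": "Vitamin D"},
}
print(format_mcq(example))
```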
