Replies: 1 comment
-
I haven't tried rlhf datasets yet. You may be able to run preprocess with |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Dataset Used: argilla/dpo-mix-7k
!! The fields are dict instead of strings. !!
So I have written a script to tokenize them and then save the tokenized prompts in a dataset, which I am trying to then load and do the training with custom dataset type.
The training is running currently, but I am not sure if this is the correct way to do this. Can anyone confirm?
Beta Was this translation helpful? Give feedback.
All reactions