How to fine-tune downstream tasks #1

GeorgeBGM · 2023-09-12T14:45:49Z

Dear developer,

Was the DNA language model trained using GPT-2 or GPT-3? How can it be fine-tuned for downstream tasks? It seems that the model is not currently available directly through the Hugging Face platform.

Best, Du

doublechenching · 2023-09-14T02:32:52Z

The model is trained using GPT2. The weights are not compatible with Hugging Face, so you need to write a script to convert them, or refer to the training code of nanoGPT.
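A minimal sketch of what such a conversion script might look like, assuming the checkpoint follows nanoGPT's parameter layout (this thread does not confirm the exact format). nanoGPT uses `nn.Linear` layers, while the Hugging Face GPT-2 implementation uses `Conv1D`, so four weight matrices per block need to be transposed. The names `convert_state_dict` and `transpose_lists` are hypothetical helpers introduced for illustration.

```python
# Sketch only: assumes a nanoGPT-style checkpoint; helper names are illustrative.

# Linear weights that Hugging Face GPT-2 stores transposed (it uses Conv1D).
TRANSPOSED_SUFFIXES = (
    "attn.c_attn.weight",
    "attn.c_proj.weight",
    "mlp.c_fc.weight",
    "mlp.c_proj.weight",
)
COMPILE_PREFIX = "_orig_mod."  # prefix added to parameter names by torch.compile


def convert_state_dict(state_dict, transpose):
    """Return a new dict with names and layouts adjusted for Hugging Face GPT-2.

    `transpose` is a callable that transposes one 2-D weight,
    e.g. `lambda w: w.t()` for torch tensors.
    """
    converted = {}
    for name, weight in state_dict.items():
        # Strip the torch.compile prefix if present.
        if name.startswith(COMPILE_PREFIX):
            name = name[len(COMPILE_PREFIX):]
        # Skip the causal-mask buffer; it is not a learned parameter.
        if name.endswith(".attn.bias"):
            continue
        # Transpose the Linear weights that GPT-2's Conv1D expects flipped.
        if name.endswith(TRANSPOSED_SUFFIXES):
            weight = transpose(weight)
        converted[name] = weight
    return converted


def transpose_lists(w):
    """Transpose a 2-D list of lists (stand-in for a tensor in this demo)."""
    return [list(col) for col in zip(*w)]


# Toy demonstration with plain lists instead of real tensors.
demo = {
    "_orig_mod.transformer.wte.weight": [[1, 2], [3, 4]],
    "_orig_mod.transformer.h.0.attn.c_attn.weight": [[1, 2, 3], [4, 5, 6]],
    "_orig_mod.transformer.h.0.attn.bias": [[1, 0], [1, 1]],
}
converted = convert_state_dict(demo, transpose_lists)
```

With a real checkpoint, the state dict would come from `torch.load`, `transpose` would be `lambda w: w.t()`, and the result could be passed to `load_state_dict` on a `GPT2LMHeadModel` built with a matching config. Whether the vocabulary size, bias settings, and layer names of this model line up with that config is an assumption to verify.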

GeorgeBGM · 2023-09-14T04:40:28Z

Dear developer,

Is there any plan to release it on the Hugging Face platform in the future? I would like to fine-tune it for sequence classification tasks. Are there any suggestions? At the moment, it looks like it can only be used directly for sequence classification.

Best, Du

doublechenching · 2023-09-14T07:59:27Z

Yes, we are preparing to release the model on Hugging Face now; it will take about two or three weeks. You can also use the current code, which supports a variety of tasks such as sequence classification, regression, and generation. Just comment out the task branches that are not needed.

GeorgeBGM · 2023-09-14T08:46:50Z

Well, sounds good.
