自定义多轮对话数据集，只学习最后一轮对话 #5165

cat-knight · 2024-08-13T09:16:20Z

Reminder

I have read the README and searched the existing issues.

System Info

略

Reproduction

略

Expected behavior

如题，现在 SFT 的自定义数据集会学习历史对话。那如何只学习最后一轮对话呢。

Others

No response

YeQiuO · 2024-08-13T10:48:34Z

去看看这两个PR #4878 #5115

具体来说，在 sft 下使用 mask_history 参数

Syno8 · 2024-08-23T07:20:20Z

@YeQiuO hi,我发现我使用这个参数以后，我在验证集上的损失，是逐渐上升的

YeQiuO · 2024-08-23T07:23:57Z

@YeQiuO hi,我发现我使用这个参数以后，我在验证集上的损失，是逐渐上升的

训练loss降低，验证loss升高，那就是过拟合了呗，优化下数据吧

Syno8 · 2024-08-23T07:25:13Z

原因是什么呢？
是加了这个参数，所需要的数据量要求变多了嘛？
特意做了一下消融实验，觉得奇怪🤔

YeQiuO · 2024-08-23T07:31:29Z

原因是什么呢？是加了这个参数，所需要的数据量要求变多了嘛？特意做了一下消融实验，觉得奇怪🤔

这个参数就是把之前的对话历史全部mask，只训练最后一轮对话呀
如果你的数据不是类似思维链，需要推理的。那么可以认为你只训练了模型针对长历史对话训练的能力，而验证集的对话历史又比较短的话，就会出现loss上升吧。

总而言之，这个参数可以认为模型只训练了最后一轮对话

Syno8 · 2024-08-23T08:02:30Z

不是这个原因，我验证集是从训练集分出来的一部分，除非是验证集是算上历史的损失；

查了一下代码，初步判断是这个原因：eval的时候，没有生效这个mask history参数

YeQiuO · 2024-08-23T08:13:18Z

eval的时候确实不需要mask_history吧？lf的eval本来就只会测试最后一轮对话

…

---- 回复的原邮件 ---- | 发件人 | ***@***.***> | | 日期 | 2024年08月23日 16:02 | | 收件人 | ***@***.***> | | 抄送至 | Richard ***@***.***>***@***.***> | | 主题 | Re: [hiyouga/LLaMA-Factory] 自定义多轮对话数据集，只学习最后一轮对话 (Issue #5165) | 不是这个原因，我验证集是从训练集分出来的一部分，除非是验证集是算上历史的损失；查了一下代码，初步判断是这个原因：eval的时候，没有生效这个mask history参数 — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: ***@***.***>

Syno8 · 2024-08-26T02:23:54Z

如果训练集需要加，那验证集也要加的，否则就会推理和训练过程不一致的；

而且我加了之后，训练eval loss 正常了

franklyd · 2024-10-14T07:53:36Z

Hi @Syno8 想请教一下，验证集如何加入这个mask history参数呢？
需要改代码还是加入某一个参数？感谢！

github-actions bot added the pending This problem is yet to be addressed label Aug 13, 2024

codemayq added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Aug 15, 2024

codemayq closed this as completed Aug 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

自定义多轮对话数据集，只学习最后一轮对话 #5165

自定义多轮对话数据集，只学习最后一轮对话 #5165

cat-knight commented Aug 13, 2024

YeQiuO commented Aug 13, 2024 •

edited

Loading

Syno8 commented Aug 23, 2024

YeQiuO commented Aug 23, 2024

Syno8 commented Aug 23, 2024 •

edited

Loading

YeQiuO commented Aug 23, 2024

Syno8 commented Aug 23, 2024

YeQiuO commented Aug 23, 2024 via email

Syno8 commented Aug 26, 2024

franklyd commented Oct 14, 2024

自定义多轮对话数据集，只学习最后一轮对话 #5165

自定义多轮对话数据集，只学习最后一轮对话 #5165

Comments

cat-knight commented Aug 13, 2024

Reminder

System Info

Reproduction

Expected behavior

Others

YeQiuO commented Aug 13, 2024 • edited Loading

Syno8 commented Aug 23, 2024

YeQiuO commented Aug 23, 2024

Syno8 commented Aug 23, 2024 • edited Loading

YeQiuO commented Aug 23, 2024

Syno8 commented Aug 23, 2024

YeQiuO commented Aug 23, 2024 via email

Syno8 commented Aug 26, 2024

franklyd commented Oct 14, 2024

YeQiuO commented Aug 13, 2024 •

edited

Loading

Syno8 commented Aug 23, 2024 •

edited

Loading