Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

大佬您好,训练完以后,当进行预测时,我们需要提供什么样的数据格式呢,执行代码有参考吗? #18

Open
studylyc opened this issue Oct 18, 2023 · 3 comments

Comments

@studylyc
Copy link

大佬您好,训练完以后,当进行预测时,我们需要提供什么样的数据格式呢,执行代码有参考吗?

@ysf-gd
Copy link

ysf-gd commented Mar 5, 2024

我也有相同的困惑,请问拟解决了吗?

@flust
Copy link
Collaborator

flust commented Mar 5, 2024 via email

@ysf-gd
Copy link

ysf-gd commented Mar 21, 2024

想问一下:假如我用3000条匹配数据(包含匹配成功和匹配不成功的数据),其中有2000个user,60个jd,那么我在预测时是不是只能预测这2000个user和60个jd之间的匹配关系? 我这么说是因为我看到模型在predict时需要user和jd的文本向量,而这个文本向量的生成是基于这些文本单词的顺序及次数,在构建模型时如果没有向量化 新的数据的文本向量(除了2000个user60个jd之外的),那么是无法进行预测的。还是说我在构建模型时需要把我所有的user和jd都进行向量化,只是输入模型中的是匹配部分的user和jd

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants