[Bug]: DeepSeek-V3 performance on benchmark didn't match the paper #11971

Closed
1 task done
jongjyh opened this issue Jan 12, 2025 · 4 comments · Fixed by #12002
Labels
bug Something isn't working

Comments

@jongjyh

jongjyh commented Jan 12, 2025

Your current environment

The output of `python collect_env.py`
Your output of `python collect_env.py` here

Model Input Dumps

No response

🐛 Describe the bug

Hi guys,

I used vLLM to serve DeepSeek-V3, but the benchmark results did not reproduce those reported in the paper. Specifically, in my case DeepSeek-V3 scored 82 on CEval, compared to 90 in the paper.

Are there any details I missed? I'm willing to share any details of my settings.

@jongjyh added the bug label Jan 12, 2025
@Wen1163204547

I ran into the same problem, so I'm following this thread.
deepseek-ai/DeepSeek-V3#212

@SunflowerAries
Contributor

As far as I know, there are some differences between vLLM and DeepSeek-V3's official inference code.

#12002

@Wen1163204547

> As far as I know, there are some differences between vLLM and DeepSeek-V3's official inference code.
>
> #12002

Thank you for your help. After the modification, it indeed aligns well. Many thanks!

@jongjyh
Author

jongjyh commented Jan 14, 2025

Thanks!!

3 participants