[Bug]: Deepseek-v3 performace on benchmark didn't match with paper #11971

jongjyh · 2025-01-12T12:59:18Z

Your current environment

The output of `python collect_env.py`

Your output of `python collect_env.py` here

Model Input Dumps

No response

🐛 Describe the bug

Hi guys,

I used vllm to serve deepseek-v3 while I found the benchmark didn't reproduce the result on paper. Specifically, in my case deepseek-v3 got 82 on CEval comparing to 90 on paper.

Are there any details I missed? be willing to reveal any detail of my settings.

Before submitting a new issue...

Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

The text was updated successfully, but these errors were encountered:

Wen1163204547 · 2025-01-13T08:56:11Z

遇到相同问题，搭个车
deepseek-ai/DeepSeek-V3#212

SunflowerAries · 2025-01-13T10:17:19Z

As far as I know, there exist some difference between vllm and DeepSeek v3's official inference code.

#12002

Wen1163204547 · 2025-01-13T15:42:08Z

As far as I know, there exist some difference between vllm and DeepSeek v3's official inference code.

#12002

Thank you for your help. After the modification, it indeed aligns well. Many thanks!

jongjyh · 2025-01-14T03:12:06Z

Thanks!!

jongjyh added the bug Something isn't working label Jan 12, 2025

mgoin mentioned this issue Jan 13, 2025

[Bugfix] Fix deepseekv3 gate bias error #12002

Merged

mgoin closed this as completed in #12002 Jan 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: Deepseek-v3 performace on benchmark didn't match with paper #11971

[Bug]: Deepseek-v3 performace on benchmark didn't match with paper #11971

jongjyh commented Jan 12, 2025 •

edited

Loading

Wen1163204547 commented Jan 13, 2025

SunflowerAries commented Jan 13, 2025

Wen1163204547 commented Jan 13, 2025

jongjyh commented Jan 14, 2025

[Bug]: Deepseek-v3 performace on benchmark didn't match with paper #11971

[Bug]: Deepseek-v3 performace on benchmark didn't match with paper #11971

Comments

jongjyh commented Jan 12, 2025 • edited Loading

Your current environment

Model Input Dumps

🐛 Describe the bug

Before submitting a new issue...

Wen1163204547 commented Jan 13, 2025

SunflowerAries commented Jan 13, 2025

Wen1163204547 commented Jan 13, 2025

jongjyh commented Jan 14, 2025

jongjyh commented Jan 12, 2025 •

edited

Loading