Skip to content

Deploy DeepSeek R1 without <think> output #3449

Answered by pescn
pescn asked this question in Q&A
Discussion options

You must be logged in to vote

oh, thers is something i found!

Addition Infomation

python3 -m sglang.launch_server \
  --model deepseek-ai/DeepSeek-R1 \
  --tp 8 \
  --trust-remote-code \
  --mem-fraction-static 0.95 \
  --port 30000 \
  --disable-cuda-graph \
  --served-model-name deepseek-reasoner \
  --max-running-requests 32 \
  --context-length 131072 \
  --revision f7361cd9ff99396dbf6bd644ad846015e59ed4fc

It works good, can output token, but not everytime will output .

So maybe the issue can be fixed by output /n before generate other tokens?

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by pescn
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant