This open source version performs differently from the official trial version. Are these two different versions?

#12
by ZHUYONGJUN - opened

The same prompt gives inconsistent results between the open-source version and the trial version at https://chat.deepseek.com/. The trial version on the official website performs much better, while the open-source version performs very poorly. Are these two different versions?

ZHUYONGJUN changed discussion title from the original Chinese wording to This open source version performs differently from the official trial version. Are these two different versions?

Reduce the repetition penalty to 1 and the generated code will be much better, closely resembling what the website produces (tested multiple times with Pong and Snake).
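To illustrate why a repetition penalty of 1 can matter for code, here is a minimal sketch of how this kind of penalty is commonly applied to next-token scores. It is not DeepSeek's own code; the function name `apply_repetition_penalty` and the toy logits are hypothetical, and the rule shown (divide positive scores, multiply negative ones) is the usual formulation, assumed here for illustration.

```python
def apply_repetition_penalty(logits, generated_ids, penalty):
    """Return a copy of `logits` with already-generated tokens penalized.

    Illustrative helper (hypothetical name), not taken from any library.
    """
    if penalty <= 0:
        raise ValueError("penalty must be positive")
    adjusted = list(logits)
    for token_id in set(generated_ids):
        score = adjusted[token_id]
        # Positive scores are divided and negative scores multiplied, so a
        # previously seen token always becomes less likely when penalty > 1.
        adjusted[token_id] = score * penalty if score < 0 else score / penalty
    return adjusted


logits = [2.0, -1.0, 0.5]  # made-up scores for a 3-token vocabulary
seen = [0, 1]              # token ids already present in the output

# With penalty 1.0 the scores are unchanged; with 1.2 the seen tokens
# (ids 0 and 1) are pushed down while the unseen token (id 2) is untouched.
print(apply_repetition_penalty(logits, seen, 1.0))
print(apply_repetition_penalty(logits, seen, 1.2))
```

Code legitimately repeats tokens such as indentation, brackets, and variable names, so any penalty above 1 works against it; a value of 1 turns the penalty off. If you run the model through Hugging Face `transformers`, the corresponding setting is the `repetition_penalty` argument to `model.generate`; other front ends may expose it under a different name.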

DeepSeek org

The official website https://chat.deepseek.com/ was updated in January with an SFT-optimized model, so it performs somewhat better than the open-source version. This model might be open-sourced in the future.

luofuli changed discussion status to closed
