https://huggingface.co/deepseek-ai/deepseek-llm-67b-chat
https://huggingface.co/deepseek-ai/deepseek-llm-67b-base
Knowledge cutoff May 2023, not bad.
Online demo: https://chat.deepseek.com/ (Google OAuth login)
Another Chinese model. The online demo is censored by keyword, but the model is not that censored when run locally.
I’m desensitized at this point. I wonder whether this is yet another "Pretraining on the Test Set Is All You Need" marketing stunt, as most new models lately have been.
I threw my reasoning test questions at the web version and it performed worse than most 70B models I have tried, at about the level of Yi.