Thursday, January 30, 2025

Is DeepSeek a Sputnik moment?

It may seem so given the daily, hyper reporting and the recent, sharp drop in value of NASDAQ tech stocks, but I doubt it!

The Communist Party of China certainly achieved a publicity stunt, but more and more news about the Deepseek models seem to suggest that DeepSeek is more hype than substance.

I am currently reading one of several of DeepSeek's published, preprint research papers, i.e. [2501.12948] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning, but I am not very impressed. Stay tuned, I will soon publish my review about this paper.




No comments: