Thread: The A.I. Thread
View Single Post
Old 01-27-2025, 07:40 AM   #522
Firebot
#1 Goaltender
 
Join Date: Jul 2011
Exp:
Default

Some pretty massive news in AI land that has some pretty incredible impact worthy of a bump, especially to American AI giants who have convinced investors that the only way to get to AGI was by throwing insane amounts of cash at the problem while keeping all the research private to profit from.

A small team in China called Deepseek, on a pet project, built an LLM thinking model that rivals o1 from Open AI with Deepseek R1. That in itself is quite the feat to see a Chinese company suddenly at the top of the charts, but what is truly remarkable is they built the model with only 6 million dollars in training cost, AND have released the papers detailing the method AND released the model as an open source model. Everyone can now have their own open source LLM thinking model if they have the computing power for it. They are also doing it on inferior chips due to the ban imposed by the US on Nvidia providing chips to China. And anyone can also use their API at a fraction of cost compared to o1 and o3 models.

https://www.reddit.com/r/LocalLLaMA/...ane/?rdt=33050

This is an absolute game changer, and the market is in full panic mode.

NVDA is down 12% in one blow. NASDAQ down 3% so far already. This may well just crash the AI bubble (for the better IMO) as investors wake up to realize they were sold snake oil by salesmen who just lost their secret.

I tried it for myself very briefly, it built a working version of Tetris in one simple prompt in Python.

https://www.deepseek.com/

If you want to try it out. (yes it's heavily censored if you are asking it for China sensitive political topics, but that should be given).

Last edited by Firebot; 01-27-2025 at 07:52 AM.
Firebot is offline   Reply With Quote
The Following 11 Users Say Thank You to Firebot For This Useful Post: