Calgarypuck Forums - The Unofficial Calgary Flames Fan Community

Go Back   Calgarypuck Forums - The Unofficial Calgary Flames Fan Community > Main Forums > The Off Topic Forum
Register Forum Rules FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools Search this Thread
Old 01-28-2025, 12:35 PM   #541
activeStick
Franchise Player
 
activeStick's Avatar
 
Join Date: Jan 2014
Exp:
Default

Quote:
Originally Posted by Fuzz View Post
There is also no reason the other models can't adopt this technique, and still leverage their extra power and model sizes. Will be interesting to see if that is another step improvement.
Yup.

https://twitter.com/user/status/1883912157768913377

https://twitter.com/user/status/1883912485335629865

https://twitter.com/user/status/1883924181118771519

https://twitter.com/user/status/1883924872499462164
activeStick is offline   Reply With Quote
The Following 7 Users Say Thank You to activeStick For This Useful Post:
Old 01-28-2025, 12:39 PM   #542
Shazam
Franchise Player
 
Shazam's Avatar
 
Join Date: Aug 2005
Location: Memento Mori
Exp:
Default

Quote:
Originally Posted by Fuzz View Post
Thanks for the link. Sounds like it is a bit novel, as they used a different feedback mechanism that doesn't appear to have caused any major issues. This will save a lot of training/money and I'd expect it to get refined further.
I dunno. Next thing we'll find out a bunch of political dissidents were the labour behind the reinforcement learning.
__________________
If you don't pass this sig to ten of your friends, you will become an Oilers fan.
Shazam is offline   Reply With Quote
Old 01-28-2025, 01:08 PM   #543
Fuzz
Franchise Player
 
Fuzz's Avatar
 
Join Date: Mar 2015
Location: Pickle Jar Lake
Exp:
Default

That would obviously be bad, but it sounds like it's all handled with code.




Quote:
  1. Start with a smart normal model, like DeepSeek-V3, and perform the following reinforcement-learning loop
  2. Ask that model to solve a mathematical problem, with a prompt that pushes it to think step-by-step
  3. Verify the answer in code (i.e. not with a model, but by directly parsing the answer and checking it)
  4. If correct, reward the model; if wrong, punish the model
  5. Repeat for a long time
The asking the model part is probably more manual, as they'd need to create a list of problems, though I suspect a lot of this is done and grabbed from elsewhere.
Fuzz is offline   Reply With Quote
Old 01-28-2025, 02:33 PM   #544
indes
First Line Centre
 
indes's Avatar
 
Join Date: Nov 2010
Location: Sherwood Park, AB
Exp:
Default

How come ChatGPT wasn't able to create an AI like this? Doesn't it seem like a black mark to the entire "AI" community if this small group of people with 6.5 million dollars just smoked these corporations with billions of dollars worth of processing power/AI development?
indes is offline   Reply With Quote
Old 01-28-2025, 02:38 PM   #545
Fuzz
Franchise Player
 
Fuzz's Avatar
 
Join Date: Mar 2015
Location: Pickle Jar Lake
Exp:
Default

Because they thought about the problem differently and came up with a different solution. It's how science works, and it's a good validator that money and resources aren't everything.
I hope someone thinks differently and realizes LLM's are kind of a dead end, and finds a better way. You can use enough tricks to make people believe an LLM is thinking(and it may be convincing and good enough that it doesn't' matter), but an LLM will never be developed into a thinking machine.
Fuzz is offline   Reply With Quote
Old 01-28-2025, 03:52 PM   #546
Jason14h
Franchise Player
 
Join Date: Oct 2005
Exp:
Default

Because in reality they probably had the entire force of the Chinese government behind them !

Wait until you read about the amazing things going on in North Korea !

In more semi seriousness - their founder is a genius Quant and used AI trading to get rich , so probably knows about as much about this stuff as anyone on the planet

Last edited by Jason14h; 01-28-2025 at 03:54 PM.
Jason14h is offline   Reply With Quote
Old 01-29-2025, 07:33 AM   #547
Fuzz
Franchise Player
 
Fuzz's Avatar
 
Join Date: Mar 2015
Location: Pickle Jar Lake
Exp:
Default

Quote:
OpenAI says it has found evidence that Chinese artificial intelligence start-up DeepSeek used the US company’s proprietary models to train its own open-source competitor, as concerns grow over a potential breach of intellectual property.

The San Francisco-based ChatGPT maker told the Financial Times it had seen some evidence of “distillation”, which it suspects to be from DeepSeek.

The technique is used by developers to obtain better performance on smaller models by using outputs from larger, more capable ones, allowing them to achieve similar results on specific tasks at a much lower cost.
https://www.ft.com/content/a0dfedd1-...c-1fe01de87ea6

How'd you get that Ferrari, anyway? Doesn't make sense.

Oh, well, you see, I stole it.

China gonna China.
Fuzz is offline   Reply With Quote
Old 01-29-2025, 07:44 AM   #548
GoFlamesGo89
Crash and Bang Winger
 
Join Date: Feb 2009
Location: home away from home
Exp:
Default

Quote:
Originally Posted by Fuzz View Post
https://www.ft.com/content/a0dfedd1-...c-1fe01de87ea6

How'd you get that Ferrari, anyway? Doesn't make sense.

Oh, well, you see, I stole it.

China gonna China.

Take one more step— where does OpenAI’s training data come from?

Scraped from across the internet— corporation stealing from individuals. America going to America.
GoFlamesGo89 is offline   Reply With Quote
Old 01-29-2025, 07:57 AM   #549
Shazam
Franchise Player
 
Shazam's Avatar
 
Join Date: Aug 2005
Location: Memento Mori
Exp:
Default

__________________
If you don't pass this sig to ten of your friends, you will become an Oilers fan.
Shazam is offline   Reply With Quote
The Following User Says Thank You to Shazam For This Useful Post:
Old 01-29-2025, 07:58 AM   #550
Shazam
Franchise Player
 
Shazam's Avatar
 
Join Date: Aug 2005
Location: Memento Mori
Exp:
Default

Quote:
Originally Posted by GoFlamesGo89 View Post
Take one more step— where does OpenAI’s training data come from?

Scraped from across the internet— corporation stealing from individuals. America going to America.
I mean search engines wouldn't work otherwise either.
__________________
If you don't pass this sig to ten of your friends, you will become an Oilers fan.
Shazam is offline   Reply With Quote
Old 01-29-2025, 08:14 AM   #551
OldDutch
#1 Goaltender
 
OldDutch's Avatar
 
Join Date: Oct 2009
Location: North of the River, South of the Bluff
Exp:
Default

Quote:
Originally Posted by Fuzz View Post
https://www.ft.com/content/a0dfedd1-...c-1fe01de87ea6

How'd you get that Ferrari, anyway? Doesn't make sense.

Oh, well, you see, I stole it.

China gonna China.
Super wild and surprising I guess. CCP funded hackers stole the data, passed it to deep seek. Voila 6M AI.
OldDutch is offline   Reply With Quote
Old 01-29-2025, 08:16 AM   #552
Fuzz
Franchise Player
 
Fuzz's Avatar
 
Join Date: Mar 2015
Location: Pickle Jar Lake
Exp:
Default

They didn't steal the data, they used the OpenAI model to train their own.
Fuzz is offline   Reply With Quote
Old 01-29-2025, 08:25 AM   #553
Shazam
Franchise Player
 
Shazam's Avatar
 
Join Date: Aug 2005
Location: Memento Mori
Exp:
Default

Now I am going to train my AI using Deepseek.
__________________
If you don't pass this sig to ten of your friends, you will become an Oilers fan.
Shazam is offline   Reply With Quote
Old 01-29-2025, 08:31 AM   #554
GoFlamesGo89
Crash and Bang Winger
 
Join Date: Feb 2009
Location: home away from home
Exp:
Default

Quote:
Originally Posted by Shazam View Post
I mean search engines wouldn't work otherwise either.

Search engines (by and large) respect copyright and have agreements. The OpenAI training data uses substantial copywrite information.
https://www.reddit.com/r/technews/co...in_generative/


I'm not taking one side over another here-- just that there is no moral high ground.
GoFlamesGo89 is offline   Reply With Quote
Old 01-29-2025, 08:39 AM   #555
Shazam
Franchise Player
 
Shazam's Avatar
 
Join Date: Aug 2005
Location: Memento Mori
Exp:
Default

Quote:
Originally Posted by GoFlamesGo89 View Post
Search engines (by and large) respect copyright and have agreements. The OpenAI training data uses substantial copywrite information.
https://www.reddit.com/r/technews/co...in_generative/


I'm not taking one side over another here-- just that there is no moral high ground.
Search engines make money from copyright content. It's just not done directly.
__________________
If you don't pass this sig to ten of your friends, you will become an Oilers fan.
Shazam is offline   Reply With Quote
Old 01-29-2025, 10:46 AM   #556
Russic
Dances with Wolves
 
Russic's Avatar
 
Join Date: Jun 2006
Location: Section 304
Exp:
Default

Quote:
Originally Posted by indes View Post
How come ChatGPT wasn't able to create an AI like this? Doesn't it seem like a black mark to the entire "AI" community if this small group of people with 6.5 million dollars just smoked these corporations with billions of dollars worth of processing power/AI development?
I find one of the more interesting thought experiments with anything is "what would happen if we had to do this twice as fast with half the resources?" That's basically the position China was put into. Creativity thrives within constraints sometimes, and the big companies in the US didn't have many. They kept gaining billions of dollars, so they didn't need to whoa back and re-evaluate. It's guerilla warfare.
Russic is offline   Reply With Quote
The Following User Says Thank You to Russic For This Useful Post:
Old 01-29-2025, 10:57 AM   #557
Jason14h
Franchise Player
 
Join Date: Oct 2005
Exp:
Default

https://www.reuters.com/technology/a...v3-2025-01-29/

Alibaba now #1
Jason14h is offline   Reply With Quote
Old 01-29-2025, 11:00 AM   #558
Jason14h
Franchise Player
 
Join Date: Oct 2005
Exp:
Default

Quote:
Originally Posted by Russic View Post
I find one of the more interesting thought experiments with anything is "what would happen if we had to do this twice as fast with half the resources?" That's basically the position China was put into. Creativity thrives within constraints sometimes, and the big companies in the US didn't have many. They kept gaining billions of dollars, so they didn't need to whoa back and re-evaluate. It's guerilla warfare.
Its called IP theft. It's easier to steal then create.

China has much more resources then any tech company in the States if need be.....

Anyone believing this was created for 6 million needs to give their head a shake.

They have all but admitted they have 50,000 illegal Nvidia Chips. These chips are $30-40 K each AND China would be buying on the black market.

The chips alone are 2.5 billion if bought at $50K each (Black Market prices?)
Jason14h is offline   Reply With Quote
Old 01-29-2025, 11:05 AM   #559
Shazam
Franchise Player
 
Shazam's Avatar
 
Join Date: Aug 2005
Location: Memento Mori
Exp:
Default

https://arstechnica.com/tech-policy/...ore-robots-txt
__________________
If you don't pass this sig to ten of your friends, you will become an Oilers fan.
Shazam is offline   Reply With Quote
Old 01-29-2025, 11:05 AM   #560
Fuzz
Franchise Player
 
Fuzz's Avatar
 
Join Date: Mar 2015
Location: Pickle Jar Lake
Exp:
Default

Quote:
Originally Posted by Jason14h View Post
That sounds like a self-claim, and they are also claiming to beat Deepseek-V3, not Deepseek R-1. So at best, even their self claim is for #2, not #1.
Fuzz is offline   Reply With Quote
Reply

Tags
they will overtake us


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -6. The time now is 03:51 AM.

Calgary Flames
2024-25




Powered by vBulletin® Version 3.8.4
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright Calgarypuck 2021 | See Our Privacy Policy