Close Menu
timesmoguls.com
  • News
  • Entertainment
  • Politics
  • Business
  • Tech
  • Lifestyle
  • Health
  • Science
  • Sports
Featured

Trump says he approves a new strike “absolutely” if Iran enriches uranium – National

Funding from MEDICAID Planned Parenthood can be reduced by States: Supreme Court – National

Global Newshania Twain: “I have the impression that there is a meeting that I feel with my audience” look at Shania Twain: “I have the impression that there is a meeting that I feel with my audience” online, on globalnews.ca..15 hours

Subscribe to Updates

Get the latest news from timesmoguls.

Facebook X (Twitter) Instagram
  • Home
  • About us
  • Contact us
  • Disclaimer
  • Privacy policy
  • Terms and services
Facebook X (Twitter) Instagram Pinterest
timesmoguls.com
Contact us
HOT TOPICS
  • News
  • Entertainment
  • Politics
  • Business
  • Tech
  • Lifestyle
  • Health
  • Science
  • Sports
timesmoguls.com
You are at:Home»Technology»How Deepseek has torn the AI ​​game book – and why everyone will follow him
Technology

How Deepseek has torn the AI ​​game book – and why everyone will follow him

February 4, 2025003 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Piggy Savings.jpg
Share
Facebook Twitter LinkedIn Pinterest Email

There is more. To use its learning to strengthen as effective as possible, Deepseek has also developed a new algorithm called optimization of the relative group policy (GRPO). He first used GRPO a year ago to build a model called Deepseekmath.

We will jump the details– You just need to know that the learning of strengthening involves calculating a score to determine whether a potential movement is good or bad. Many existing learning strengthening techniques require an entire separate model to carry out this calculation. In the case of models of large languages, this means a second model which could be as expensive to build and execute as the first. Instead of using a second model to predict a score, GRPO simply makes an enlightened supposition. It’s cheap, but always precise enough to work.

A common approach

The use by Deepseek of Reinforcement Learning is the main innovation that the company describes in its article R1. But Deepseek is not the only farm experimenting with this technique. Two weeks before the R1 fall, a team from Microsoft Asia announced a model called RSTAR-Math, which was formed in a similar way. “He also has huge jumps of performance,” explains Matt Zeiler, founder and CEO of the Ai Clarifai firm.

The AI2 tulu was also built using effective strengthening learning techniques (but in addition, not instead, steps led by humans such as supervised fine adjustment and RLHF). And the face of the American society rushing to rush to reproduce R1 with OpenR1, a clone of the Deepseek model according to which the hopes of the face embraced will expose even more of the ingredients of the special R1 sauce.

In addition, it is a secret of Polichinelle that the best companies like Openai, Google Deepmind and Anthropic can already use their own versions of the Deepseek approach to form their new generation of models. “I’m sure they are doing almost exactly the same thing, but they will have their own flavor,” says Zeiler.

But Deepseek has more than one turn in his round. He formed his basic model V3 to do something called Multi-Token prediction, where the model learns to predict a chain of words at a time at a place at a time. This training is cheaper and is also increasing precision. “If you think about how you talk, when you are half a sentence, you know what the rest of the sentence will be,” says Zeiler. “These models should also be able to.”

He also found cheaper ways to create large data sets. To train the model of last year, Deepseekmath, it took a free data set entitled Common Crawl – a large number of documents scratched from the Internet – and used an automated process to extract only documents that included mathematical problems . It was much cheaper than building a new data data set by hand. It was also more effective: the common ramp includes many more mathematics than any other set of specialized mathematics available.

And on the material side, Deepseek has found new ways of old tokens juice, which allows him to form high level models without coughing for the last equipment on the market. Half of their innovation comes from direct engineering, known as Zeiler: “They definitely have very good GPU engineers in this team.”

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleThe State Department dismiss around 60 entrepreneurs working on democracy and human rights
Next Article Musk Starink treats with the Ontario government in a few hours after the threat to tear it away

Related Posts

The Municipal Council considers the security of technological surveillance data security – NBC 7 San Diego

June 28, 2025

The technology of organ fleas precisely predicts the response to chemotherapy in patients with esophageal adenocarcinoma

June 28, 2025

American Airlines apologizes for delays due to “technological problems” – NBC Chicago

June 28, 2025
Add A Comment
Leave A Reply Cancel Reply

We Are Social
  • Facebook
  • Twitter
  • Instagram
  • YouTube
News
  • Business (1,974)
  • Entertainment (2,002)
  • Global News (2,140)
  • Health (1,911)
  • Lifestyle (1,894)
  • Politics (1,766)
  • Science (1,895)
  • Sports (1,933)
  • Technology (1,918)
Latest

Trump says he approves a new strike “absolutely” if Iran enriches uranium – National

The Municipal Council considers the security of technological surveillance data security – NBC 7 San Diego

The reunion of Cult-Classic ’80s Movie Stars make fans howl “my childhood”

Featured

Trump says he approves a new strike “absolutely” if Iran enriches uranium – National

The Municipal Council considers the security of technological surveillance data security – NBC 7 San Diego

The reunion of Cult-Classic ’80s Movie Stars make fans howl “my childhood”

We Are Social
  • Facebook
  • Twitter
  • Instagram
  • YouTube
News
  • Business (1,974)
  • Entertainment (2,002)
  • Global News (2,140)
  • Health (1,911)
  • Lifestyle (1,894)
  • Politics (1,766)
  • Science (1,895)
  • Sports (1,933)
  • Technology (1,918)
© 2025 Designed by timesmoguls
  • Home
  • About us
  • Contact us
  • Disclaimer
  • Privacy policy
  • Terms and services

Type above and press Enter to search. Press Esc to cancel.