OpenAI Reinforcement learning

DeepSeek-R1’s bold bet on reinforcement learning: How it outpaced OpenAI at 3% of the cost

DeepSeek-R1’s Monday release has sent shockwaves through the AI community, disrupting assumptions about what’s required to achieve cutting-edge AI performance. This story focuses on exactly how DeepSeek managed this feat,

OpenAI, DeepSeek

· 11h

OpenAI finds DeepSeek used its data to train R1 reasoning model

· 21h · on MSN

DeepSeek used OpenAI’s model to train its competitor using ‘distillation,’ White House AI czar says

· 1dunite

DeepSeek vs. OpenAI: The Battle of Open Reasoning Models

unite2d

DeepSeek-R1: Transforming AI Reasoning with Reinforcement Learning

DeepSeek-R1 is the groundbreaking reasoning model introduced by China-based DeepSeek AI Lab. This model sets a new benchmark in reasoning capabilities for open-source AI. As detailed in the accompanying research paper,

Hosted on MSN16h

China’s DeepSeek Model Outpaces OpenAI—Sam Altman Says OpenAI Data Was Used ‘Unfairly’

DeepSeek, a Chinese AI research lab, has released an advanced AI model which rivals leading models from OpenAI. The DeepSeek-R1 model can perform complicated mathematical reasoning, code generation, and more with fewer resources than its American competitors.

Open-source DeepSeek-R1 uses pure reinforcement learning to match OpenAI o1 — at 95% less cost

The company developed DeepSeek-R1 by using pure reinforcement learning on top of DeepSeek-V3-Base, and matched or beat o1 on some benchmarks.

New Qwen-2.5 Max Open Source AI Beats Deepseek and OpenAI

Qwen-2.5 Max AI model by Alibaba outperforms DeepSeek-v3 and rivals GPT-4. Offering advanced coding, math, and vision-language solutions with

6don MSN

OpenAI’s new Operator AI agent can do things on the web for you

The agent will be available first in the US to subscribers of ChatGPT Pro.

Biometric Companies2d

OpenAI launches new AI agent Operator that can perform tasks independently

The AI agent is powered by Computer-Using Agent (CUA), a model combining GPT-4’s vision capabilities with advanced reasoning through reinforcement learning.

OpenAI debuts AI agent Operator to transform web task automation

AI agents have the potential to transform industries by automating tasks, personalizing interactions, and improving efficiency.

Interesting Engineering on MSN2d

China’s DeepSeek cracks ‘holy grails of AI’ to dethrone US’ Google, Meta, OpenAI

A Chinese startup's efficient AI development method challenges the approaches of US giants like OpenAI, Meta, and Google.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

OpenAI, DeepSeek

Organizations

People

Fields