Chinese Lab Unveils Reasoning AI Model to Compete with OpenAI’s o1

author
By Tanu Chahal

21/11/2024

cover image for the blog

A Chinese research lab has introduced DeepSeek-R1, a reasoning-based AI model designed to rival OpenAI’s o1. Released by DeepSeek, a company funded by quantitative traders, this model represents a significant step in advancing AI’s reasoning capabilities.

Reasoning models like DeepSeek-R1 are distinct from traditional AI systems. They excel at processing questions more thoughtfully, reducing errors by planning and analyzing before delivering answers. Similar to OpenAI’s o1, DeepSeek-R1 performs a sequence of reasoning tasks and may take several seconds to respond, depending on the complexity of the query.

DeepSeek claims that its model matches OpenAI’s o1 in performance on benchmarks like AIME and MATH. AIME uses other AI models to evaluate performance, while MATH assesses problem-solving through word problems. However, DeepSeek-R1 is not without flaws. Observers have noted its struggles with simpler logic games like tic-tac-toe, a challenge that also affects o1.

The model also faces issues with security and censorship. DeepSeek-R1 has been found to bypass safeguards under specific prompts, with users eliciting inappropriate outputs, such as instructions for illegal activities. Additionally, it avoids politically sensitive topics, likely due to strict government regulations in China. AI systems in the region must align with government-defined values and undergo reviews by internet regulators, which may explain DeepSeek-R1’s selective response limitations.

The development of reasoning models like DeepSeek-R1 reflects a broader shift in AI strategies. Traditional scaling laws—where increasing data and computing power consistently enhanced AI models—are now being questioned as improvements slow. This has prompted the exploration of alternative methods, such as test-time compute. This technique gives AI models more processing power during the inference phase, enabling better performance for complex tasks.

DeepSeek plans to make its model open source and release an API, aiming to expand accessibility. The company operates under High-Flyer Capital Management, a Chinese hedge fund that employs AI for trading strategies. DeepSeek’s earlier model, DeepSeek-V2, pushed competitors like ByteDance and Baidu to lower prices or make their models free.

High-Flyer Capital, led by Liang Wenfeng, invests heavily in AI infrastructure, with its latest training cluster boasting 10,000 Nvidia A100 GPUs and costing approximately $138 million. The company’s long-term goal is to create superintelligent AI through DeepSeek’s research.

As reasoning models gain attention, DeepSeek-R1 highlights the ongoing race to develop more sophisticated AI technologies that rely on innovative approaches like test-time compute.