Could Chinese Open-Source AI Be on Top?
The Chinese AI company DeepSeek, based in Hangzhou, has made waves in the tech world with its innovative large language model (LLM), DeepSeek-R1. Launched in January 2025, the model has disrupted the AI landscape by demonstrating exceptional performance at a fraction of the cost of traditional models.
### What is DeepSeek?
DeepSeek-R1, released under an MIT License, is a transformer-based model that primarily relies on reinforcement learning for its development. Unlike many other models, DeepSeek-R1 was trained predominantly via pure reinforcement learning with rewards, minimising human post-training intervention. This approach enabled the model to develop reasoning capabilities more independently.
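To make the idea concrete, "reinforcement learning with rewards" here means scoring whole model outputs with simple programmatic rules rather than human preference labels. The sketch below is illustrative only: the tag names, score weights, and checks are assumptions for demonstration, not DeepSeek's actual reward recipe.

```python
import re

def reward(completion: str, reference_answer: str) -> float:
    """Toy rule-based reward: a format bonus for reasoning inside
    <think> tags, plus an accuracy bonus for a correct final answer.
    (Illustrative values; not DeepSeek's published reward function.)"""
    score = 0.0
    # Format reward: encourage explicit reasoning traces
    if re.search(r"<think>.*?</think>", completion, re.DOTALL):
        score += 0.5
    # Accuracy reward: compare the final answer against a known reference
    match = re.search(r"<answer>(.*?)</answer>", completion, re.DOTALL)
    if match and match.group(1).strip() == reference_answer.strip():
        score += 1.0
    return score

# A well-formatted, correct completion earns the full reward
sample = "<think>2 + 2 equals 4.</think><answer>4</answer>"
print(reward(sample, "4"))  # 1.5
```

Because the reward can be computed automatically for every sampled completion, training can scale without armies of human labellers, which is the point the paragraph above makes.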
The model also incorporates mixture of experts (MoE) layers to significantly reduce training costs while maintaining high performance. DeepSeek offers several versions of R1, including R1-Zero, R1 itself, and distilled smaller models optimised for consumer-grade hardware.
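The intuition behind the MoE savings is that a router activates only a few "expert" sub-networks per token, so most parameters sit idle on any given forward pass. Below is a minimal NumPy sketch of top-k expert routing; the dimensions, routing rule, and expert shapes are illustrative, not DeepSeek's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, top_k = 16, 4, 2

# Each "expert" is a small feed-forward weight matrix; the router scores
# every expert per token and only the top-k experts are evaluated.
experts = [rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
           for _ in range(n_experts)]
router = rng.standard_normal((d_model, n_experts)) / np.sqrt(d_model)

def moe_layer(x: np.ndarray) -> np.ndarray:
    logits = x @ router                            # (tokens, n_experts)
    top = np.argsort(logits, axis=-1)[:, -top_k:]  # chosen expert indices
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        # Softmax over only the selected experts' scores
        w = np.exp(logits[t, top[t]])
        w /= w.sum()
        for weight, e in zip(w, top[t]):
            out[t] += weight * (x[t] @ experts[e])
    return out

tokens = rng.standard_normal((3, d_model))
print(moe_layer(tokens).shape)  # (3, 16)
```

Here only 2 of 4 experts run per token, so roughly half the expert parameters are exercised on each pass; at production scale, with hundreds of experts, that ratio is what drives the compute savings.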
### Significance
DeepSeek's training cost is dramatically lower than competitors'. The base models were reportedly trained for around US$6 million, compared with an estimated US$100 million for OpenAI's GPT-4. DeepSeek also reports using roughly one-tenth the computing power of Meta's comparable LLM, Llama 3.1.
Because its models were trained under export restrictions on AI chips to China, DeepSeek pushed hardware-efficiency boundaries, using less powerful chips in smaller numbers. This breakthrough in cost efficiency and training strategy caused significant disruption in the AI industry, notably hitting Nvidia's stock on fears of reduced demand for high-end AI chips.
### Impact on the AI Industry
DeepSeek-R1's reasoning and coding capabilities match leading models like OpenAI's GPT-4, Google Gemini 2.5, and Anthropic's Claude Opus 4, reportedly tying for first place in real-time coding benchmarks such as the WebDev Arena. Despite far lower training costs, DeepSeek maintains performance comparable to these giants, making it a formidable competitor.
DeepSeek's open-weight policy encourages developer engagement and transparency, contrasting with proprietary approaches. Published weights let developers fine-tune and customise the model for their specific needs, fostering innovation and collaboration within the AI community.
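For example, because the weights are published, a distilled checkpoint can be downloaded and run locally with standard tooling. A minimal sketch using the Hugging Face transformers library follows; it assumes the `transformers` and `torch` packages are installed and the hardware can hold a 7B-parameter model (`deepseek-ai/DeepSeek-R1-Distill-Qwen-7B` is one of the published distilled checkpoints).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# One of the open-weight distilled R1 checkpoints on the Hugging Face Hub
model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")

prompt = "Explain mixture-of-experts in one sentence."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

From here, the same weights can be fine-tuned, quantised, or served behind a custom API, which is the kind of downstream customisation closed-weight models do not permit.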
### Comparison with OpenAI
| Aspect | DeepSeek | OpenAI |
|---|---|---|
| Model basis | Transformer-based, reinforcement-learning focused | Transformer-based, human-supervised fine-tuning plus reinforcement learning |
| Training cost | Approx. US$6 million for base models | Approx. US$100 million for GPT-4 |
| Training strategy | Mostly pure reinforcement learning, minimal human tuning | Extensive human fine-tuning before and after RL |
| Performance (coding) | Comparable to GPT-4 and Google Gemini 2.5 | Industry-leading coding and reasoning ability |
| Openness | Open-weight model with shared parameters | Generally proprietary, limited weight sharing |
| Hardware efficiency | Weaker chips and fewer units due to export restrictions | Powerful AI hardware, high energy and compute consumption |
In conclusion, DeepSeek represents a major shift in AI development by prioritising cost-efficient reinforcement learning with minimal human supervision while matching top models in performance. Its open-weight approach and ability to operate under hardware constraints highlight its innovative edge. This has caused notable disruption in the industry, challenging entrenched players like OpenAI and Nvidia, and signalling a more diversified and competitive AI ecosystem ahead.
DeepSeek is significant not only for its technological innovations but also for reshaping the economic and strategic dimensions of AI development globally. Its benchmark results, with DeepSeek-R1 rivalling OpenAI's o1 in competitive benchmark performance, have reportedly made an impression, and caused some concern, at competitors such as Meta. The broader debate about DeepSeek's impact on the tech community will be left for a later issue.
### Key Takeaways
- The Chinese AI company DeepSeek, with its innovative large language model (LLM) DeepSeek-R1, is disrupting the AI industry by offering performance comparable to competitors such as OpenAI's GPT-4, including similar reasoning and coding capabilities, at a dramatically lower cost.
- DeepSeek's development strategy is unusual in the AI landscape: the transformer-based DeepSeek-R1 was trained predominantly via pure reinforcement learning with rewards, minimising human post-training intervention.
- DeepSeek's impact on hardware manufacturers like Nvidia is already visible: Nvidia's stock fell on fears of reduced demand for high-end AI chips, since DeepSeek's models run efficiently on less powerful chips and fewer units.