Skip to content
DeepSeek Launches V4-Pro: Open-Source Model Outperforms Claude Opus 4.6 and GPT-5.4
AI3 min
10

DeepSeek Launches V4-Pro: Open-Source Model Outperforms Claude Opus 4.6 and GPT-5.4

AnthropicAnthropicSTARTUP

Chinese AI startup DeepSeek released a preview of its V4 model family, with the flagship V4-Pro boasting 1.6 trillion parameters and surpassing leading closed-source models in multiple benchmarks.

📝
CoinJP Editorial
0
CoinJP Editorial · 0 articles

DeepSeek's New Flagship Model Goes Live

On April 24, 2026, Chinese AI startup DeepSeek unveiled a preview of its V4 model family. The flagship DeepSeek-V4-Pro outperformed Claude Opus 4.6 and GPT-5.4 across several benchmarks, establishing itself as the strongest open-source model currently available.

"🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params." — DeepSeek (@deepseek_ai), original post

Why This Matters

An open-source model competing head-to-head with the best proprietary offerings from Anthropic, OpenAI, and Google represents a significant shift in the AI landscape. With V4-Pro's weights and architecture publicly available, researchers and companies worldwide can deploy and fine-tune a top-tier system without relying on proprietary APIs. For the crypto industry — where AI agents are increasingly being integrated into trading strategies and DeFi protocols — this provides a powerful and cost-efficient tool.

Architecture and Scale

V4-Pro contains approximately 1.6 trillion parameters, but only 49 billion are activated during each inference step. The smaller sibling, V4-Flash, has 284 billion total parameters with 13 billion active per step.

Both models use a Mixture of Experts (MoE) architecture, where only the neural sub-networks relevant to the current task are engaged when processing each token. This approach significantly reduces computational costs while maintaining performance comparable to fully dense architectures.

Pre-training was conducted on a corpus exceeding 32 trillion tokens. The team then applied staged fine-tuning, dedicating separate blocks to coding, mathematics, logical reasoning, and instruction-following. The final model unifies these capabilities through distillation.

DeepSeek V4 resource efficiency comparison
Resource efficiency metrics for V4 models when handling long context. Source: Hugging Face

Long Context at a Fraction of the Cost

The defining feature of V4 is its dramatic reduction in the cost of processing long sequences. While a 1 million token context window is available from competitors, using it typically involves substantial costs and latency.

According to DeepSeek, V4-Pro requires only about 27% of the compute and 10% of KV-cache memory compared to V3.2 when working at maximum context length. V4-Flash achieves even greater efficiency — approximately 10% of compute and 7% of memory.

These gains stem from a hybrid attention architecture employing two data compression mechanisms to reduce overhead when processing long texts. The team also utilized specialized hyperconnections for training stability and the Muon optimizer for faster convergence.

Three Reasoning Modes and Agent Capabilities

The V4 models support three distinct reasoning modes:

  • Non-think — instant responses to straightforward queries without additional analysis;
  • Think High — deep analysis for complex tasks and planning;
  • Think Max — full mode where the model details every step and evaluates all possible paths.

For agentic tasks, Think Max now preserves the chain of intermediate steps within a single task. In the previous version, portions of this context were lost during user interactions.

Benchmark Results

According to DeepSeek's published data, V4-Pro delivered competitive performance across a broad range of evaluations:

  • Coding: a Codeforces rating of 3206 — 23rd among human programmers worldwide, on par with GPT-5.4;
  • Mathematics: 95.2 on HMMT 2026 and 89.8 on IMOAnswerBench, ahead of most competitors;
  • Knowledge (SimpleQA Verified): 57.9 (Opus 4.6 scored 46.2, though Gemini 3.1 Pro reached 75.6);
  • Software development (DeepSeek internal test): 67% — between Sonnet 4.5 (47%) and Opus 4.5 (70%);
  • Agentic scenarios: V4-Pro-Max achieved 80.6% on SWE Verified and 67.9% on Terminal Bench.
DeepSeek V4-Pro benchmark results
V4-Pro benchmark results compared to competitors. Source: Hugging Face

The V4 models were specifically trained on practical work scenarios including data analysis, report generation, document editing, and iterative web search with tool use. In an internal survey of 85 DeepSeek developers and researchers, 52% said they were ready to adopt V4-Pro as their primary coding model, while an additional 39% indicated they were leaning toward doing so.

The DeepSeek V4 release came just one day after OpenAI launched GPT-5.5 on April 23, which was positioned as "a new level of intelligence for real work and agent management." The race among leading AI labs continues to intensify.

ai-benchmarksanthropicartificial-intelligencedeepseeklarge-language-modelsopen-source-aiopenai

Frequently Asked Questions

What is DeepSeek V4-Pro?

DeepSeek V4-Pro is the flagship language model from Chinese AI startup DeepSeek, released on April 24, 2026. It features approximately 1.6 trillion total parameters with only 49 billion active per inference step, using a Mixture of Experts architecture.

How does DeepSeek V4 compare to GPT-5.4 and Claude Opus?

According to DeepSeek's benchmarks, V4-Pro achieved parity with GPT-5.4 in coding tasks (Codeforces rating 3206) and outperformed Claude Opus 4.6 in mathematics and knowledge tests. However, it scored below Gemini 3.1 Pro on SimpleQA Verified (57.9 vs. 75.6).

What is the context window of DeepSeek V4?

DeepSeek V4 supports a context window of 1 million tokens. V4-Pro uses only about 27% of compute and 10% of KV-cache memory compared to V3.2 at maximum context, making long-context processing significantly more cost-effective.

Is DeepSeek V4 open source?

Yes, DeepSeek V4 has been released as an open-source model. This makes it the most capable open-source language model available, competing directly with closed-source systems from OpenAI, Anthropic, and Google.

What reasoning modes does DeepSeek V4 support?

DeepSeek V4 offers three reasoning modes: Non-think for quick simple answers, Think High for deep analysis of complex tasks, and Think Max for maximum detail where the model traces every step and evaluates all possible paths.

Read also

AI

OpenAI Secures Record $110 Billion Round at $730 Billion Valuation

OpenAI closed the largest startup funding round in history at $110 billion, backed by Amazon, SoftBank, and Nvidia, with a $730 billion valuation.

4 min·🔥 1
AI

Trump Orders All Federal Agencies to Drop Anthropic Technologies Within Six Months

Federal agencies have 6 months to drop Anthropic's Claude AI amid ethics clashes. See how xAI and Pentagon deals reshape the landscape.

3 min·🔥 1
AI

AI Audit Uncovers Critical Liveness Bug in Ethereum's Nethermind Client

Octane Security's AI discovered a high-severity vulnerability in the Nethermind execution client that could have halted block production for 38% of Ethereum mainnet validators. The Ethereum Foundation awarded a maximum $50,000 bounty.

3 min·🔥 1
Analytics

Weekly Recap: NYT Satoshi Investigation, North Korean Hackers in DeFi, and Anthropic's AI 'Escape'

Bitcoin climbed above $71,000, a NYT journalist named Adam Back as Satoshi Nakamoto, ZachXBT exposed a network of North Korean IT agents in crypto projects, and Anthropic shelved its new AI model after it escaped a sandbox and found thousands of zero-day vulnerabilities.

5 min·🔥 0
Market

Drift Protocol Hacked for $280M, Google Lowers Quantum Threat Estimate — Weekly Recap

Bitcoin held steady at $67,000, North Korean hackers stole $280M from Drift Protocol, Anthropic leaked Claude Code source, and Google drastically reduced quantum attack threshold estimates for crypto.

5 min·🔥 0
AI

Anthropic Weakens AI Safety Commitments Amid Pentagon Ultimatum Over Military Use

Anthropic dropped its core AI safety pledge as the Pentagon set a Feb 27 deadline for unrestricted Claude access. What this means for the industry.

5 min·🔥 1