DeepSeek: DeepSeek V4 Pro

deepseek/deepseek-v4-pro

Released Apr 24, 2026 | 1,048,576 context | $1.74/M input tokens | $3.48/M output tokens
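
To make the listed pricing concrete, here is a minimal sketch of a per-request cost estimate. The prices and context length are taken from the listing. The helper name `estimate_cost` is hypothetical, and the check that input and output tokens together must fit in the context window is an assumption, not something the listing states.

```python
# Listed prices in USD per one million tokens.
INPUT_PRICE_PER_M = 1.74
OUTPUT_PRICE_PER_M = 3.48

# Listed context length in tokens.
CONTEXT_WINDOW = 1_048_576


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: input and output tokens share the context window.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed context window")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # A 200,000-token prompt with a 4,000-token reply.
    print(f"${estimate_cost(200_000, 4_000):.4f}")
```

Under these assumptions, a 200,000-token prompt with a 4,000-token reply comes to roughly $0.36, and a prompt that fills the entire context window costs about $1.82 in input tokens alone.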

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters, of which 49B are activated, and a 1M-token (1,048,576) context window. It is designed for advanced reasoning, coding, and long-horizon agent workflows, with strong performance across knowledge, math, and software engineering benchmarks.

Built on the same architecture as DeepSeek V4 Flash, it introduces a hybrid attention system for efficient long-context processing and supports multiple reasoning modes that trade speed against depth depending on the task. It is well suited to complex workloads such as full-codebase analysis, multi-step automation, and large-scale information synthesis, where both capability and efficiency are critical.
