• AI Search
  • Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Home Artificial Intelligence

DeepSeek Locks in 75% Price Cut on V4 Pro, Undercutting Western AI Models by up to 25x

Paul Balo by Paul Balo
May 29, 2026
in Artificial Intelligence
Share on FacebookShare on Twitter

DeepSeek has made permanent a 75% price cut on its flagship V4 Pro model, in a move that directly targets the cost structures behind today’s largest AI systems. The company’s new pricing tiers put its models far below comparable offerings from major Western labs that are widely used in enterprise production.

According to the company’s published pricing comparisons, DeepSeek V4 Pro now comes in at around seven times cheaper on input tokens and 17 times cheaper on output tokens than models such as Anthropic’s Claude Sonnet and OpenAI’s GPT 5.5-Med. For organisations running large-scale workloads, that gap translates into significantly lower operating costs for similar classes of capability.

DeepSeek is also pushing aggressively at the lower end of the stack. Its V4 Flash model, a lighter, speed-optimised variant  is priced to undercut entry-tier options like Claude Haiku by roughly 10x to 25x. That positions V4 Flash as a budget-conscious choice for use cases that prioritise throughput and latency while still drawing on the same overall model family.

The pricing shifts are not positioned as a temporary promotion but as the output of architectural changes. DeepSeek attributes the cuts to a set of hardware–software optimisations, particularly around cache, that make its models more efficient to run at scale. While the company has not detailed every element of the stack in the provided material, it links the lower per-token prices directly to these efficiency gains.

The cost differential is especially stark when DeepSeek’s models are hosted natively in China. In that configuration, the company’s cache-read pricing is described as being 87 times cheaper than Western cloud offerings. That level of discount effectively sets a new price floor for cached inference in those regions, with implications for anyone running long-context or cache-heavy workloads.

The ripple effects are already visible among Chinese hardware and platform providers. Handset maker Xiaomi has moved to match DeepSeek’s cache-read pricing tier for its newly deployed MiMo architecture, mirroring the same level rather than trying to undercut it further. That indicates at least one major player sees DeepSeek’s pricing as a new reference point for AI infrastructure in its home market.

DeepSeek is pairing its pricing story with benchmark data aimed at showing that V4 Pro is not just cheaper, but competitive on quality. The company’s model card for DeepSeek V4 Pro highlights external evaluations placing it close to Western frontier systems on several technical measures.

On coding-agent tasks, DeepSeek V4 Pro records a score of 80.6% on the SWE-bench Verified leaderboard, a benchmark that tracks performance on software engineering-related challenges. That result is presented as putting the model almost on par with top-tier Western systems that target similar workloads in enterprise development and automation.

For broader reasoning and technical understanding, DeepSeek cites an 87.5 score on the advanced MMLU-Pro technical index, a demanding benchmark used to assess higher-level reasoning across specialised domains. That figure places V4 Pro in what DeepSeek describes as the “elite” range on that test, reinforcing the argument that its pricing does not come at the expense of capability.

Both V4 Pro and V4 Flash belong to the same model family, with V4 Pro aimed at more demanding tasks and V4 Flash tuned for speed. DeepSeek characterises V4 Flash as a hyper-optimised, fast variant intended for deployments where responsiveness and cost-per-call are critical.

The combination of aggressive token pricing, cache-read discounts in China, and benchmarked performance near Western frontier models positions DeepSeek as a cost-focused challenger in the global AI ecosystem. How far that pressure reshapes pricing and infrastructure strategies elsewhere remains to be seen, but the new floor it has set particularly around cached inference is now public and explicit.

Related Posts:

  • chatgpt-logo
    OpenAI Launches GPT-5.4 Mini and Nano Models
  • deepseek-ai-record
    China's DeepSeek Finally Launches A New AI Model
  • alibaba qwen
    Alibaba Expands Qwen Lineup with New Mid-Sized AI Models
  • assets_task_01jryqpar7fd1vr3zjb9wj416t_img_0
    OpenAI Unveils GPT-4.1, Its Flagship AI Model
  • DO3EOFAEMFNYHCIFVH2KMVCOVI
    DeepSeek Update Threatens Google and ChatGPT Dominance
  • GettyImages-2196333417_75e106
    DeepSeek Launches Advanced AI to Rival Google and OpenAI
  • deepseek2-1024x640
    DeepSeek Launches R1 Reasoning Model on Hugging Face
  • Grok-X-deepfakes-Elon-Musk-1024x576
    xAI Rolls out Grok 4.3 and a New Voice Cloning Suite

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Tags: AIdeepseekDeepSeek-V4-Pro
Paul Balo

Paul Balo

Paul Balo is the founder of TechBooky and a highly skilled wireless communications professional with a strong background in cloud computing, offering extensive experience in designing, implementing, and managing wireless communication systems.

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

subscription from
Loading

Freshly Squeezed

  • DeepSeek Locks in 75% Price Cut on V4 Pro, Undercutting Western AI Models by up to 25x May 29, 2026
  • Mistral AI Targets Enterprise with Industrial Push, New Data Center and Assistant Rebrand May 29, 2026
  • Microsoft 365 Copilot Receives Faster Performance Features & Redesigned Look May 29, 2026
  • Anthropic Surges To A $965 Billion Valuation, Overtaking OpenAI May 29, 2026
  • Google Rolls Out Media App Switcher For Android Auto May 29, 2026
  • Bluesky Adopts Long-Form Content To Rival X Articles May 29, 2026
  • Meta Rolls Out Subscriptions For Instagram, Facebook, & WhatsApp, With AI Plans May 28, 2026
  • TELCOs (Airtel & Glo) Resumes Airtime Borrowing To Customers May 27, 2026
  • Over 185,000 Affected By 7-Eleven Data Breach May 26, 2026
  • Kenya’s $21 Million Appeal To Track Social Media May 26, 2026
  • Huawei Reveals New Chip Strategy to Beat US Sanctions and Challenge Nvidia May 25, 2026
  • Pope Leo XIV Urges AI Rules that Protect People, not Concentrate Power May 25, 2026

Browse Archives

May 2026
MTWTFSS
 123
45678910
11121314151617
18192021222324
25262728293031
« Apr    

Quick Links

  • About TechBooky
  • Advertise Here
  • Contact us
  • Submit Article
  • Privacy Policy
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • Artificial Intelligence
  • Gadgets
  • Metaverse
  • Tips
  • AI Search
  • About TechBooky
  • Advertise Here
  • Submit Article
  • Contact us

© 2025 Designed By TechBooky Elite

Discover more from TechBooky

Subscribe now to keep reading and get access to the full archive.

Continue reading

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.