DeepSeek is preparing to launch its next-generation large language model, DeepSeek-R2, sometime between August 15 and August 30, 2025. The launch window came to light during an interaction with DeepSeek’s own AI model, hinting at an imminent rollout just days after the release of ChatGPT-5.

DeepSeek-R2 is expected to make a significant leap in architecture by adopting a more advanced Mixture of Experts (MoE) setup. It will also integrate a smarter gating network to better handle inference-heavy workloads. According to sources familiar with its development, the model could scale up to 1.2 trillion parameters, nearly double that of DeepSeek-R1, which had 671 billion. This will still be lower than ChatGPT-4/5, which has over 1.8 trillion parameters.
China looking for domestic AI self-sufficiency
In a strategic move toward domestic AI self-sufficiency, the model has been entirely trained on Huawei’s Ascend 910B chips. Huawei’s compute cluster, delivering 512 PFLOPS of FP16 performance at 82% utilization, reportedly achieves 91% of the performance of Nvidia’s A100 cluster. Analysts see this as a critical step in reducing China’s reliance on US-made AI hardware.
DeepSeek-R2’s training reportedly cost 97% less than GPT-4, thanks to local hardware and optimization techniques. Analysts expect DeepSeek to offer API access at lower prices, which could shake up current pricing models dominated by OpenAI and Anthropic.
The anticipation around DeepSeek-R2 has already sparked movement in Chinese tech stocks. AI chipmaker Cambricon saw a 20% surge in its share price, pushing its market cap past 355 billion yuan (around $49.7 billion).
In related news, Huawei has rolled out a new AI inference framework called Unified Cache Manager (UCM). Designed to accelerate model inference, UCM optimizes how KV Cache data is handled across memory tiers like HBM, DRAM, and SSDs. In tests with China UnionPay, Huawei reported up to a 90% reduction in latency and a 22x boost in throughput. The company plans to open-source UCM in September.
Together, the launch of DeepSeek-R2 and the introduction of Huawei’s UCM framework signal a major shift in China’s AI ambitions. These developments point to a future where China builds and runs high-performance AI systems without relying on Western chips or software tools.
For more daily updates, please visit our News Section.
Stay ahead in tech! Join our Telegram community and sign up for our daily newsletter of top stories! 💡
(Via)







Comments