DeepSeek R1 Soars to Top 3, Outshining Gemini—An Open-Source Triumph in Coding, Math, and Logic

With its latest update, R1-0528, DeepSeek R1 has climbed into the top three AI models globally, surpassing even Google’s Gemini 2.5 Pro.

DeepSeek R1’s rise rests on stronger reasoning. On the AIME 2025 math test, its accuracy rose from 70% to 87.5%, outpacing Gemini 2.5 Pro and rivaling OpenAI’s o3, according to internal benchmarks.
The model now uses an average of 23,000 tokens per question, nearly double its previous average, which points to longer and deeper chains of reasoning rather than any change in architecture.
In coding, DeepSeek R1 achieved a Codeforces rating of 2,029, outperforming 96.3% of human participants and closing the gap with o3. General reasoning also improved, with scores on GPQA-Diamond, a graduate-level science benchmark, rising from 71.5% to 81.0%, cementing its edge over Gemini.
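
The gains quoted above can be read two ways: as an absolute gain in percentage points, or as an improvement relative to the earlier score. The short sketch below works through both, using only the figures reported in this article; the function name `gain` is illustrative and not part of any DeepSeek tooling.

```python
def gain(before: float, after: float) -> tuple[float, float]:
    """Return (percentage-point gain, relative gain in percent)."""
    points = after - before
    relative = points / before * 100
    return points, relative


# Scores as quoted in this article, in percent (before, after the update).
reported = {
    "AIME 2025": (70.0, 87.5),
    "GPQA-Diamond": (71.5, 81.0),
}

for name, (before, after) in reported.items():
    points, relative = gain(before, after)
    print(f"{name}: +{points:.1f} points ({relative:.1f}% relative)")
```

On these numbers, the AIME 2025 jump is 17.5 points, or a 25% relative improvement, while the GPQA-Diamond gain is 9.5 points, or roughly 13% relative.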

DeepSeek R1’s operating costs are estimated to be 90–96% lower than those of OpenAI’s o3, and its open-source release makes it especially attractive to startups, researchers, and businesses. The model is trained with large-scale reinforcement learning, which enables it to self-correct and reason step by step, and its visible reasoning offers transparency that proprietary models often lack.
The update also added practical features such as JSON output support and cut hallucination rates by 45–50%, improving reliability for real-world applications. From solving advanced calculus to generating production-ready code, DeepSeek R1 is proving its mettle across industries.
A distilled version, DeepSeek-R1-0528-Qwen3-8B, further showcases its efficiency, scoring 86% on AIME 2024 while running on a single GPU such as an Nvidia H100.
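
To put the cost estimate quoted earlier in this section in concrete terms, a claim of "X% lower" can be converted into a multiple: how many times more the pricier model costs for the same work. This is a minimal sketch of that arithmetic only; the helper `cost_multiple` is hypothetical, and the 90 and 96 figures are the article's estimates, not measured prices.

```python
def cost_multiple(percent_lower: float) -> float:
    """How many times more the baseline costs if the alternative is
    `percent_lower` percent cheaper for the same workload."""
    if not 0 <= percent_lower < 100:
        raise ValueError("percent_lower must be in the range [0, 100)")
    return 100 / (100 - percent_lower)


# The article's estimated range: 90-96% lower than the baseline model.
for pct in (90, 96):
    print(f"{pct}% lower -> baseline costs about {cost_multiple(pct):.0f}x as much")
```

In other words, if the estimate holds, the same workload would cost roughly 10 to 25 times more on the baseline model.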

Can open-source models sustain this momentum against the deep pockets of OpenAI and Google?
While DeepSeek R1’s benchmark results are strong, concerns linger about data privacy, given DeepSeek’s Chinese origins and the possibility that it must comply with local laws.
Nevertheless, DeepSeek R1’s blend of power, affordability, and openness positions it as a beacon for global innovation, proving that the future of AI may not belong to the few, but to the many.