Google Launches Gemini 3 Flash, Its Fastest AI Yet

Image: Google
Google is expanding its next-generation AI lineup with the launch of Gemini 3 Flash, a new model designed to deliver frontier-level intelligence at significantly higher speeds and lower costs. Gemini 3 Flash builds on the reasoning and multimodal foundations of Gemini 3 Pro, while emphasizing efficiency and responsiveness across consumer and enterprise use cases.
“Today, we’re expanding the Gemini 3 model family with the release of Gemini 3 Flash, which offers frontier intelligence built for speed at a fraction of the cost,” Google said in its Wednesday announcement. The company added that Gemini 3 Flash combines “Gemini 3’s Pro-grade reasoning with Flash-level latency, efficiency and cost,” positioning it as the most capable option yet for agentic workflows and high-frequency tasks.
Google first launched Gemini 3 in November, rolling out Gemini 3 Pro and a new Deep Think mode focused on advanced reasoning. Adoption has ramped quickly since, with the Gemini 3 series debuting in Canada earlier this month. Google says it is now processing more than one trillion tokens per day via its API, with developers using Gemini to code simulations, design interactive games, and analyze complex multimodal content.
On benchmarks, Gemini 3 Flash punches well above its weight. Google says the model delivers 90.4% on GPQA Diamond, 33.7% on Humanity’s Last Exam without tools, and 81.2% on MMMU Pro, putting it close to Gemini 3 Pro despite running much faster. In coding, Gemini 3 Flash scored 78% on SWE-bench Verified, outperforming not only the 2.5 series but also Gemini 3 Pro in some scenarios.

Image: Google
Speed is where Flash really shines. According to Google, it “outperforms 2.5 Pro while being 3x faster … at a fraction of the cost,” using roughly 30% fewer tokens on average. Pricing reflects that focus on efficiency, with input tokens costing $0.50 per million and output tokens priced at $3 per million.
For everyday users, Gemini 3 Flash is now rolling out globally as the default model in the Gemini app, replacing 2.5 Flash. Google says this gives users “a major upgrade” at no cost, with stronger reasoning across text, images, video, and audio. Gemini 3 Flash is also becoming the default engine behind AI Mode in Search, which expanded to Canada earlier this fall, bringing faster, more nuanced answers that blend real-time information with actionable results.
Developers and enterprises can already access Gemini 3 Flash via Google AI Studio, Vertex AI, Gemini Enterprise, and the new Google Antigravity agentic platform. As Google continues to weave Gemini deeper into products like Translate and Search, Gemini 3 Flash signals a clear push toward making high-end AI faster, cheaper, and more widely available.
Want to see more of our stories on Google?
P.S. Want to keep this site truly independent? Support us by buying us a beer, treating us to a coffee, or shopping through Amazon here. Links in this post are affiliate links, so we earn a tiny commission at no charge to you. Thanks for supporting independent Canadian media!