None
Gemini 3.1 Flash TTS: New text-to-speech AI model
Today, we’re introducing Gemini 3.1 Flash TTS, the latest text-to-speech model that delivers improved controllability, expressivity and quality — empowering…
A new AI model for the agentic era
A note from Google and Alphabet CEO Sundar Pichai: Information is at the core of human progress. It’s why we’ve…
Updates to Veo, Imagen and VideoFX, plus introducing Whisk in Google Labs
While video models often “hallucinate” unwanted details — extra fingers or unexpected objects, for example — Veo 2 produces these…
2.0 Flash, Flash-Lite, Pro Experimental
In December, we kicked off the agentic era by releasing an experimental version of Gemini 2.0 Flash — our highly…
Google’s new open model based on Gemini 2.0
For a deeper dive into the technical details behind these capabilities, as well as a comprehensive overview of our approach…
Our most capable open models to date
At the edge, our E2B and E4B models redefine on-device utility, prioritizing multimodal capabilities, low-latency processing and seamless ecosystem integration…
Our newest Gemini model with thinking
Last updated March 26 Today we’re introducing Gemini 2.5, our most intelligent AI model. Our first 2.5 release is an…
Lyria 3 expands to more Google products, adds more features
Providing new places to generate music High-quality music generation should be accessible wherever creativity happens. Whether you are an app…
Google’s latest AI audio model
Today, we’re advancing Gemini’s real-time dialogue capabilities with Gemini 3.1 Flash Live, our highest-quality audio and voice model yet. It…
How AI can decipher dolphin communication
Sharing DolphinGemma with the research community Recognizing the value of collaboration in scientific discovery, we’re planning to share DolphinGemma as…

