Everything Google Announced at I/O 2025

Discover all major announcements from Google I/O 2025, including Gemini 2.5, Project Astra, AI in Search, Imagen 4, Stitch, and more AI-powered tools.

Everything Google Announced at I/O 2025

Google I/O 2025: The AI-First Era Is No Longer Coming — It's Already Here

There is a particular kind of tech event that leaves you feeling slightly different about the future when it is over. Not because of one big announcement, but because of the sheer weight of everything piled together — the cumulative sense that something has genuinely shifted. Google I/O 2025 was that kind of event.

Held in May at the Shoreline Amphitheatre in Mountain View, California, this year's developer conference was less of a product showcase and more of a declaration. Google CEO Sundar Pichai took the stage and, within the first few minutes, made the company's position crystal clear: artificial intelligence is no longer a feature Google is adding to its products. It is the product. The search engine, the assistant, the creative tools, the development platforms — all of it is being rebuilt, from the ground up, around AI.

What followed was one of the most packed keynotes in Google I/O history. New AI models. A reimagined search experience. A real-time multimodal assistant that can see what you see. Cinematic video generation with native audio. An AI tool that turns a rough sketch into a functioning app interface in minutes. And much more.

This is not a summary of a tech conference. It is a guided walk through the decisions that will quietly reshape how billions of people interact with information, creativity, and technology over the next few years. Let's go through it properly.


Before We Start: The Number That Sets the Scene

Sundar Pichai opened with a milestone that would have seemed unthinkable five years ago: Gemini, Google's family of AI models and apps, now has over 400 million monthly active users. That is not a niche developer tool or a beta product for early adopters. That is a product at scale — growing, expanding, and becoming part of how people actually live and work.

That number matters because it tells you why Google is moving so aggressively. This is not experimentation anymore. This is deployment. And everything announced at I/O 2025 is designed to push that number higher and make every one of those 400 million users more dependent on, and more delighted by, what Google's AI can do.


1. Gemini 2.5 Pro and Deep Think: When AI Actually Stops to Think

Let's start with the engine that powers everything else.

Gemini 2.5 Pro is Google's most capable AI model to date, and at I/O 2025, it got a significant upgrade in the form of a new reasoning mode called Deep Think. The name sounds a little dramatic, but the reality behind it is genuinely impressive — and worth understanding properly.

Most AI models, when you ask them something, generate a response by predicting the most likely next token, step by step, in one continuous forward pass. It is fast, it works well for most things, but it struggles with problems that require genuine multi-step reasoning — complex mathematics, competitive coding challenges, situations where the first instinct is often wrong and the right answer requires considering multiple paths before committing.

Deep Think changes this. According to Demis Hassabis, the head of Google DeepMind, who spoke at I/O, Deep Think uses what he described as "parallel thinking techniques" — essentially allowing the model to consider multiple hypotheses simultaneously before settling on a response. Rather than committing to the first plausible answer, it holds several possibilities open, evaluates them, and selects the most defensible one.

The benchmark results are striking. Gemini 2.5 Pro with Deep Think topped LiveCodeBench, one of the most respected evaluations for competitive-level coding. It outperformed OpenAI's o3 model on MMMU, a test that measures multimodal reasoning — the ability to combine visual and textual understanding. It also achieved an impressive score on the 2025 USAMO, the USA Mathematical Olympiad — currently considered one of the most difficult AI math benchmarks in existence.

Google was transparent about the fact that Deep Think is not yet publicly available. Because it sits at the frontier of model capability, the company is taking extra time on safety evaluations before rolling it out broadly. For now, it is available to trusted testers via the Gemini API, with a wider release expected to follow.

What about Gemini 2.5 Flash? Flash is the speed-optimised sibling in the Gemini family — designed for scenarios where you need fast, efficient responses rather than deep deliberation. Think voice assistants, mobile apps, real-time tools. The updated Flash model received improvements across coding, multimodality, reasoning, and long-context handling, while also becoming more efficient than the version it replaced. It is now available in Google AI Studio and Vertex AI.

Together, Pro and Flash represent two ends of a spectrum: one for depth, one for speed. Google is betting that the future requires both — and that having the best model on each end gives it a decisive edge.


2. AI Mode in Search: The End of the Ten Blue Links?

For 25 years, Google Search looked more or less the same. You typed something, you got a list of links, you clicked through. Variations and improvements happened along the way — featured snippets, knowledge panels, AI Overviews — but the fundamental model of "query → results → click" stayed intact.

AI Mode is Google's most serious challenge to that model yet. And it is worth being honest about what it represents: a bet that users will increasingly prefer a generative, conversational answer to a list of pages to scroll through.

Here is how it works. When you activate AI Mode in Search — currently via a dedicated tab — you are handed off from the traditional results experience to one powered by Gemini 2.5. Instead of links, you get a rich, synthesised response that draws from across the web and presents information in a way that directly addresses what you were actually trying to find out. For complex, multi-part questions — the kind that previously required you to open six tabs and piece together your own answer — this is genuinely useful.

The nuance is in the activation. Google made a deliberate choice to put AI Mode behind an extra tap rather than making it the default. Critics have read this as caution, and they are probably right. Google has too much at stake — both in terms of advertiser relationships and in terms of user trust — to go all-in on generative answers without watching what happens. The existing search experience still works. The AI Mode is additive, not a replacement.

But the direction is unmistakable. AI Overviews alone are now available in over 200 countries and territories. The infrastructure for AI-first search is being built at scale, one feature at a time.

There is also a new capability worth noting: within AI Mode, Google has integrated what it calls Search Live, powered by Project Astra (more on that shortly). It lets users point their phone camera at something in the real world and ask questions about it in real time — getting an AI-powered answer without typing a single word. That is a fundamentally different interaction model, and it suggests where this is all headed.


3. Project Astra: The AI That Sees What You See

If Gemini 2.5 is the brain, Project Astra is the eyes and ears.

First unveiled at Google I/O 2024 through a now-famous smart glasses demo that went viral, Project Astra is Google DeepMind's vision for a universal AI assistant that operates in real time, across multiple sensory inputs simultaneously. At I/O 2025, it moved from a research demo to something much closer to a deployed product.

The core idea is this: most AI assistants, even good ones, operate on text. You type something, they respond. Even voice assistants are essentially text systems with a speech layer on top. Astra is different. It can process live video, live audio, and the surrounding context at the same time — and respond to all of it with very low latency.

The practical applications demonstrated at I/O were revealing. Imagine you are looking at a complex diagram on a whiteboard and ask Astra what it means — it explains it. You point your camera at a broken household appliance and ask what went wrong — it identifies the issue and suggests a fix. You are walking in an unfamiliar city and ask what the building in front of you is — it tells you, with context, without you stopping to type anything.

This is what Google means when it uses the phrase "understanding the world like a person would." Astra does not just respond to prompts — it is continuously aware of its environment and can draw on that awareness to be genuinely helpful in the moment.

At I/O 2025, Astra's capabilities were integrated into several existing products. It now powers Gemini Live, the real-time conversational mode of the Gemini app that supports live video calls with the AI, available in over 45 languages across 150+ countries. It powers the Search Live feature described above. And Google opened Astra's capabilities to third-party developers via the API, which means the ecosystem of Astra-powered apps is just beginning.

The smart glasses angle remains the long-term vision. The demos shown at I/O suggested wearable devices that could bring Astra's real-time multimodal awareness into your field of vision — all the time, seamlessly. Pricing and availability details were not shared. But the direction is clear: Google is building toward a world where your AI assistant is not on your phone, it is in your glasses, and it sees what you see, hears what you hear, and helps you in the moment you need it.


4. Veo 3, Imagen 4, and Flow: When Creativity Meets Scale

This section is for the creators — the filmmakers, the marketers, the storytellers, the educators, the brand designers — because what Google announced here is going to change how visual content gets made.

Imagen 4 is the latest version of Google's image generation model. The improvements over its predecessor are meaningful: sharper detail, better text rendering inside images (historically a weakness of generative models), and higher resolution output up to 2K. It powers image creation across the Gemini app and feeds directly into Flow.

Veo 3 is where things get genuinely exciting. It is Google's most advanced video generation model, and it does something that no competing system currently does: it generates audio natively alongside the video. Not as an afterthought added in post-production, but baked directly into the generation pipeline. Environmental sounds, background noise, dialogue, footsteps, ambient atmosphere — all of it generated together with the visual content.

Think about what that means practically. You describe a scene — a crowded street market in Mumbai at dusk, vendors calling out, rain just starting — and Veo 3 generates not just the visuals but the sound world that goes with them. That is a creative tool of a different order than anything available before it.

Flow brings these capabilities together into a proper creative workflow. Think of it as a cinematic AI tool — not just a generator but an environment where you can create, direct, and assemble scenes. You describe shots, tweak them, string them together, iterate. The comparison Google reaches for is a film director editing a movie: Flow is designed to give that kind of creative control to anyone, regardless of technical background or budget.

Access to Veo 3 and Flow comes via subscription. Google AI Pro, priced at $19.99 per month, includes access to the filmmaking tools. Google AI Ultra, at $249.99 per month, unlocks higher usage limits and the full suite of premium features. The pricing reflects that these tools are initially positioned for serious creators and professionals — but the trajectory is clearly toward broader access over time.

It is also worth noting Google's partnership with Darren Aronofsky's production company Primordial Soup, which is producing three short films using Veo and other DeepMind generative tools. The first, called Ancestra, is directed by filmmaker Eliza McNitt and premiered at the Tribeca Festival in June 2025. Google's message here is deliberate: these tools are for artists, built with artists, not a replacement for human creativity but an expansion of what is possible.


5. Stitch: The Tool That Builds Apps From a Sketch

For developers, founders, and anyone who has ever had a product idea but no design background, Stitch may be the most immediately practical announcement from the entire event.

Stitch is an AI-powered UI design tool that takes natural language instructions, wireframes, rough sketches, or even image prompts — and turns them into polished, responsive front-end code and UI components for both mobile and desktop apps.

The demo shown at I/O was striking in its simplicity. A developer describes what they want — "a settings screen for a fitness app with toggle options for notifications and a dark mode switch" — and Stitch generates not just a mockup but actual usable code. It exports directly to Figma for designers who want to iterate further, or straight to code for developers who want to build immediately.

What makes Stitch genuinely significant is who it is designed for. The traditional app development process has a painful gap between the idea stage and the working prototype stage — a gap that currently requires either expensive designers, lengthy back-and-forth between design and engineering teams, or both. Stitch compresses that gap dramatically. For early-stage startups working with small teams, for solo founders building their first product, for enterprises doing rapid internal tool development, the time savings are real.

Stitch is still in Google Labs — meaning it is available for experimentation but not yet a production-ready tool. But early users have reported being genuinely surprised by how far it can take a rough concept before any manual work is needed. The potential to cut initial development timelines from weeks to hours is not an exaggeration.


6. The Things That Did Not Make the Headlines (But Should Have)

Beyond the five major announcements, Google I/O 2025 was packed with smaller updates that deserve mention because they reveal the breadth of where AI is being applied.

Google Beam — an evolution of the Project Starline 3D video communication technology — brings immersive, depth-aware video calling to enterprise settings. The technology uses AI to create a genuinely three-dimensional representation of the person you are talking to, eliminating the flat, disconnected quality of standard video calls.

Jules, a background coding assistant, connects directly to existing code repositories and handles tasks like writing tests, fixing bugs, and building new features — working asynchronously while you focus on other things.

SynthID, Google DeepMind's tool for watermarking AI-generated content, has already been applied to over 10 billion pieces of content since its launch. A new SynthID Detector portal makes it possible for journalists, researchers, and media professionals to verify whether a given image, video, audio clip, or text document was AI-generated. In a world where the volume of synthetic media is growing exponentially, this is becoming infrastructure, not just a feature.

Gemini in Chrome brings the assistant directly into the browser, enabling it to help with whatever is on your screen without switching context. It reads the page you are on and offers assistance — summarising, translating, extracting information — without you having to copy and paste anything.

LearnLM is a family of educationally optimised models built into Gemini 2.5 that personalise learning experiences based on how individuals engage with content. Google is positioning Gemini as a study tool that adapts to the learner rather than presenting a one-size-fits-all response.


What It All Adds Up To

Stand back from the individual announcements and a coherent picture emerges: Google is not just adding AI to things it already built. It is rebuilding its entire product surface with AI as the foundation.

Search, which has been Google's core business for a quarter century, is being reimagined around generative answers. Its assistant ambitions are being realised through Project Astra's real-time multimodal capabilities. Its creative tools — Imagen, Veo, Flow — are competing directly with a generation of AI-native startups. Its developer platform, through tools like Stitch and Jules, is making it easier to build on Google's infrastructure.

The AI Ultra subscription at $249.99 per month is an interesting signal. Google is betting that a segment of users — creative professionals, developers, researchers, power users — will pay a significant premium for access to the most capable tools. That is a different revenue model from the advertising-supported free tier that built Google's empire, and it suggests the company is watching what OpenAI, Anthropic, and others are building at the premium end of the market.

The 400 million monthly Gemini users are the foundation. Everything announced at I/O 2025 is designed to deepen those relationships, expand the use cases, and make Google's AI not just something people occasionally try but something they cannot imagine working without.


A Final Thought: The Quiet Shift

There is a version of AI announcements that feels exciting in the moment but hollow on reflection — shiny demos that never quite become real products, capabilities that work in the keynote and fail in the wild.

Google I/O 2025 felt different, and the reason is scale. When Gemini has 400 million monthly users, when AI Overviews reach 200 countries, when Project Astra moves from a viral demo into live Search and Gemini Live — the gap between announcement and reality is narrowing. These tools are not hypothetical. They are live, or nearly so.

The AI-first era is not a vision statement anymore. For better or worse, with all the questions about creativity, employment, information trust, and human agency that it raises — it is here. Google I/O 2025 was simply the clearest view we have had yet of what it looks like.