News
Empowering Service Providers and Hardware Partners with Gemini for Home - Google Developers Blog
3+ day, 9+ hour ago (373+ words) At last year's Google I/O, we introduced the Gemini era for Google Home, moving beyond basic device control toward a home that acts proactively on your behalf. Today, we're excited to take the next step: making Gemini for Home…...
Announcing ADK for Kotlin and ADK for Android 0.1.0: Building AI Agents on Android and Beyond - Google Developers Blog
3+ day, 9+ hour ago (342+ words) ADK for Kotlin brings agentic workflows to your backend projects, while ADK for Android provides specialized on-device optimizations. With just a few lines of Kotlin, you can: During our I/O session, we showcased how ADK for Kotlin powers an…...
A Smarter Google AI Edge Gallery: MCP integration, notifications, and session continuity
5+ day, 3+ hour ago (497+ words) Google AI Edge Gallery now supports the Model Context Protocol (MCP), notification reminders, and persistent chat history, providing developers with a showcase to build connected, automated, on-device agentic experiences. This architecture allows the mobile device to coordinate complex tasks across various…...
Blazing fast on-device Gen AI with LiteRT-LM
5+ day, 4+ hour ago (403+ words) One of the most significant performance milestones in the LiteRT-LM pipeline is our native support for the Multi-Token Prediction (MTP) drafters recently launched with the Gemma 4 model family. By integrating this specialized speculative decoding architecture, LiteRT-LM bypasses traditional…...
One Year of Innovation: Celebrating 100k Members in the Google Cloud x NVIDIA Developer Community
5+ day, 4+ hour ago (239+ words) Last year at Google I/O '25, we launched the Google Cloud and NVIDIA developer community to bridge the gap between cutting-edge AI infrastructure and the builders shaping the future. Today, at Google I/O '26, we're celebrating 100,000 members and we want…...
Announcing Genkit Middleware: Intercept, extend, and harden your agentic apps
1+ week, 3+ day ago (480+ words) Genkit solves this with middleware: composable hooks that intercept generation calls, including the tool execution loop, and inject custom behaviors. The middleware system is available today in TypeScript, Go, and Dart, with Python support coming soon. Automatically retries failed…...
Accelerating on-device AI: A look at Arm and Google AI Edge optimization
1+ week, 3+ day ago (764+ words) Arm Scalable Matrix Extension 2 (SME2) eliminates this tradeoff by integrating a dedicated matrix-compute unit directly into the CPU cluster. This architecture enables the CPU to function as a high-performance AI accelerator, delivering up to 5x faster inference for the matrix-heavy workloads at…...
Build Long-running AI agents that pause, resume, and never lose context with ADK - Google Developers Blog
1+ week, 5+ day ago (1285+ words) Most agent tutorials end at a stateless chatbot: a conversational loop that forgets everything the moment the container restarts. Real enterprise workflows don't wrap up in a single API call. HR onboarding spans two weeks. Invoice disputes stall for days…...
Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-style speculative decoding
2+ week, 6+ day ago (706+ words) Here is a technical deep dive from the UCSD researchers detailing how they built this, their performance benchmarks, and what it means for the future of the Google TPU ecosystem. Standard LLM inference generates text autoregressively. This means the model…...
Speeding Up AI: Bringing Google Colossus to PyTorch via GCSFS and Rapid Bucket
3+ week, 4+ day ago (349+ words) Today, we are announcing a major performance boost for AI/ML workloads using the PyTorch ecosystem on Google Cloud. By integrating Rapid Storage, powered by Google's Colossus storage architecture, directly with PyTorch via the industry-standard fsspec interface, we…...