gen‑ai.news
← Back
Image

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster

Google DeepMind has released DiffusionGemma, an open language model that uses diffusion-based generation rather than the token-by-token autoregressive approach found in most large language models today. The result, according to the company, is text generation that runs up to four times faster - a meaningful gap for anyone running models locally on consumer hardware.

Diffusion models work by learning to iteratively refine outputs from noise toward a coherent result. This process is well established in image generation, where models like Stable Diffusion and Flux have made it the standard approach. Applying the same principle to text is more technically involved, partly because language is discrete rather than continuous, but research progress over the past couple of years has made the approach increasingly viable for practical use.

Most existing text diffusion efforts have come from academic labs or smaller startups. DiffusionGemma is notable because it comes from a major lab and ships as an open model, meaning developers can download and run it without API dependencies. The speed advantage is particularly relevant for local inference, where compute constraints make the cost of autoregressive decoding more acute. A 4x throughput improvement can translate directly into more responsive applications or the ability to run larger contexts on the same hardware.

Google DeepMind has been steadily expanding the Gemma family of open models, which have generally tracked closely with the techniques used in its proprietary Gemini line. DiffusionGemma represents a different architectural branch within that family, and it will be worth watching how the model performs on standard benchmarks relative to autoregressive Gemma variants of comparable size. If the quality holds up alongside the speed gains, diffusion-based text generation could start seeing wider adoption beyond the research context where it has mostly lived until now.

Enjoy this story? Get the next one in your inbox.

Twice a week: the most important stories in generative image and video AI, distilled into a 2-minute read.

Free. Unsubscribe any time. No spam, ever.

Your next read

Image

The EU doesn't really know what a deepfake is, and that's becoming a problem for retail

A major European retail trade group is pushing back against the EU AI Act's transparency requirements, arguing that AI-generated product imagery - think a sofa in a computer-generated living room - should not be classified alongside deepfakes. The dispute exposes a genuine ambiguity in the regulation's language that has real consequences for how online retail operates. With platforms like Zalando reporting that 90 percent of their marketing content is already AI-generated, the stakes are signifi

Image

Adobe’s redesigned AI studio remembers what your creations look like

Adobe is rolling out a redesigned Firefly AI studio in private beta, bringing editing and image generation into a single interface. A key addition is the ability to save named visual elements - characters, objects, and backgrounds - so they can be reused consistently across projects without drifting in appearance.

Image

Adobe brings its Firefly AI Assistant inside of Premiere, Photoshop and Illustrator

Adobe has integrated its Firefly AI assistant directly into Premiere Pro, Photoshop, and Illustrator, bringing generative AI tools into the core workflow of its most widely used creative applications. Rather than requiring users to switch between separate tools or platforms, the assistant is now accessible from within each app. The move reflects Adobe's ongoing effort to embed AI capabilities at the point where creative work actually happens.