
Google released DiffusionGemma, an open-weight model under the Apache 2.0 license (google/diffusiongemma-26B-A4B-it). NVIDIA hosts it for free on its NIM cloud API. The model generated 2,409 tokens in 4.4 seconds, roughly 547 tokens per second.
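The throughput figure follows directly from the two reported numbers. A minimal sketch of that arithmetic, where the helper name `tokens_per_second` is hypothetical and chosen only for illustration:

```python
def tokens_per_second(tokens: int, seconds: float) -> float:
    """Average generation throughput over a single run."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return tokens / seconds


# Figures reported above: 2,409 tokens in 4.4 seconds.
rate = tokens_per_second(2409, 4.4)
print(f"{rate:.1f} tokens/s")
```

This is an average over the whole response, so it says nothing about time to first token or about how the rate varies during generation.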