AI · MarkTechPost · about 10 hours ago

Google DeepMind Releases Gemma 4 QAT Checkpoints

7 min read

Google DeepMind has released Quantization-Aware Training (QAT) checkpoints for the Gemma 4 family, targeting edge devices and consumer GPUs. The new mobile QAT schema reduces Gemma 4 E2B to about 1 GB. Compared with standard post-training quantization (PTQ), QAT improves quality at the same model size, so the memory footprint is unchanged. The Q4_0 QAT format brings E2B down to 3.2 GB and E4B to 5 GB.
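To make the size claims more concrete, here is a minimal Python sketch of block-wise 4-bit quantization in the spirit of Q4_0. It is a simplified illustration, not the exact GGUF byte layout and not Google DeepMind's training code. The function names are hypothetical, and the one-billion parameter count in the usage note is a round illustrative number, not a Gemma 4 figure. The quantize-then-dequantize round trip is the kind of operation QAT typically simulates during training so that the weights learn to tolerate rounding.

```python
BLOCK_SIZE = 32  # weights per block; Q4_0 groups weights in blocks of 32


def quantize_block(weights):
    """Symmetric 4-bit quantization of one block.

    Returns (scale, codes) with integer codes in [-8, 7].
    Simplified: real Q4_0 stores unsigned nibbles plus an fp16 scale.
    """
    amax = max(abs(w) for w in weights)
    scale = amax / 7.0 if amax > 0 else 1.0
    codes = [max(-8, min(7, round(w / scale))) for w in weights]
    return scale, codes


def dequantize_block(scale, codes):
    """Map integer codes back to approximate float weights."""
    return [scale * c for c in codes]


def fake_quantize(weights):
    """Quantize then dequantize, block by block.

    Assumption: QAT typically inserts a round trip like this into the
    forward pass, while PTQ applies it once after training is finished.
    """
    out = []
    for i in range(0, len(weights), BLOCK_SIZE):
        scale, codes = quantize_block(weights[i:i + BLOCK_SIZE])
        out.extend(dequantize_block(scale, codes))
    return out


def bits_per_weight(block_size=BLOCK_SIZE, code_bits=4, scale_bits=16):
    """Effective storage cost: 4-bit codes plus one 16-bit scale per block."""
    return (block_size * code_bits + scale_bits) / block_size


def estimated_size_gb(num_params, bpw):
    """Rough weight-only size in GB (10^9 bytes), ignoring metadata."""
    return num_params * bpw / 8 / 1e9
```

With these defaults, `bits_per_weight()` evaluates to 4.5, so `estimated_size_gb(1_000_000_000, bits_per_weight())` gives a weight-only estimate for a hypothetical one-billion-parameter model. Published checkpoint sizes can differ from such an estimate, for example when some tensors are kept at higher precision.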


#google deepmind #gemma 4 #quantization

