Gemma 3 QAT (Quantized Aware Training) 3x less memory
5 points
18 hours ago
| 2 comments
| huggingface.co
| HN
bigdict
16 hours ago
[-]
Amazing, I've been wishing for this! Do you have any estimates on how much accuracy is first lost then recovered compared to the original bf16 and the naively quantized models?
reply
bigdict
16 hours ago
[-]
Thank you so much for continuing to support Gemma 3 with these updates.
reply