FilterHN
new
ask
show
jobs
submit
FilterHN
show menu
Gemma 3 QAT (Quantized Aware Training) 3x less memory
5 points
by
philschmidxxx
18 hours ago
|
past
| 2 comments
|
huggingface.co
|
HN
▲
bigdict
16 hours ago
[-]
Amazing, I've been wishing for this! Do you have any estimates on how much accuracy is first lost then recovered compared to the original bf16 and the naively quantized models?
reply
▲
bigdict
16 hours ago
[-]
Thank you so much for continuing to support Gemma 3 with these updates.
reply