Bonsai 8B: a 1-bit LLM that fits in 1.15GB
4 points
1 hour ago
| 1 comment
| firethering.com
| HN
lifecodes
1 hour ago
[-]
If this holds, does it unlock 100B+ models running locally in ~tens of GB RAM? Or does accuracy collapse before that point?
reply