Hacker News new | ask | show | jobs
by Havoc 499 days ago
How does that fit into a 4090? The files on the repo look way too large. Do they mean a quant?
1 comments

> Mistral Small can be deployed locally and is exceptionally "knowledge-dense", fitting in a single RTX 4090 or a 32GB RAM MacBook once quantized.