by chessgecko
779 days ago
This is the sparsest model that's been put out in a while (maybe ever; I kind of forget the shapes of Google's old sparse models). This probably won't be a great tradeoff for chat servers, but it could be good for local stuff if you have 512GB of RAM with your CPU.

EDIT: This[0] confirms 240GB at 4-bit.
[0]: https://github.com/ggerganov/llama.cpp/issues/6877#issue-226...
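
For anyone who wants to sanity-check the RAM numbers, here is a rough back-of-envelope sketch. It assumes weights-only storage, 1 GB = 10^9 bytes, and no overhead for quantization scales, KV cache, or activations, so real files will come out somewhat larger. The function names are made up for illustration; the only inputs are the 240GB at 4-bit figure and the 512GB RAM budget mentioned above.

```python
GB = 1e9  # assumption: decimal gigabytes


def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Bytes needed for the weights alone (no quantization metadata)."""
    return n_params * bits_per_weight / 8


def implied_params(total_bytes: float, bits_per_weight: float) -> float:
    """Parameter count implied by a weights-only size at a given bit width."""
    return total_bytes * 8 / bits_per_weight


# Work backwards from the 240GB at 4-bit figure quoted above.
n_params = implied_params(240 * GB, 4)

ram_budget_gb = 512  # the RAM size mentioned in the comment

for bits in (4, 8, 16):
    size_gb = weight_bytes(n_params, bits) / GB
    fits = "fits" if size_gb <= ram_budget_gb else "does not fit"
    print(f"{bits}-bit: ~{size_gb:.0f} GB of weights, {fits} in {ram_budget_gb} GB")
```

Under those assumptions the 4-bit weights leave plenty of headroom in 512GB, 8-bit only just squeezes in, and 16-bit is out of reach.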