

by rahimnathwani 843 days ago

Why are you horrified?

In designing software, there's often a trade-off between (i) generality / configurability, and (ii) performance.

llama.cpp is built for inference, not for training or model architecture research. It seems reasonable to optimize for performance, which is what ~100% of llama.cpp users care about.

1 comment

throwawaybbq1 841 days ago

GGUF files seem to be proliferating. I think some folks (like myself) make the incorrect assumption that the format has more portability/generalizability than it actually has. Hence the horror!
