| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by version_five 1036 days ago
	Short answer, possibly, especially if it was say part of a rag system or some other architecture like that. There is room for more. Nothing particularly special about llama.cpp as the llm back end though. It's optimized for running on lower-end hardware which matters less if you're serving models as a service. But it has many strengths. Llama.cpp / ggml is the open core of ggml.ai founded by GG and funded by the guy from github, so they have some monetization plan for it.