Hacker News
by bddppq 987 days ago
lepton sits at a different layer compared to llama.cpp. In fact, for LLM model files in GGUF format, it uses llama.cpp (via ctransformers, to be precise) as the execution engine.
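
To make the "different layer" point concrete, here is a rough sketch of how a serving layer could decide which execution engine gets a model file. This is not lepton's actual code: `is_gguf` and `pick_engine` are hypothetical names made up for illustration, and the only fact relied on is that GGUF files begin with the four magic bytes `GGUF`.

```python
GGUF_MAGIC = b"GGUF"  # a GGUF file starts with these four bytes


def is_gguf(path):
    """Return True if the file at `path` starts with the GGUF magic bytes."""
    with open(path, "rb") as f:
        return f.read(4) == GGUF_MAGIC


def pick_engine(path):
    """Hypothetical dispatcher: name the backend a serving layer might use.

    The serving layer only routes the file; the actual inference for GGUF
    files would be done by llama.cpp (reached through ctransformers).
    """
    if is_gguf(path):
        return "llama.cpp (via ctransformers)"
    return "other backend"
```

The serving layer here does no inference itself, it only picks who does, which is the sense in which the two projects live at different layers.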