Hacker News
by orost 1096 days ago
Experimental Falcon inference via ggml (so on CPU): https://github.com/cmp-nct/ggllm.cpp

It has problems, but it does work.