Hacker News new | ask | show | jobs
by nravic 1060 days ago
Self plug: run llama.cpp as an inference server on a spot instance anywhere: https://cedana.readthedocs.io/en/latest/examples.html#runnin...
1 comments

Looks cool, joined the waitlist.