Hacker News new | ask | show | jobs
by danielhanchen 138 days ago
Super cool! Also with `--fit on` you don't need `--ctx-size 32768` technically anymore - llama-server will auto determine the max context size!
1 comments

Nifty, thanks for the heads-up!