Hacker News new | ask | show | jobs
by bigyabai 245 days ago
I'm looking forward to future ollama releases that might attempt parity with the cloud offerings. I've since moved onto the Ollama compatibility API on KoboldCPP since they don't have any such limits with their inference server.
1 comments

I am super hopeful! Hardware is improving, inference costs will continue to decrease, models will only improve...