Hacker News new | ask | show | jobs
by jochalek 258 days ago
Sounds like something that could be implemented with llm-d, though I've not experimented with it.

https://llm-d.ai/blog/intelligent-inference-scheduling-with-...

1 comments

Yeah, I don't see why we could not integrate that. I think that is the next step as we move our workloads to production.
`lf deploy` here we come!