Y
Hacker News
new
|
ask
|
show
|
jobs
by
jochalek
258 days ago
Sounds like something that could be implemented with llm-d, though I've not experimented with it.
https://llm-d.ai/blog/intelligent-inference-scheduling-with-...
1 comments
rgthelen
258 days ago
Yeah, I don't see why we could not integrate that. I think that is the next step as we move our workloads to production.
link
mhamann
258 days ago
`lf deploy` here we come!
link