by smarterclayton
394 days ago
llm-d makes sense if you are running a very large production LLM serving setup, say 5+ full H100 hosts. The aim is to be much more focused than kserve on exactly the needs of serving LLMs. It would of course be possible to run it alongside kserve, but the user we are targeting is typically not a kserve deployer today.
|
I wonder if inference-d would be a fitting name.