Hacker News new | ask | show | jobs
Deploying inference endpoints with PD disaggregation on AMD GPUs (dstack.ai)
2 points by cheptsov 24 days ago