Hacker News new | ask | show | jobs
by zhisbug 527 days ago
We study how to approximate the famous shortest-job-first scheduling in LLM inference!