Hacker News new | ask | show | jobs
by adsharma 321 days ago
What's the best number on vLLM and SGlang so far on H100?

It's sad that MLPerf takes a long time to catch up to SOTA models.