Hacker News new | ask | show | jobs
by cherioo 198 days ago
Allegedly deepseek is doing this because they don’t have enough gpu to serve two models concurrently.