Hacker News new | ask | show | jobs
by ekojs 51 days ago
Not at all, I actually run ~30B dense models for production and have tested out 5090/3090 for that. There are gotchas of course, but the speed/quality claims should be roughly there.