Hacker News
by mike_hearn 39 days ago
You can disaggregate, though. Draft models can run on cheaper hardware with less RAM, which saves time on the more expensive machines with more RAM.
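To make that concrete, here's a rough toy sketch, assuming the context is speculative decoding with the draft and target models on separate machines. Everything in it is hypothetical: `draft_propose`, `target_verify`, `generate`, and the two lambda "models" are illustration-only stand-ins, and no real inference framework's API is implied. The cheap side proposes `k` tokens; the expensive side only checks them.

```python
from typing import Callable, List, Tuple

# Toy stand-in for a model: maps a token context to its greedy next token.
Model = Callable[[List[int]], int]

def draft_propose(draft: Model, context: List[int], k: int) -> List[int]:
    """Would run on the cheap, low-RAM machine: guess k tokens one by one."""
    ctx, proposal = list(context), []
    for _ in range(k):
        tok = draft(ctx)
        proposal.append(tok)
        ctx.append(tok)
    return proposal

def target_verify(target: Model, context: List[int], proposal: List[int]) -> List[int]:
    """Would run on the expensive machine. A real system scores all proposed
    positions in one batched forward pass; this loop only simulates that.
    Keeps the matching prefix, then adds one token from the target itself."""
    ctx, accepted = list(context), []
    for tok in proposal:
        want = target(ctx)
        if want != tok:
            accepted.append(want)  # first mismatch: take the target's token, stop
            return accepted
        accepted.append(tok)
        ctx.append(tok)
    accepted.append(target(ctx))  # all guesses matched: one bonus token
    return accepted

def generate(draft: Model, target: Model, prompt: List[int],
             n_tokens: int, k: int = 4) -> Tuple[List[int], int]:
    """Returns the generated tokens and the number of target verification passes."""
    ctx, passes = list(prompt), 0
    while len(ctx) - len(prompt) < n_tokens:
        proposal = draft_propose(draft, ctx, k)
        ctx.extend(target_verify(target, ctx, proposal))
        passes += 1
    return ctx[len(prompt):len(prompt) + n_tokens], passes

# Hypothetical models: the target counts up mod 10; the draft agrees
# except right after a 7, where it guesses wrong.
target = lambda ctx: (ctx[-1] + 1) % 10
draft = lambda ctx: 0 if ctx[-1] == 7 else (ctx[-1] + 1) % 10

tokens, passes = generate(draft, target, prompt=[0], n_tokens=12)
```

In this toy run the output matches what the target alone would produce, but the expensive side does far fewer passes than one per token. That's the time saved on the big machine, at the cost of shipping proposals between the two.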