Hacker News new | ask | show | jobs
by nl 83 days ago
I mean you can run a 1T model on consumer hardware now by doing things like layer offloading and streaming from SSD. It's just too slow to be useful.