by petercooper 702 days ago
It's hardly cheap, starting at about $10k of hardware, but another potential option appears to be using Exo to spread the model across a few MacBook Pros or Mac Studios: https://x.com/exolabs_/status/1814913116704288870
1 comment

Or maybe using Distributed Llama? https://github.com/b4rtaz/distributed-llama