by yieldcrv 191 days ago
15 t/s is way too slow for anything but chatting (call and response), and you don't need a 3T-parameter model for that.

Wake me up when the situation improves

1 comment

Just wait for the M5-Ultra with a terabyte of RAM.