Hacker News
by smcleod 1 day ago
Those dense models are pretty fast with MTP now: 40-70 tok/s depending on your machine. That's faster than cloud models (although obviously not as smart).