Hacker News new | ask | show | jobs
by aazo11 417 days ago
This is a huge unlock for on-device inference. The download time of larger models makes local inference unusable for non-technical users.