Y
Hacker News
new
|
ask
|
show
|
jobs
by
ac29
68 days ago
This 35B-A3B model is 4-5x cheaper than Haiku though, suggesting it would still be cheaper to outsource inference to the cloud vs running locally in your example