Hacker News new | ask | show | jobs
by throwa356262 26 days ago
It is a combination of 3 things

1. Some companies are very good in training and serving at much lower cost

2. Some companies have access to new much cheaper hardware

3. People have realzeid that you dont need a 3.2T model when a 310B one (Opus vs MiMo 2.5) performs equally well for your particular task.