Hacker News new | ask | show | jobs
by uncomplexity_ 545 days ago
the mini models are the most exciting to me. they seem to hit the balance of general utility and cost.

people are fond of comparing large bleeding edge models, but when we compare the small ones to say gpt3.5, it is still astonishing.

to me the smaller models being the balance of intelligence and cost is the best indicator of what the general population can use and afford.

the larger models end up being used mostly by individuals teams and orgs who can afford to pay for it and learn how to use it.

keep in mind at this day and age there is still a lot of people who dont know these things exist. it's just outside of their reality. then there are who have the slightest idea about it, but won't commit the time and money needed to learn it and fully experience it.

this is like the new generational transfer of wealth and information. even if you're an individual or a small team, with enough levers you can use these technologies to your advantage, and with more levers (like yc) you can enter the battlefield and compete with existing companies.