Hacker News new | ask | show | jobs
by nsingh2 68 days ago
It's going to be expensive to serve (also not generally available), considering they said it's the largest model they've ever trained.

I suspect it's going to be used to train/distill lighter models. The exciting part for me is the improvement in those lighter models.

2 comments

It seems inevitable that costs will come down over time. Expensive models today will be cheap models in a few years.
What's interesting is that scaling appears to continue to pay off. Gwern was right - as always.