by fnbr 1041 days ago
Yeah, you’re totally right. I actually wrote a follow-up essay about that:

https://finbarr.ca/llms-not-trained-enough/

I think the conversations were partly (largely?) a snapshot in time. I was talking to people in February/March, and all of this was much less thought through at the time. But you’re totally right. You want something like Llama, where you train a smaller model longer than Chinchilla would predict.
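
To put rough numbers on "longer than Chinchilla would predict", here is a minimal back-of-the-envelope sketch. It assumes the commonly quoted rule of thumb of about 20 training tokens per parameter and the usual C ≈ 6ND approximation for training compute; both are approximations, not exact results from the paper. The function names and the 7B-parameter / 1T-token example are illustrative choices, not anything taken from the essay.

```python
# Assumption: the often-quoted Chinchilla rule of thumb of ~20 training
# tokens per parameter. This is an approximation, not an exact constant.
CHINCHILLA_TOKENS_PER_PARAM = 20


def chinchilla_tokens(n_params: float) -> float:
    """Approximate compute-optimal token count for n_params parameters."""
    return CHINCHILLA_TOKENS_PER_PARAM * n_params


def overtraining_ratio(n_params: float, n_tokens: float) -> float:
    """How far past the rule-of-thumb token budget a training run goes."""
    return n_tokens / chinchilla_tokens(n_params)


def training_flops(n_params: float, n_tokens: float) -> float:
    """Common C ~= 6 * N * D approximation for total training compute."""
    return 6 * n_params * n_tokens


if __name__ == "__main__":
    # Illustrative numbers: a 7B-parameter model trained on 1T tokens.
    n_params, n_tokens = 7e9, 1e12
    print(f"rule-of-thumb tokens: {chinchilla_tokens(n_params):.2e}")
    print(f"overtraining ratio:   {overtraining_ratio(n_params, n_tokens):.1f}x")
    print(f"training FLOPs:       {training_flops(n_params, n_tokens):.2e}")
```

A ratio well above 1 means the model sees far more tokens than the rule of thumb suggests for its size. That costs extra training compute but gives you a smaller model that is cheaper to serve.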