Hacker News new | ask | show | jobs
by theahura 358 days ago
- Could be wrong about Scale. I'm going off folks I know at client companies and at scale itself.

- idk I've trained a lot of models in my time. It's true that there's an arcane art to training LLMs, but it's wrong that this is somehow unlearnable. If I can do it out of undergrad with no prior training and 3 months of slamming my head into a wall, so can others. (Large LLMs are imo not that much different from small ones in terms of training complexity. Tools like torch and libraries like megatron make these things much easier ofc)

- there are a lot of fantastic researchers and I don't mean to disparage anyone, including anyone I didn't mention. Still, I stand by my beliefs on ml. Big changes in architecture, new learning techniques, and training tips and tricks come from a lot of people, all of whom are talking to each other in a very decentralized way.

My opinions are my own, ymmv

1 comments

Dude you went to Columbia, you probably dont think people that went to state schools are even human.

Rest of the article was good

> you probably dont think people that went to state schools are even human

on the contrary. I have been quite vocal about why I felt my education was lacking and the respect I have for those who have gone for nontraditional paths

Dude, going to a state school isn't a nontraditional path.