Hacker News new | ask | show | jobs
by jointpdf 1214 days ago
The skill is in creating the training data in the first place.

Training a model is hardly a skill. It’s more like playing Tamagotchi—check on it once in a while to make sure it hasn’t died, and guess at ways to make it happier in the future.

1 comments

I agree with your first statement, and disagree with the second one. Training a new, non-trivial model is 1/3 craft, 1/3 science, and 1/3 art. There are not very many people in the world capable of training GPT-4 level models or beating state of the art in image generation.