Hacker News
by mgsloan2 300 days ago
You could, but it is extremely expensive to train an LLM that is competitive on coding evals, so I was assuming the use of a model that someone else trained.

Also, if it is only trained on code, it's likely to miss out on all the world knowledge that comes from the rest of the data.

1 comment

Fine-tuning an existing model instead of training from scratch might help.