Hacker News new | ask | show | jobs
by onion2k 221 days ago
Exactly the point. If there's a lot of data in the training set the results will be better.
1 comments

I guess I'm trying to emphasize the distinction between information in the repo (code) vs. information elsewhere (discussions) that the model looks at.