Hacker News new | ask | show | jobs
by baxtr 205 days ago
I was under the impression for this to work like that, training data needs to be plenty. One project is not enough since it’s too "sparse".

But maybe this example was used by many other people and so it proliferated?

1 comments

The repo[0] currently has been forked ~41300 times.

[0] https://github.com/wesbos/JavaScript30

It’s quite unlikely that training data will include duplicate repositories or even forks, that alone would surpass the published dataset sizes.