by swalsh 843 days ago
There was a period of time when data was easily accessible, and OpenAI suctioned up as much of it as possible. Sites have since locked their doors, realizing someone was raiding their pantry.

Getting that dataset now would cost significantly more.

1 comment

I would have thought that Anna's Archive is still the best source of high-quality tokens, and it is fully open.