Hacker News
by Snuggly73 271 days ago
To my untrained eye, this looks more like an instruct dataset.

For plain text, I really like this one: https://huggingface.co/datasets/roneneldan/TinyStories