Slop Bucket Idea – a dataset of AI slop (train AI what not to do)
2 points
by IAmNeo
29 days ago
I just had this idea. You read about it all the time: AI slop is so prevalent that people are getting banned for a year for submitting science papers containing it to arXiv, developers are moaning in angst, even Microsoft has done its own study showing that AI degrades the quality of simple documents, and then there is the beloved em-dash. I don't really have the know-how or the time, but it occurred to me: if we created a public dataset that anyone could submit to, we could catalog and organize all the AI slop by type, with explanations of why each example is slop and why not to do it, and then train a large language model with this dataset included to help it correct itself. I don't really know the technical details of training a large language model. Is this even possible?
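To make the idea a bit more concrete, here is a rough sketch of what one catalog entry might look like. Everything in it is an assumption on my part: the `SlopEntry` name, the field names, and especially the `better_text` field, which the idea above does not require. The (prompt, chosen, rejected) shape is the layout commonly used for preference-style fine-tuning data, but I am not claiming this is how any particular model is actually trained.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class SlopEntry:
    """One hypothetical catalog record; every field name is an assumption."""
    prompt: str       # what the model was asked to write
    slop_text: str    # the AI-generated output being flagged
    slop_type: str    # category label chosen by the submitter
    explanation: str  # why it is slop and why not to do it
    better_text: str  # a human-written alternative (assumed, optional in spirit)


def to_jsonl_line(entry: SlopEntry) -> str:
    """Serialize one entry as a single JSON line for a public JSONL file."""
    return json.dumps(asdict(entry), ensure_ascii=False)


def to_preference_pair(entry: SlopEntry) -> dict:
    """Reshape an entry into a (prompt, chosen, rejected) training example."""
    return {
        "prompt": entry.prompt,
        "chosen": entry.better_text,
        "rejected": entry.slop_text,
    }


# A made-up example entry, for illustration only.
entry = SlopEntry(
    prompt="Summarize the meeting notes in one sentence.",
    slop_text="In today's fast-paced world, it is important to note that the meeting covered many things.",
    slop_type="filler",
    explanation="Opens with a stock phrase and says nothing specific about the meeting.",
    better_text="The team agreed to ship the fix on Friday and to drop the old API.",
)

print(to_jsonl_line(entry))
print(to_preference_pair(entry))
```

The explanation field would not feed into the pair directly, but it could help reviewers agree on categories or be used as extra training text.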
This doesn't mean this stuff will improve drastically, since it might just highlight how low the bar is for some folks, but it is better than nothing.
Sort of like how reality TV was considered broadcast anesthetic slop, and yet people couldn't get enough of it and it is still with us decades later.