Hacker News
by jiveturkey42 844 days ago
AI is about to become really sarcastic, pedantic, and absurdly moralistic
4 comments

Every LLM is almost certainly already trained on Reddit content from before 2015, when the entire site's content was compiled for research.

This is just adding in more recent content.

> makes a banter

> gets a pedantic reply

Were you also trained on Reddit?

No, I did my small part in contributing to the training data on Reddit though.
And even more confidently wrong.
Can’t wait for heavily downvoted unfortunate truths to be eliminated from AI
That already happens. Standard academic datasets that use Reddit start by eliminating any comments that are under +3, for example.
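For illustration, here is a minimal sketch of that kind of score filter in Python. The `Comment` type, the sample data, and the `MIN_SCORE` name are all made up for this example, and the +3 cutoff is simply the figure mentioned above; actual dataset pipelines may apply the threshold differently.

```python
from dataclasses import dataclass

# Assumption: threshold taken from the comment above ("under +3" is dropped).
MIN_SCORE = 3


@dataclass
class Comment:
    """Hypothetical record type; real Reddit dumps carry many more fields."""
    body: str
    score: int


def filter_by_score(comments, min_score=MIN_SCORE):
    """Keep only comments whose score is at least min_score."""
    return [c for c in comments if c.score >= min_score]


# Made-up sample data, purely for demonstration.
sample = [
    Comment("WD-40 is a lubricant", -12),
    Comment("Here is a cited study", 3),
    Comment("no, that's dumb", 1),
    Comment("Careful, well-argued reply", 57),
]

kept = filter_by_score(sample)
print([c.body for c in kept])
```

Note that a filter like this drops anything heavily downvoted regardless of whether it is true, which is the point being made here.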
Confidently wrong? I thought it was getting access to Reddit, not HN.
Hate on HN all you want. I’ve been without my ADHD meds (warning: the company “Done” is not technically a scam) and have been spending way too much time on each site for the past few days, and I can say this for sure: at least people on HN pay heed to the concept of premises and careful, non-combative argument. Most responses on Reddit are “no, that’s dumb” or “yes, that reminds me of my metaphysical takes”…
Not only that, the Reddit hive mind is plain wrong in most cases. Plus, on a number of occasions the "le reddit investigation" and "we did it reddit" excrement caused real-world problems for the people being targeted, and those people were innocent.

Reddit is OK and quite cool for focused discussion on targeted subreddits. But all the general subreddits visited by the general population, and everything that pops up once in a while on the front page, are targets for the hive mind.

For comparison with HN: there is a lot of "wrong" here too, but here you can find a cited academic study from a good American university revealing that most of the bot farms and fake news disseminators come from the Western sphere. If you try to claim on Reddit, or anywhere else on the internet, that the fake news champion is not Russia+China+whoever is evil, your entry will get buried.

Also, ask yourself who's the median redditor. For my country's national subreddit the median redditor is a high school kid from the capital.

Try saying anywhere on Reddit "WD-40 is a lubricant" and be prepared to face a tsunami of incorrect information. Or say anything about glyphosate.
I don't know what their deal is with glyphosate, but I'm pretty confident that the average Redditor has never held a can of WD-40 in their life.
"Often wrong, never in doubt"
That's already what the Googlers developing it are.
I don't understand why they chose Reddit. Was 4chan not willing? Seriously though, why not use the comment and discussion sections of Wikipedia and other sites that are not drowning in social insanity?