| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by tkgally 296 days ago
	Fortunately, em-dash users who have been posting to HN long enough can point to evidence of our pre-ChatGPT use: https://news.ycombinator.com/threads?id=tkgally&next=3380763...

2 comments

dang 295 days ago

Indeed!

https://news.ycombinator.com/comments?id=dang&next=33807246#...

https://news.ycombinator.com/item?id=27787448

link

latexr 295 days ago

Throwing another example in the pot.

https://news.ycombinator.com/item?id=24272893#:~:text=—

link

tkgally 295 days ago

Ah, I am in good company!

link

dang 295 days ago

We should have an em dash leaderboard. With a cutoff date of course.

link

tkgally 293 days ago

Done: https://news.ycombinator.com/item?id=45071722

link

dang 293 days ago

Wow, I didn't expect anyone to actually do that :)

link

stinkbeetle 295 days ago

So you're the ones who have been training the robots.

link

smt88 295 days ago

Reddit and HN are among the highest quality sources of training text and are probably weighted very heavily as "probably human" in the mainstream models.

Any source of text with huge amounts of automated and community moderation will be better quality than, say, Twitter.

link

what 295 days ago

Reddit is anything but high quality.

link

Jepacor 295 days ago

That depends heavily on the subreddits you browse. There absolutely are places with high quality content, though it feels like they are getting sparser and sparser.

link

kelnos 295 days ago

Not in that sense; high quality in the sense that there are a lot of actual, real people posting there, and those people tend to come from a pretty diverse set of backgrounds.

link

FiniteField 295 days ago

Perhaps on the smaller subreddits, but have a look at /r/all on any given day and it's obvious that real people, and diverse backgrounds, it is not. Every single subreddit that goes above a certain activity threshold collapses into the exact same state of astroturfed, mass-produced political slop targeted towards low IQ people.

link

AlexeyBelov 293 days ago

Yeah, there is still a lot of manoshpere / rightoid adjacent content on Reddit. It used to be worse though.

Old Reddit was.

Oh man, someone should train an LLM on pre-Digg death Reddit and modern Reddit and have them chat. It’d be a hoot.

link

jibal 295 days ago

"among the highEST" is comparative; it doesn't entail "high".

link

pyman 295 days ago

Although I'm sure @stinkbeatle was joking, I should clarify that most LLMs are trained on books and online articles written by professional writers. That's why they tend to have a rich vocabulary and use things like hyphens.

I agree, HN is an amazing community with brilliant people and top quality content, but it's not enough to train an LLM.

Last thing. An LLM is just a tool, it can clean up your writing the same way a photo app can enhance your pictures. It took a while for people to accept that grandma's photos looked professional because they had filters. Same will happen with text. With ChatGPT, anyone can write like a journalist. We're just not used to grandma texting like one, yet :)

link

Arnt 295 days ago

I really like that I can use an LLM to change tone. "Change the following text to sound like bland American officespeak."

That said, this feature doesn't sound like a great leap for mankind.

link

Moru 293 days ago

> With ChatGPT, anyone can write like a journalist.

Minus the fact-checking, transparency, truth and social responsibility.

link

WalterBright 295 days ago

> HN is an amazing community with brilliant people

Correction: bright people

link