| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by BalinKing 105 days ago
	Sorry, updated my original comment—I meant to qualify it to only those cases where it's blatantly obvious. Obviously a lot of ambiguous comments will slip through as a result, but I agree with you that false negatives are better than false positives.

2 comments

vel0city 105 days ago

Your comments use em dashes. Many would claim those are vastly overrepresented in AI language and thus an account overly using them are blatantly AI.

I don't think your account is AI just by these few comments, but I would like to point out that most rubrics one might use to determine what is obviously AI might end up including the way you talk.

If there was a truly accurate tell, some algorithm you could feed a few sentences in and it could tell you "yep, this is 100% AI", then yeah sure use that. I don't know you could realistically build that machine, especially when it comes to the generation of text.

link

scratchyone 105 days ago

For what it's worth, there are modern LLM detectors with extremely low false-positive rates. The tech has advanced quite a bit since the ZeroGPT days. Personally I've gotten very good results from Pangram Labs. Still can't directly ban people though because false positives are always possible.

link

diacritical 105 days ago

Are they great at detecting normal prompts that don't try to make the LLM speak non-LLM-ishly? If you make the LLM not use em dashes, "it's not; it's" phrases and similar things, and if you make it make a few mistakes here and there, would it still be detected? My point is that if people aren't trying to hide their LLM use, it might work, otherwise it probably wouldn't. How would a detector tool work against output where the prompt tells the LLM to alter the way it writes? Or if the LLM output is being modified by another LLM specifically designed to mimic certain styles?

Like, why would my comment (or yours, or any other comment) pass or fail the LLM check the I/you/someone else used specific prompts or another LLM to edit the output? It seems like these tools would work on 99.9% of the outputs, but those outputs likely weren't created in an adversarial way.

link

zahlman 105 days ago

Is that false-positive rate from your own testing, or the author's claims? What is the source of ground truth?

link

i_think_so 104 days ago

I will never, ever forgive these techbros for ruining emdashes. I will also never stop using them -- they are a permanent part of my writing style -- no matter the personal consequences.

link

mroche 105 days ago

Your comments use em dashes. Many would claim those are vastly overrepresented in AI language and thus an account overly using them are blatantly AI.

I've always found this funny. Doesn't macOS' default text substitution enable (annoying to me) things like em-dash, smart quotes, etc?

link

sfn42 105 days ago

Can you show an example of "blatantly obvious"?

link

foltik 105 days ago

https://news.ycombinator.com/threads?id=naomi_kynes

https://news.ycombinator.com/threads?id=aplomb1026

https://news.ycombinator.com/threads?id=CloakHQ

https://news.ycombinator.com/threads?id=decker_dev

link

i_think_so 104 days ago

Oof. Some of those seemed reasonable at first. Ex: CloakHQ's comment on Compaq/DEC...

....until you start scrolling down the page and it becomes screamingly obvious that everything it says comes from the same template.

Maybe the problem isn't just that AI produces gobs of useless crap. Maybe what's worse is that it can produce even more mediocre crap that crowds out the good?

All oatmeal, no steak, leads to "starvation" by poor nutrition.

link