HN Mirror



	by wongarsu 1213 days ago
	There's a large body of research into invisible text watermarking, so this would certainly be possible. Maybe the simplest to implement in LLMs would be to bias the token generation slightly, for example by making tokens that include the letter i slightly more likely. In a long enough text you could then see the deviation from normal human text characteristics.
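	A rough sketch of the letter-i idea, to make it concrete. Everything here is an assumption for illustration: the toy vocabulary, the uniform base weights standing in for a real model's logits, the bias strength, and the helper names (`sample_token`, `generate`, `z_score`) are all hypothetical. Detection is a plain one-sided z-test on how often tokens containing "i" appear compared with the unbiased rate.

```python
import math
import random

# Hypothetical toy vocabulary; a real LLM has tens of thousands of tokens.
VOCAB = ["the", "cat", "sat", "on", "a", "mat", "in", "this",
         "with", "big", "red", "dog", "ran", "fast", "is", "it"]


def has_i(token):
    """True if the token contains the letter 'i'."""
    return "i" in token


# Rate of 'i' tokens when sampling uniformly (assumed stand-in for human text).
BASELINE = sum(has_i(t) for t in VOCAB) / len(VOCAB)


def sample_token(rng, bias=0.0):
    """Sample one token; `bias` is added to the logit of every 'i' token.

    Base logits are all zero here (uniform), which is an assumption;
    a real model would supply its own logits.
    """
    weights = [math.exp(bias if has_i(t) else 0.0) for t in VOCAB]
    return rng.choices(VOCAB, weights=weights, k=1)[0]


def generate(n, bias, seed):
    """Generate n tokens with the given watermark bias."""
    rng = random.Random(seed)
    return [sample_token(rng, bias) for _ in range(n)]


def z_score(tokens, baseline_rate):
    """How many standard deviations the 'i' token count sits above baseline."""
    n = len(tokens)
    hits = sum(has_i(t) for t in tokens)
    expected = n * baseline_rate
    std = math.sqrt(n * baseline_rate * (1 - baseline_rate))
    return (hits - expected) / std


if __name__ == "__main__":
    plain = generate(5000, bias=0.0, seed=1)
    marked = generate(5000, bias=0.5, seed=2)
    print("unbiased z:", round(z_score(plain, BASELINE), 2))
    print("biased z:  ", round(z_score(marked, BASELINE), 2))
```

	The z-score grows roughly with the square root of the text length, which is why this only works on long enough text. It also needs a trustworthy baseline rate for human text, which the toy vocabulary sidesteps.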

2 comments

PeterisP 1213 days ago

All the main scenarios for detecting generated text that I can imagine have to assume that the LLM isn't "cooperative" but is specifically designed (or fine-tuned) to avoid detection.


lwhi 1213 days ago

Yep, I think it's a good idea.
