Hacker News new | ask | show | jobs
by version_five 1029 days ago
I wonder if blocking gptbot is a good signal that a website has non LLM generated content on it, and is therefore good training data...
1 comments

An illustrator specifically wrote[1] that this is why they won't be tagging their social media posts with #HumanMade, #NoAI and similar, as it's a signal there's unadulterated training data.

[1] https://www.davidrevoy.com/article977/artificial-inteligence...