|
|
|
|
|
by atonse
211 days ago
|
|
Let me clarify, when I perform interviews, I tell my candidates they can do _everything_ you would do in a normal job, including using AI and googling for answers. But just to humor you (since I did make that strong statement), without googling or checking anything, I would start with basic regular expression ranges (^[A-za-z\s\.\-*]) etc and do a find-replace on that until things looked coherent without too much loss of words/text. But the problem isn't me, is it? It's the AI companies and their crawlers, that can trivially be changed to get around this. At the end of the day, they have access to all the data to know exactly which unicode sequences are used in words, etc. |
|
true.
It does put the AI companies in the position though of continuing to build/code software that circumvents their attempts to steal content though.
Which might be looked upon unfavorably whenever dragged to court.