by Shemetz 976 days ago
Another pitfall: the end user can easily circumvent this by generating a long text and then randomly adding or removing a few words here and there, since each edit shifts the position of every token after it. This could be solved by replacing the simple check of "every Tth token belongs to S" with something like "the average distance between consecutive S tokens is very close to T".
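
A rough sketch of the difference between the two checks, in Python. The function names, the toy token list, and the tolerance of 0.5 are all made up for illustration, and the toy text assumes S tokens appear only at the watermark positions. In real text, stray S tokens elsewhere would pull the average gap below T, so this is only meant to show the idea, not a working detector.

```python
from statistics import mean


def strict_check(tokens, special, period):
    """True if every `period`-th token (counting from 1) is in `special`."""
    picked = tokens[period - 1::period]
    return bool(picked) and all(tok in special for tok in picked)


def mean_gap_check(tokens, special, period, tolerance=0.5):
    """True if the average gap between consecutive `special` tokens is
    within `tolerance` of `period`. The tolerance value is an assumption."""
    positions = [i for i, tok in enumerate(tokens) if tok in special]
    if len(positions) < 2:
        return False  # not enough special tokens to measure a gap
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return abs(mean(gaps) - period) <= tolerance


# Toy data: S = {"s"}, T = 4, and every 4th token is the special one.
S = {"s"}
T = 4
original = ["a", "b", "c", "s"] * 10

# The user inserts a single extra word near the start of the text.
edited = original[:5] + ["extra"] + original[5:]

# Text with no watermark: special tokens show up every 7th position instead.
unrelated = ["a", "b", "c", "d", "e", "f", "s"] * 6

print(strict_check(original, S, T), mean_gap_check(original, S, T))
print(strict_check(edited, S, T), mean_gap_check(edited, S, T))
print(strict_check(unrelated, S, T), mean_gap_check(unrelated, S, T))
```

One inserted word pushes every later S token off its expected slot, so the strict check fails on `edited`, while the average gap only moves from 4 to about 4.1 and still passes.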