I'd be suprised if they weren't making sure that Facebook could recognize content made with their tooling. Watermarks will always be on the honors system though, people could always remove them before posting.
For all the talk about safety it's surprising to me how little watermarking is going on, these AI producers are content with generating unmarked garbage.
I figure there's a lot you could do with interwoven zerowidth space characters and other unicode tricks (using other code blocks for otherwise normal characters like you would when spoofing a url) - sure it would be easy to write software to normalize it back to ascii but at least that requires intent to deceive - we could even write laws against stripping encoding schemes meant to identify automated content
If I'm not mistaken, didn't the major LLM companies try to band together for watermarking type features only to walk it back later and admit it wouldn't work?
I figure there's a lot you could do with interwoven zerowidth space characters and other unicode tricks (using other code blocks for otherwise normal characters like you would when spoofing a url) - sure it would be easy to write software to normalize it back to ascii but at least that requires intent to deceive - we could even write laws against stripping encoding schemes meant to identify automated content
Prior art is that time Genius encoded "red handed" into morse code by way of alternating apostrophes to catch Google copying their lyrics: https://gizmodo.com/genius-claims-it-busted-google-stealing-...