Hacker News new | ask | show | jobs
by JoshBlythe 67 days ago
The dual-watermark theory makes alot of sense for defensive engineering. You always assume your outer layer will be broken and so keep a second layer that isn't publicly testable. Same as defence in depth anywhere else. I'm curious - as new models are being built constantly and they're naturally non-deterministic, do you think it's possible for end users to prove that?
1 comments

> I'm curious - as new models are being built constantly and they're naturally non-deterministic, do you think it's possible for end users to prove that?

How is the model relevant? The models are proprietary and you never see any of its outputs that haven't been watermarked.