Hacker News new | ask | show | jobs
by HumanOstrich 130 days ago
How do you objectively measure if an LLM is training on something if you don't have access to its training data?
1 comments

In theory the same way people are making those claims about "stolen" art, such as models that produced watermarks from Getty images or Shutterstock. Similar "watermarks" have existed in some LLM output.