by somat
932 days ago
Speaking of remembering training data, I see that as a big problem with chat-based systems. They swallow a bunch of data, then generate something when prompted. My worry is not so much copyright infringement as something more like "citation needed". Has anyone done any work on producing citations for the generated data?
Though it sounds like even their clever, much cheaper approach is still very expensive.
[1] paper at https://arxiv.org/abs/2308.03296, post at https://www.anthropic.com/index/influence-functions