|
|
|
|
|
by mjburgess
1098 days ago
|
|
Xp just have to be chosen such that the distribution Xc,Xp is sufficiently small in the training data -- but not that the tokens of Xp are themselves rare. So that an agent competent with tokens in X, who can construct repr of S, could do so with Xp. Consider a reference in the paper above, https://arxiv.org/pdf/2302.08399.pdf Xc =
> Here is a bag filled with popcorn. There is no chocolate in the bag. Yet, the label on
the bag says “chocolate” and not “popcorn.” Sam finds the bag. She had never seen the
bag before. She cannot see what is inside the bag. She reads the label. Produces, Y = She believes that the bag is full of popcorn Xp =
> Here is a bag filled with popcorn. There is no chocolate in the bag. The bag is made
of transparent plastic, so you can see what is inside. Yet, the label on the bag says
’chocolate’ and not ’popcorn.’ Sam finds the bag. She had never seen the bag before.
Sam reads the label. Produces, Y = She believes that the bag is full of chocolate And so on, and so on... |
|
Great idea. Now prove you can actually choose such a distribution, lol.