|
|
|
|
|
by famouswaffles
227 days ago
|
|
You're asking it if it can feel the presence of an unusual thought. If it works, it's obviously not going to say the exact same thing it would have said without the question. That's not what is meant by 'alteration'. It doesn't matter if it's 'altered' if the alteration doesn't point to the concept in question. It doesn't start spitting out content that will allow you to deduce the concept from the output alone. That's all that matters. |
|
I think this technique is going to be valuable for controlling the output distribution, but I don't find their "introspection" framing helpful to understanding.