by sebastiennight
501 days ago
You don't want that as a product: an AI model that trains itself purely through internal conversations, without ever looking at any human-written content, might end up producing something humans cannot comprehend. There's also the technicality that you can't "win" a conversation the way you can "win" at Go, so how would you know when to reward the model during training?
|
https://i.imgur.com/CBmMSqO.png, perhaps