|
|
|
|
|
by harrouet
52 days ago
|
|
The article I am responding to (which I've read) shows that these LLMs come with all sorts of hacks (= context bits) to make it behave more like this or more like that. There is probably a whole testing workflow at AI companies to tweak each new model until it "looks" acceptable. But they still don't understand what they are doing. This is purely empirical. |
|
Isn't that what the RLHF phase does ( https://www.paloaltonetworks.com/cyberpedia/what-is-rlhf )?