|
|
|
|
|
by cubefox
1189 days ago
|
|
The model was just instruction tuned. That is, it can answer questions and respond to instructions. Unless especially prompted, a pure predictor would often not respond to instructions but elaborate on them. To get specific types of responses, you would need something like RLHF. |
|