|
|
|
|
|
by khafra
405 days ago
|
|
"LLM whisperer" folks will confidently claim that base models are substantially smarter than fine-tuned chat models; with qualitative differences in capabilities. But you have to be an LLM whisperer to get useful work out of a base model, since they're not SFT'ed, RLHF'ed, or RLAIF'ed into actually wanting to help you. |
|
Is it like in the early GPT-3 days, when you had to give it a bunch of examples and hope it catches the pattern?