|
|
|
|
|
by yacine_
1064 days ago
|
|
If you fine tune them to be task specific, they'll perform well. In my experience, this control loop is a better investment than "prompt engineering". (When I say task specific, I mean very task specific) GPT4 over the API is too fine tuned, which constrains its behavior. It fails to capture nuance in instructions. When you have the bag of weights, you can actually control your model. Having actual control over the model, and understanding the infrastructure that it's running on helps you meet actual SLAs. And it's cheaper, if you're not backed by infinite venture money. https://arxiv.org/abs/2307.13269 |
|
I don't see how this warrants the extra exciting popularity of LLAMA2, etc.
I still haven't found my own personal niche "good enough" test case