|
|
|
|
|
by visarga
1045 days ago
|
|
In my tests LLaMa2-13B is useable for information extraction tasks and LLaMA2-70B is almost as good as GPT-4 (for IE). These models are the real thing. We can fine-tune LLaMAs, unlike OpenAI's models. Now we can have privacy, control and lower prices. We can introduce guidance, KV caching and other tricks to improve the models. The enthusiasm around it reminds me of JavaScript framework wars of 10 years ago - tons of people innovating and debating approaches, lots of projects popping up, so much energy! |
|
Hmm. If LLMs turned out like JS frameworks, that would mean that in ten years people will be saying:
“Maybe we don’t really need all this expensive ceremony, honestly this could be done with vanilla if/else heuristics…?”