Hacker News new | ask | show | jobs
by ulam2 556 days ago
No base model? disappointed.
3 comments

The base model is Llama 3.1 70B
It is probably the same base model as Llama 3.0.

They mention postraining improvements.

interesting comment... what are you doing with base models? Are you a "finetuner"? I have been trying my hand with finetunes on instruct models and the results have been ok, but not awesome. I have a base model downloading now to give that a proper shot.
I'm not them but I still prefer a text completion style of prompting rather than a baked in pre-prompt structure assuming only a 'chat' style metaphor of interaction.
Base models are useful in research to see the effect of instruction tuning