Hacker News new | ask | show | jobs
by haolez 805 days ago
What's the feeling of playing with a powerful base model? Will it just complete the prompt text like a continuation of it?
1 comments

Generally, yes, it literally just tries to predict the next token again and again and again.

This model is apparently surprisingly good at chat, even though it is a base model, and will take part it it to some extent. It should be really interesting once it's fine-tuned.