| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by srush 238 days ago
	Our primary focus is on RL post-training. We think that is the best way to get the model to be a strong interactive agent.

1 comments

So, yes, but you won’t say what the base model is? :)

It seems like a sort of sonnet model as a lot of people are reporting it like to spam documentation on Twitter like sonnet 4.5