We've done some vibe checks on it with OpenHands and it indeed performs roughly as good as Sonnet 4.5.
OSS models are catching up