Hacker News new | ask | show | jobs
by tinthedev 15 days ago
You misunderstand the "test" here to mean programming, rather than test against the model's capabilities.
1 comments

thanks for pointing that out. makes sense.