Hacker News new | ask | show | jobs
by wrsh07 490 days ago
Nah, you can just request that in your prompt and then fail answers that are incorrect and/or don't include the think trace
1 comments

Yes exactly! You can in fact add that has a reward function for style and format checking!