Hacker News
by janalsncm 339 days ago
I didn’t think it was vague. Given an existing piece of software, write a detailed spec on what it does and then reward the model for matching its performance.
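To make that concrete, here is a rough sketch of such a reward, assuming "matching its performance" means agreeing with the existing program's outputs on a set of test inputs. Every name here (`match_reward`, `reference_abs`, `good_candidate`, `buggy_candidate`) is hypothetical and made up for illustration; a real setup would presumably also need to handle timing, side effects, and inputs the test set does not cover.

```python
from typing import Callable, Iterable


def match_reward(reference: Callable[[int], int],
                 candidate: Callable[[int], int],
                 inputs: Iterable[int]) -> float:
    """Return the fraction of inputs on which candidate matches reference."""
    cases = list(inputs)
    if not cases:
        return 0.0
    matches = 0
    for x in cases:
        expected = reference(x)
        try:
            if candidate(x) == expected:
                matches += 1
        except Exception:
            # A crashing candidate earns no credit for this input.
            pass
    return matches / len(cases)


# Hypothetical stand-in for the "existing piece of software".
def reference_abs(x: int) -> int:
    return abs(x)


# Two hypothetical model-written attempts at reproducing it.
def good_candidate(x: int) -> int:
    return x if x >= 0 else -x


def buggy_candidate(x: int) -> int:
    return x  # wrong for every negative input


test_inputs = range(-5, 5)
print(match_reward(reference_abs, good_candidate, test_inputs))
print(match_reward(reference_abs, buggy_candidate, test_inputs))
```

The score is only as good as the inputs it is computed on, so a candidate can earn full reward while still diverging from the original on cases nobody thought to test.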

The vague part is whether this will generalize to other, non-software domains.

1 comment

> write a detailed spec on what it does

A much harder task than writing said software.