Hacker News new | ask | show | jobs
by jack_pp 26 days ago
We can trust the feedback we give it based on the output it provides.
1 comments

What kind of feedback are you giving? What's the reward function?
Right now, no feedback since I don't run this system but our workflows could change to accommodate it