Hacker News new | ask | show | jobs
by mirekrusin 793 days ago
LLMs are super-intelligent at mimicking already, it won't take much time to find some kind of RL loop there.