by Turn_Trout 854 days ago
I'm the author of the GPT-2 work. This is a nice post, thanks for making it more available. :)

Li et al. [1] and I independently derived this technique last spring, and someone else independently derived it last fall. Something is in the air.

Regarding your footnote 2 re capabilities: I considered these kinds of uses before releasing the technique. Ultimately, practically successful real-world alignment techniques will let you do new things (which is generally good IMO). The technique so far seems to be delivering the new things I was hoping for.

[1] https://openreview.net/forum?id=aLLuYpn83y