Hacker News
by lewtun 251 days ago
For those interested in playing with an implementation of these ideas, my colleagues at HF made some recipes here: https://github.com/huggingface/trl/blob/main/docs/source/lor...