by int_19h 631 days ago
If you have a reviewed output dataset from an LLM, you could use it for RLHF.
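For what it's worth, here is a rough sketch of what that could look like in practice, assuming each reviewed record carries a prompt, the model's output, and a numeric reviewer score. The field names (`prompt`, `output`, `score`) and the `build_preference_pairs` helper are made up for illustration and do not come from any particular library. The idea is to turn the reviews into chosen/rejected pairs, which is the shape of data reward models are commonly trained on.

```python
from collections import defaultdict
from itertools import combinations


def build_preference_pairs(records):
    """Turn reviewed LLM outputs into (chosen, rejected) preference pairs.

    Assumes each record is a dict with hypothetical keys:
    "prompt" (str), "output" (str), and "score" (a number from the reviewer).
    """
    # Group outputs that answer the same prompt, since preferences
    # are only meaningful between responses to the same input.
    by_prompt = defaultdict(list)
    for record in records:
        by_prompt[record["prompt"]].append(record)

    pairs = []
    for prompt, group in by_prompt.items():
        for a, b in combinations(group, 2):
            if a["score"] == b["score"]:
                continue  # a tie carries no preference signal
            chosen, rejected = (a, b) if a["score"] > b["score"] else (b, a)
            pairs.append(
                {
                    "prompt": prompt,
                    "chosen": chosen["output"],
                    "rejected": rejected["output"],
                }
            )
    return pairs


reviewed = [
    {"prompt": "Summarize X", "output": "draft A", "score": 1},
    {"prompt": "Summarize X", "output": "draft B", "score": 3},
    {"prompt": "Summarize X", "output": "draft C", "score": 3},
    {"prompt": "Translate Y", "output": "only one answer", "score": 2},
]
pairs = build_preference_pairs(reviewed)
```

Prompts with only one reviewed output produce no pairs here, so a dataset like that would need a different route (for example, treating the score as a direct reward signal) rather than pairwise comparison.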