Hacker News new | ask | show | jobs
by -_- 254 days ago
Yes! At https://RunRL.com we offer hosted RL fine-tuning, so all you need to provide is a dataset and reward function or environment.