Hacker News
by devxpy 1283 days ago
Would also love this to include the InstructGPT architecture with its RL reward model!