Y
Hacker News
new
|
ask
|
show
|
jobs
by
storus
200 days ago
Those don't have DPO/GRPO which arguably made some parts of RL obsolete.
2 comments
nafizh
200 days ago
check out cs 336 stanford, they cover DPO/GRPO and relevant parts needed to train LLMs.
link
storus
200 days ago
It's also covered by CS329H.
link
upbeat_general
200 days ago
I can assure you that lacking knowledge in DPO (and especially GRPO it’s just stripped down PPO) is not a dealbreaker.
link