Hacker News new | ask | show | jobs
by monkpit 1123 days ago
The title is a bit much, no?
2 comments

Yes, it violates site guidelines and should be "AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback"
Not really. Pretty much the "killer app" feature of ChatGPT is RLHP. Whether or not the current RLHP-ed Alpca really beats ChatGPT, it is pretty obvious that local LLMs can be RLHP-ed and it is only a matter of time before people realize running an RLHP-ed LLM locally is a better option than running ChatGPT with all the security concerns of running something "in the cloud" (which is just "somebody else's computer" in the famous saying).
I was referring to the HN guidelines against editorializing titles.
I’m sorry what’s RLHP? I’m not able to Kagi that
The P should be an F, it's reinforcement learning from human feedback
Reinforcement learning through human feedback.

Took me a bit of searching too.