Hacker News
by tempusalaria 270 days ago
A lot of the current code and science capabilities do not come from next-token prediction (NTP) training.

Indeed, it seems that most language model RL does not even use process supervision, so it is a long way from NTP.