Hacker News new | ask | show | jobs
by gwern 60 days ago
It is definitely not the first codebase an extensively RL-trained Claude has ever analyzed. How do you think it got so good?
1 comments

Meaning it has no episodic memory of any of those analyses that it has done.
You didn't say anything about 'episodic' and that's irrelevant to the point even if its long-term memory from training didn't count.