Sometimes it seems really accurate (like the cherry-picked GIF in the overview docs) and sometimes really off.
I think for the most part, it knows more than it lets on, but finding the right sampling methods (or better yet, generalized search) to generate the best comments is a tough problem because it's difficult to evaluate quality.
I think for the most part, it knows more than it lets on, but finding the right sampling methods (or better yet, generalized search) to generate the best comments is a tough problem because it's difficult to evaluate quality.
There's some info on the sampling methods here: https://chrisbutner.github.io/ChessCoach/high-level-explanat...