Hacker News new | ask | show | jobs
by PeterStuer 586 days ago
That was also my first thought. The discrepancy is just too large to be the mere result of a transformer model fed more chess data.