Hacker News new | ask | show | jobs
by Veedrac 2124 days ago
GPT-3's failure at larger addition sizes is almost fully due to BPE, which is incredibly pathological (392 is a ‘digit’, 393 is not; GPT-3 is also never told about the BPE scheme). When using commas, GPT-3 does OK at larger sizes. Not perfect, but certainly better than should be expected of it, given how bad BPEs are.

http://gptprompts.wikidot.com/logic:math

1 comments

My thinking there wasn't because of BPEs, I think it's a graph traversal issue.