| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Veedrac 2124 days ago
	GPT-3's failure at larger addition sizes is almost fully due to BPE, which is incredibly pathological (392 is a ‘digit’, 393 is not; GPT-3 is also never told about the BPE scheme). When using commas, GPT-3 does OK at larger sizes. Not perfect, but certainly better than should be expected of it, given how bad BPEs are. http://gptprompts.wikidot.com/logic:math

1 comments

My thinking there wasn't because of BPEs, I think it's a graph traversal issue.