Hacker News new | ask | show | jobs
by bhy 937 days ago
Like most research they likely started with a smaller model like GPT 2 or 3 and shown that they can significantly boost the performance to the extent of solving grade school math.