Hacker News
by minimaxir 1842 days ago
That's the one thing that's very consistent across Transformer models, even GPT-3.