by Tarean 2164 days ago
The transformer model in GPT-3 has a short context window (2,048 tokens) and no recurrence. Without significant architectural changes, that is a fundamental limit on the problems GPT-3 can solve.
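To make the limit concrete, here is a rough sketch of what a fixed window with no recurrence implies. It is not GPT-3 code: `visible_context` and `history` are made-up names for illustration, and the only real number in it is GPT-3's 2,048-token context length.

```python
# Illustrative stand-in for a fixed-context, non-recurrent model.
# Nothing here is taken from GPT-3's implementation.
CONTEXT_WINDOW = 2048  # GPT-3's context length, in tokens


def visible_context(tokens, window=CONTEXT_WINDOW):
    """Return the tokens the model can attend to for the next prediction.

    Assumes window >= 1. Anything before the last `window` tokens is
    dropped, and with no recurrent state nothing about it is carried over.
    """
    return tokens[-window:]


history = list(range(5000))       # hypothetical token ids 0..4999
seen = visible_context(history)
print(len(seen), seen[0])         # 2048 2952
```

The first 2,952 tokens of `history` have no influence on the next prediction, so any problem that depends on them is out of reach unless the relevant information is somehow restated inside the window.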