Y
Hacker News
new
|
ask
|
show
|
jobs
by
Tarean
2164 days ago
The transformer model in GPT-3 has a short context window and no recurrence. Without some significant architecture changes that is a fundamental limit on the problems GPT-3 can solve.
1 comments
lucidrains
2164 days ago
https://arxiv.org/abs/1807.03819
link