|
|
|
|
|
by nutanc
2006 days ago
|
|
Empirical success shows that the GPT-3 model has seen the sequence before(maybe many times). Transformer architectures do map sequences to sequences. What is not known is that the task of programming is a sequence problem. This experiment seems to suggest that maybe its not a sequence problem. |
|