|
|
|
|
|
by yorwba
3213 days ago
|
|
The Transformer network is solving a different problem: translating a given sentence into another with the same meaning. The problem discussed here is about completing the next word in a partial sentence, where AFAIK some variety of RNN is still best. It might be possible to adapt the Transformer architecture to that task, but that would make it a different model. |
|