Hacker News new | ask | show | jobs
by yorwba 3213 days ago
The Transformer network is solving a different problem: translating a given sentence into another with the same meaning.

The problem discussed here is about completing the next word in a partial sentence, where AFAIK some variety of RNN is still best. It might be possible to adapt the Transformer architecture to that task, but that would make it a different model.