Hacker News new | ask | show | jobs
by AnotherGoodName 398 days ago
To be clear, it's the original purpose of LLMs.

The whole LLM scene today came about because context was really important to translations. The "attention is all you need" paper was by the Google Translation team as they came up with ideas to improve how to map context of words and carry them across in translations.

At some point people started asking the translation to "translate from English to English as if you're an AI assistant".

Anyway it shouldn't surprise anyone that LLMs are good at translation. The real surprise to everyone is how powerful translation engines that understood context could be!

2 comments

One distinction is the original transformer was an encoder/decoder while (most?) LLMs today are encoder only.

The translation transformer also was able to peek ahead in the context window while (most?) LLM's now only consider previous tokens.

They're usually thought as "decoder only"
Oops yes thank you, was late when I replied.
I like to think of it as if the LLM is simply translating questions into answers.