Hacker News new | ask | show | jobs
by bob1029 445 days ago
If we want to get pedantic, in context of practical transformer models it has never been text to text. It has always been tokens to tokens. The tokens can represent anything. "multimodal" is a marketing term.