Hacker News
by thorum 831 days ago
It’s impressive how well the T5 family of models has aged, even compared to newer LLM architectures.
1 comment

encoder-decoder vs decoder-only