Y
Hacker News
new
|
ask
|
show
|
jobs
by
thorum
831 days ago
It’s impressive how well the T5 family of models has aged, even compared to newer LLM architectures.
1 comments
htrp
831 days ago
encoder decoder vs decoder only
link