Hacker News new | ask | show | jobs
by jxy 811 days ago
FLAN-T5-XL, a 3B model, to be precise.