Hacker News new | ask | show | jobs
by rolisz 1093 days ago
Flan-T5 is a 3B model that is of comparable quality to Llama 13B.

Moreover, you can fine-tune model for your specific tasks and you need fewer resources to fine tune a smaller model.