by jncraton 1095 days ago
That's correct. The current base model is an int8 quantization of LaMini-Flan-T5-248M, which is described here:

https://github.com/mbzuai-nlp/lamini-lm
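
For anyone unfamiliar with the term, here's a rough sketch of what int8 quantization means in general. It's a generic symmetric per-tensor scheme in plain Python, not necessarily what the actual model conversion does, and the function names are made up for illustration:

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization (illustrative, hypothetical helper).

    Maps floats to integers in [-127, 127] using a single scale factor.
    """
    max_abs = max(abs(w) for w in weights)
    # Avoid dividing by zero when every weight is 0.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float weights from int8 values and the scale."""
    return [q * scale for q in quantized]


weights = [0.5, -1.0, 0.25, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
```

Each weight is stored in one byte instead of four (for float32), at the cost of a rounding error of at most about half the scale per weight.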

I shared more details over on Reddit:

https://www.reddit.com/r/LocalLLaMA/comments/14btk3a/explore...