by verdverm
151 days ago
Flash is not a small model; it's still over 1T parameters. It's a hyper MoE, aiui. I have yet to go back to small models: I'm waiting for the upstream feature, and the GPU provider has been seeing capacity issues, so I am sticking with the Gemini family for now.