|
|
|
|
|
by brucethemoose2
929 days ago
|
|
Its presumably llama just like Mistral. Everything open source is llama now. Facebook all but standardized the architecture. I dunno about the moe. Is there existing transformers code for that part? It kinda looks like there is based on the config. |
|
https://github.com/mistralai/mistral-src