by sp332 930 days ago
I don't see any code in there. What runtime could load these weights?

It's presumably llama, just like Mistral.

Everything open source is llama now. Facebook all but standardized the architecture.

I dunno about the MoE part. Is there existing transformers code for it? It kinda looks like there is, based on the config.
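
For anyone who wants to check the config themselves, here is a rough sketch of what "based on the config" could look like in practice: walk the JSON and list any keys that mention experts or MoE. The key names in the example dict are made up for illustration and are not copied from the released file; the helper name `find_moe_hints` is hypothetical too.

```python
import json


def find_moe_hints(config: dict) -> dict:
    """Collect entries whose key mentions 'moe' or 'expert' (case-insensitive).

    Nested dicts are searched too; nested keys are reported as dotted paths.
    """
    hints = {}

    def walk(node: dict, prefix: str = "") -> None:
        for key, value in node.items():
            path = f"{prefix}{key}"
            lowered = key.lower()
            if "moe" in lowered or "expert" in lowered:
                hints[path] = value
            if isinstance(value, dict):
                walk(value, prefix=f"{path}.")

    walk(config)
    return hints


# Hypothetical config contents, for illustration only.
example = json.loads(
    '{"dim": 4096, "n_layers": 32, '
    '"moe": {"num_experts": 8, "num_experts_per_tok": 2}}'
)
print(find_moe_hints(example))
```

If this returns anything, the runtime needs some routing/expert code beyond a plain dense llama-style model. An empty result only means no key matched those two substrings, so it doesn't prove the model is dense.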

Mistral is not llama architecture.

https://github.com/mistralai/mistral-src

It's basically llama architecture, all but drop-in compatible with llama runtimes.
Because it's JSON? :)