by Deathmax 394 days ago
It's not a 4B-parameter model. The E4B variant has 7B parameters, with 4B loaded into memory when the per-layer embeddings are cached to fast storage and vision and audio support are left out.
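To make the arithmetic concrete, here is a rough sketch that takes the rounded figures in this comment at face value (7B total, 4B in memory). The function names and the 2-bytes-per-parameter precision are assumptions for illustration, not anything stated by the model's authors.

```python
# Rounded figures quoted in the comment above; not official numbers.
TOTAL_PARAMS_B = 7.0      # E4B parameters, in billions
IN_MEMORY_PARAMS_B = 4.0  # parameters kept in memory, in billions


def offloaded_params_b(total_b: float, in_memory_b: float) -> float:
    """Parameters (in billions) implied to sit in fast storage instead of memory."""
    if in_memory_b > total_b:
        raise ValueError("in-memory count cannot exceed total")
    return total_b - in_memory_b


def weight_size_gb(params_b: float, bytes_per_param: float) -> float:
    """Approximate weight size in GB (1e9 bytes) at a given precision."""
    return params_b * bytes_per_param


offloaded = offloaded_params_b(TOTAL_PARAMS_B, IN_MEMORY_PARAMS_B)
print(f"implied offloaded parameters: {offloaded}B")

# Assumption: 16-bit weights (2 bytes per parameter); quantized builds would be smaller.
print(f"in-memory weights: {weight_size_gb(IN_MEMORY_PARAMS_B, 2.0)} GB")
print(f"all weights:       {weight_size_gb(TOTAL_PARAMS_B, 2.0)} GB")
```

With these inputs, roughly 3B parameters would be the share served from storage, which is the gap between the "4B" in the name and the total.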

The link says E2B and E4B have 4B and 8B raw parameters. Where do you see 7B?
There's a 7B mentioned in the chat arena Elo graph, but I don't see any other references to it.
Hi! The model is 8B if you also load the vision and audio components. We just used the text model in LMArena.
Are vision and audio components available yet?