Hacker News new | ask | show | jobs
by viraptor 859 days ago
> for releasing new models is extremely low-information

To be fair, this is not a release. This was the previous release https://mistral.ai/news/mixtral-of-experts/

It looks more like not trying very hard to hide things until release, rather than being a black box.

1 comments

If this were the first incident like this I would agree, but they very intentionally dropped the magnet link for Mixtral on Twitter with no further context. That leaves me wondering if this was also a weird on purpose thing rather than just them being casual.
Does it matter? You know that if you really want to play with things early, you may get an opportunity. And if you want to read more details, you'll get an announcement too. What's the problem with it being either on purpose or casual?
It's an observation, not a complaint. It does leave very little to go on for an HN discussion, though, besides a meta conversation like this one.
Well, what could they say? Given the lack of transparency on the data it well could be:

“We’ve trained LLaMA MoE on a lot of GPT4 data. And this it is not as good as GPT4. And this is our blob, so we can release it under any license. If someone is silly enough to use what this blob generates, this is not our problem.”