	by angilly 806 days ago
	The lack of a corresponding announcement on their blog makes me worry about a Twitter account compromise and a malicious model. Any way to verify it’s really from them?

3 comments

simonw 806 days ago

Their https://twitter.com/MistralAI account has 5 tweets since the account opened, three of which were model release magnet links.

https://twitter.com/MistralAILabs is their other Twitter account, which is very slightly more useful though still very low traffic.

swyx 806 days ago

You must be new to Mistral releases. They invented the magnet-first, blog-later meta.

angilly 806 days ago

At 3:30 a.m. local time in France? Alrighty. I'll still wait a lil bit ;)

moralestapia 806 days ago

What could a malicious model do, though? Curse at you?

Teever 806 days ago

https://arstechnica.com/security/2024/03/hugging-face-the-gi...

Tiberium 806 days ago

Not .safetensors though

Aissen 806 days ago

Exploit a memory safety issue in the tokenizer or in other parts of your LLM infra written in a native language.

moralestapia 806 days ago

??? With weights?

fzzzy 806 days ago

There was a buffer overflow or some other exploit like that in llama.cpp and the gguf format. It has been fixed now, but it's definitely possible. Also, weights distributed as Python pickles can run arbitrary code.
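
To make the pickle point concrete, here is a minimal, deliberately harmless sketch. The `Payload` class and the expression it evaluates are made up for illustration; a real malicious file could name any importable callable instead of `eval` with arithmetic.

```python
import pickle


class Payload:
    """Hypothetical stand-in for an object hidden inside a 'weights' pickle."""

    def __reduce__(self):
        # pickle stores "call this callable with these args" and performs
        # the call at load time. Here the call is a harmless eval of arithmetic.
        return (eval, ("6 * 7",))


blob = pickle.dumps(Payload())

# Loading the bytes runs eval("6 * 7"); no Payload object is ever rebuilt.
result = pickle.loads(blob)
print(type(result).__name__, result)
```

The loader never has to call a method on the loaded object: deserializing the bytes is enough to trigger the call, which is why untrusted pickle-based checkpoints are risky to open at all.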

abound 806 days ago

There are plenty of exploits where the payload is just "data" read by some vulnerable program (PDF readers, image viewers, browsers, compression tools, messaging apps, etc)

sp332 806 days ago

Yes, there's a reason weights are now distributed as "safetensors" files. Malicious weights files in the old formats are possible, and while I haven't seen evidence of the new format being exploitable, I wouldn't be surprised if someone figures out how to do it eventually.
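
For a sense of why the format is harder to abuse, here is a rough sketch of the safetensors layout as I understand it from the published format description: an 8-byte little-endian header length, a JSON header, then raw tensor bytes. The helper names `build_safetensors` and `read_header` and the size cap are my own assumptions, not the library's API, and the real library performs more validation than this.

```python
import json
import struct


def build_safetensors(tensors):
    """Pack {name: (dtype, shape, raw_bytes)} into the assumed layout."""
    header, payload, offset = {}, b"", 0
    for name, (dtype, shape, raw) in tensors.items():
        header[name] = {
            "dtype": dtype,
            "shape": shape,
            "data_offsets": [offset, offset + len(raw)],
        }
        payload += raw
        offset += len(raw)
    header_bytes = json.dumps(header).encode("utf-8")
    return struct.pack("<Q", len(header_bytes)) + header_bytes + payload


def read_header(blob, max_header_bytes=100_000_000):
    """Parse and bounds-check the header without executing anything."""
    if len(blob) < 8:
        raise ValueError("file too short for a header length")
    (n,) = struct.unpack("<Q", blob[:8])
    # The cap is an arbitrary limit chosen for this sketch.
    if n > max_header_bytes or 8 + n > len(blob):
        raise ValueError("header length out of bounds")
    header = json.loads(blob[8:8 + n].decode("utf-8"))
    data_len = len(blob) - 8 - n
    for name, info in header.items():
        if name == "__metadata__":  # optional free-form string metadata
            continue
        start, end = info["data_offsets"]
        if not (0 <= start <= end <= data_len):
            raise ValueError(f"tensor {name!r} points outside the data section")
    return header


raw = struct.pack("<2f", 1.0, 2.0)  # two float32 values, 8 bytes
blob = build_safetensors({"w": ("F32", [2], raw)})
header = read_header(blob)
print(header["w"])
```

Reading the file is just integer parsing, JSON decoding, and bounds checks on byte offsets, so there is no step where the file gets to name code to run. Bugs in a parser are still possible, which matches the caution above.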

llm_trw 806 days ago

This is how they released every model so far.
