| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by jwitthuhn 613 days ago

For anyone else looking for the weights which as far as I can tell are not linked in the article:

Base model: https://huggingface.co/Zyphra/Zamba2-7B

Instruct tuned: https://huggingface.co/Zyphra/Zamba2-7B-Instruct

1 comments

keyle 613 days ago

I couldn't find any gguf files yet. Looking forward to trying it out when they're available.

link

kristianp 613 days ago

It seems that zamba 2 isn't supported yet, the previous model's issue is here:

Feature Request: Support Zyphra/Zamba2-2.7B #8795

Open tomasmcm opened this issue on Jul 31 · 1 comment

https://github.com/ggerganov/llama.cpp/issues/8795

link

alchemist1e9 613 days ago

What can be used to run it? I had imagined Mamba based models need a different interference code/software than the other models.

link

gbickford 613 days ago

If you look in the `config.json`[1] it shows `Zamba2ForCausalLM`. You can use a version of the transformers library to do inference that supports that.

The model card states that you have to use their fork of transformers.[2]

1. https://huggingface.co/Zyphra/Zamba2-7B-Instruct/blob/main/c...

2. https://huggingface.co/Zyphra/Zamba2-7B-Instruct#prerequisit...

link

hidelooktropic 613 days ago

To run gguf files? LM Studio for one. I think recurse on macos as well and probably some others.

link

x_may 612 days ago

As another commenter said, this has no GGUF because it’s partially mamba based which is unsupported in llama.cpp

link

xyc 612 days ago

dev of https://recurse.chat/ here, thanks for mentioning! rn we are focusing on features like shortcuts/floating window, but will look into support this in some time. to add to the llama.cpp support discussion, it's also worth noting that llama.cpp does not yet support gpu for mamba models https://github.com/ggerganov/llama.cpp/issues/6758

link

wazoox 612 days ago

Gpt4all is a good and easy way to run gguf models.

link

Havoc 612 days ago

Mamba based stuff tends to take longer to become available

link