Hacker News new | ask | show | jobs
by Philpax 35 days ago
I regret that the projection models ended up separate, and I too would have preferred for them to be in a single file. I'm not entirely sure why that ended up happening, but it very much runs counter to the single-file ethos I had in mind when I designed GGUF.

Hoping that someone will shepherd the cause of merging the two; I think I'm too out of the loop to do it this time around :-)

3 comments

Well considering right now MTP support is being developed, there was a conversation in that that seemed to throw around the idea of separating the MTP model out of the main GGUF, like with Mmproj. This was rejected.

Which I'm happy for. So given that decision, I don't think it's unreasonable to think that they might be open to including Mmproj files in the GGUF.

Only issue I can think of is, which one? BF16, F16? Etc

Quantiser's choice, IMO. They're best-placed to decide what compromise to make for their particular model.
hi, first post. 56 year old world's foremost procedural audio programmer xoxos vst, wrote the world's first procedural lyrical engine in 1994.

just about every field has documented cancellation of my egalitarian work for the apron brethren. it would be nice if this species could explain how to use a text to image without leaving which mess of 30g worth of downloads to try.

please, just someone SAY what things i need just once simply without going "you need 5G 5G 5G 5G 5G 5G"

your species doesn't work since the Emm Kay heterodyning from orbit. since rely, natural kinda ears to west papua FOR A REASON

Currently not many people would finetune mmproj, so mmproj is reusable. The mmproj for Qwen 3.6 27B can be reused on all its finetunes. While the MTP model usually needs to be finetuned with the main model to get the best performance, which is being studied in Heretic.