Hacker News new | ask | show | jobs
by bevekspldnw 805 days ago
There is a user called The Bloke on hugging face- they release pre quantized models pretty soon after the full size drop. Just watch their page and pray you can fit the 4 bit in your GPU.

I’m sure they are already working on it.

2 comments

TheBloke stopped uploading in January. There are others that have stepped up though.
Oh really? Who else should I be looking at?

That person is a hero, super bummed!

TheBloke's grant ran out.
I think 4b for this is support to be over 70GB, so definitely still heavy hardware.
Fucking hell, my A6000 is shy of that and I can’t reasonably justify picking up a second.