| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by hypfer 2 days ago

The same 24GB VRAM RTX 4090 I bought to play Cyberpunk 2077 with.

Works perfectly fine in llama.cpp throwing 70+t/s at me with 128k q8 K/V context when using the IQ4_NL quant + MTP at q4 MTP K/V.

Also leaving this here because you might find it useful: https://hypfer.github.io/will-it-fit-llama-cpp/

3 comments

indoordin0saur 2 days ago

Nice! Do you do anything with that compute when you're not actively using it? Is the crypto-mining hobby still worth it? I've also wondered if such expensive hardware can be rented back out to offset cost. Looks like these cards are going for as much as $4k nowadays.

link

all2 2 days ago

There are services where you can hook your card up and rent it out to other users. I don't know what any of them are called, but they do exist.

link

dghlsakjg 2 days ago

Salad.com is one. (I’m unaffiliated, just happened to come across it this week while looking for a cheap option)

link

hypfer 2 days ago

I've paid ~2k€ in 2023. Since I'm usually sitting next to it, I'm only using it when I want to use it. It can get quite loud and warm.

Crypto (to my knowledge at least) moved away from GPU mining. I guess you could maybe rent out GPU compute, but - being in germany - it's not worth the legal hassle. You could of course always commit tax fraud, though I wouldn't recommend that.

link

esseph 2 days ago

> I've also wondered if such expensive hardware can be rented back out to offset cost.

Massive legal liability. Not worth it.

link

Rzor 2 days ago

Can you fix MTP-GEMMA-4-26B-A4B-IT? It says the weights are 0.5 GB in size.

edit: nvm, I'm confusing models.

link

cdelsolar 2 days ago

What did you call me?

link