| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by echelon 3 days ago

This is why we need open weights for everything.

Nobody will cry when their AI girlfriend model gets revoked. You'll always have the weights.

Presumably for the low cost of spinning up an H200 or two you can use the weights forever.

No more claiming your LLM gets nerfed. No more claiming your video model can't do Spider-Man anymore.

6 comments

exabrial 3 days ago

I think my main concern was productivity, but tell me more about this AI Girlfriend

link

lucisferre 3 days ago

Darling, we'll always have W_q, W_k, W_v, and W_o.

link

rabbitlord 3 days ago

H200 is not cheap, and I don't think you can run DeepSeek with full weight without any quantization on even two of them.

Although open weights in theory are good, especially for developers and market competition, it is not as wonderful as you thought.

link

paulcole 3 days ago

> Nobody will cry when their AI girlfriend model gets revoked

These are the people who cry the loudest and there’s not a close second. They have infinite time to whine online (see /r/chatgpt after 4.5 went away).

link

tsss 3 days ago

These models are far too expensive to run yourself and independent LLM providers of open models do even more secret nerfing than the original creators because they have no reputation to lose.

link

flumes_whims_ 3 days ago

It's not just the weights. It is the system prompt, harness, safety filters, etc. Those can affect performance of the same underlying model significantly.

link