Hacker News new | ask | show | jobs
by echelon 3 days ago
This is why we need open weights for everything.

Nobody will cry when their AI girlfriend model gets revoked. You'll always have the weights.

Presumably for the low cost of spinning up an H200 or two you can use the weights forever.

No more claiming your LLM gets nerfed. No more claiming your video model can't do Spider-Man anymore.

6 comments

I think my main concern was productivity, but tell me more about this AI Girlfriend
Darling, we'll always have W_q, W_k, W_v, and W_o.
H200 is not cheap, and I don't think you can run DeepSeek with full weight without any quantization on even two of them.

Although open weights in theory are good, especially for developers and market competition, it is not as wonderful as you thought.

> Nobody will cry when their AI girlfriend model gets revoked

These are the people who cry the loudest and there’s not a close second. They have infinite time to whine online (see /r/chatgpt after 4.5 went away).

These models are far too expensive to run yourself and independent LLM providers of open models do even more secret nerfing than the original creators because they have no reputation to lose.
It's not just the weights. It is the system prompt, harness, safety filters, etc. Those can affect performance of the same underlying model significantly.