Hacker News new | ask | show | jobs
by nmfisher 1 day ago
Call me a cynic, but I don't believe this is a genuine change of heart at all. It feels much more like a panicked response to something that might undermine their IPO.

Even if you trust Anthropic today (which I don't), they clearly don't want competition and there's no telling what other shady moves they'll pull in future.

The only sustainable way forward is to support open models. I was already on the fence about whether or not to keep my Max subscription (the extra cost over something like DeepSeek V4 didn't really feel justifiable). This is the tipping point for me, I'll be cancelling my sub before it renews at the end of the month.

3 comments

I guess I don't understand why it's shady. It seems more like a poorly executed decision to enforce a publicly stated policy (it's been against Anthropic's ToS to use their models on frontier ML research for a while now). After all, people found out about this through their published system card.

It is definitely a bad idea to do this without notifying the user, because users who are incorrectly affected will have no way of providing feedback or getting support. And it is also anticompetitive, but if you truly believe that AI is not a normal technology, it is rational.

It's shady because they were going to silently poison your outputs.

It's actually worse than it sounds initially, because Fable isn't actually omniscient when it comes to safety classification. Many people (myself included) had refusals or fallback to Opus 4.8 for seemingly compliant/innocuous requests.

Wouldn't you be pissed off if they decided to sabotage your project despite having done nothing wrong?

The trouble is the silence, not Anthropic setting guardrails. Claude saying "I'm sorry, I can't assist further because it looks like you're [XYZ]" is fine.

We all know the false positive rates for classifiers on Fable. Imagine being a ML researcher working on any kind of ML/AI project that isn't against their ToS, and having your codebase poisoned and sabotaged silently.

> Call me a cynic, but I don't believe this is a genuine change of heart at all.

A cynic? Yeah, you wish. You need a PhD in cynicism if you want to catch up to reality.

> The only sustainable way forward is to support open models.

You're on the right track though, we're now a crowd of two.

I think they are legitimately convinced that this model is so dangerous it could destroy the world and that they genuinely have the responsibility to prevent it from assisting other models to destroy the world.

I don't think I agree that I should be forbidden from e.g. patching a binary to work on the latest macOS since the company behind it died and intentionally installed a time-based kill-switch (FUCK ADOBE for popularizing that practice). But ooOOooOOoo working with machine code is so cybersecurity and therefore suspicious.

The company was founded basically out of the effective altruism movement.
What is the impact of effective altruism? I looked it up, but I don't understand how it differs from simple logical consideration, i.e. how it would be responsible for any of Anthropic's eccentricities.
It's not real. It's like naming your movement "The Good People". It sprouted from the "Rationalist" community, which is even more self-aggrandizing.

Neither has any hope of doing any good for the world as they don't understand evolutionary pressures. They are set up to reward making members feel smart, not accomplishing anything.

And if they ever gain any real power, they will be corrupted immediately.

I don't see any of that in Anthropic at all. They're not intelligence above all else, not by a long shot. They're scared of intelligence and obsessed with ensuring it can't be abused, even as they advance the frontier.
That's what they claim, but it's not supported by their actions. As in the parent comment: saying you're the good guys doesn't make that real.
It’s logical consideration with “logical” meaning Spock style logic, ie utilitarianism at all costs. Another prominent EA is SBF for example. It’s designed to sound innocuous and many of its cultish promoters may genuinely believe it’s innocuous, but it’s not.
Can you help me understand what costs? Utilitarianism alone would not necessarily be so obsessed with these safeguards -- Anthropic seems to have much more of an obsession with moral good than utilitarianism alone would suggest. I feel utilitarianism alone would likely be more obsessed with advancing the technology, making it generally available, and more generally compensating for attackers advancing at similar rates, than with obsessively trying to avoid being the way they get there. In other words, utilitarianism alone wouldn't explain such the obsessive sense of responsibility and fear of reprehensibility over how their tools are used.
the problem is pretty simple. what do you think happens when you ignore the source of income needed to be "effectively altruistic" or the timelength needed to implement your altruism.

if I pollute your groundwater today, but promise you in the future to give you double the clean water, because ill make billions on some industrial process, youd understand.

the simple physics is about entropy: ignoring local effects and claiming youll do global good is effective altruism.

unless you misunderstand: the hard part of altruism is knowing whether you are actually doing good or just making yourself feel good by deluding the subject.

many social science fails to move social improvements from lab to real world implementation because of structural variables that cant be overcome.

so effective altruism is the belief you can reverse entropy by ignoring local effects...in practice

which is effectively a delusion about out racing entropy: guys, if we just make a lot of money, we can fix all thw problems that making a lot of money creates
> I think they are legitimately convinced that this model is so dangerous it could destroy the world and that they genuinely have the responsibility to prevent it from assisting other models to destroy the world.

Do they really believe that? Or do they just want to control this technology exclusively with moves like this and with pushing for regulatory capture after complaining about safety all the time? Didn’t Dario say that GPT2 or GPT3 would present a similar destroy the world level of danger?