Just a few days ago, on Friday, my 15 year old son had his Claude account suspended with a demand for ID to prove he is 18 or older. He had his own Claude Max subscription (he out-earns me fairly frequently in his circle of gaming programmers), and was unaware Anthropic had a must-be-18 rule, as was I. Their email said "Our team found signals that your account was used by a child. This breaks our rules, so we paused your access to Claude." So I guess if you ever ask a question that seems to originate from a teen or less, expect to hit an ID gate.
So now he's a Codex user. OpenAI and Google both have a minimum age of 13.
EDIT: I should note that Anthropic gave him a refund for the whole month that was underway, despite him being nearing the end of it. So good on them.
Who said anything about "vibe coding"? Using coding tools like Claude Code as just another tool in the belt is something the overwhelming bulk of professional devs do now (and given that my son managed to find a number of clients paying for his work, he qualifies as professional). Pejorative "vibe coding" nonsense doesn't change this.
Call it whatever you want then, I'm still interested in the question at hand – your son makes more money than you do professionally, using Claude Code to make something video game related?
He makes solutions for people and they pay him money for doing so. I mean...pretty much exactly how we all operate? He's excellent at networking and has built an enormous connection tree.
Yeah. I do not get the 18-years-old age gate. It's not like they're protecting anyone. AI is available so freely now anyone who wants it can get it.
Anthropic made the best models by hiring non-technical folks like philosophers to build the best training sets and evaluations. Now, it seems like their philosophers are telling people how they can and can't use their model.
Liability. OpenAI have had several court cases now I believe where children killed themselves after interacting with ChatGPT. Less liability if the user is an adult.
People have tried to run Qwen3-235B-A22B-Thinking-2507 on 4x $600 used, Nvidia 3090s with 24 GB of VRAM each (96 GB total), and while it runs, it is too slow for production grade (<8 tokens/second). So we're already at $2400 before you've purchased system memory and CPU; and it is too slow for a "Sonnet equivalent" setup yet...
You can quantize it of course, but if the idea is "as close to Sonnet as possible," then while quantized models are objectively more efficient they are sacrificing precision for it.
So next step is to up that speed, so we're at 4x $1300, Nvidia 5090s with 32 GB of VRAM each (128 GB), or $5,200 before RAM/CPU/etc. All of this additional cost to increase your tokens/second without lobotomizing the model. This still may not be enough.
I guess my point is: You see this conversation a LOT online. "Qwen3 can be near Sonnet!" but then when asked how, instead of giving you an answer for the true "near Sonnet" model per benchmarks, they suddenly start talking about a substantially inferior Qwen3 model that is cheap to run at home (e.g. 27B/30B quantized down to Q4/Q5).
The local models absolutely DO exist that are "near Sonnet." The hardware to actually run them is the bottleneck, and it is a HUGE financial/practical bottleneck. If you had a $10K all-in budget, it isn't actually insane for this class of model, and the sky really is the limit (again to reduce quantization and or increase tokens/second).
PS - And electricity costs are non-trivial for 4x 3090s or 4x 5090s.
Qwen3.5-35B-A3B is reported to perform slightly better than the model you mentioned.
It runs fine but non-optimal on a single 3090 with even 131072 tokens of context , and due to the hybrid attention architecture, the memory usage and compute scale rather less drastically than ctx^2. I've had friends with smaller cards still getting work out of it. Generation is at around 20 tokens/sec on that 3090 (without doing anything special yet) . You'll need enough DRAM to hold the bits of the model that don't fit. Nothing to write home about, but genuinely usable in a pinch or for tasks that don't need immediate interactivity.
It's the first local model that passes my personal kimbench usability benchmark at least. Just be aware that it is extremely verbose in thinking mode. Seems to be a qwen thing.
(edit: On rechecking my numbers; I now realize I can possibly optimize this a lot better)
> The hardware to actually run them is the bottleneck, and it is a HUGE financial/practical bottleneck.
That's unsurprising, seeing as inference for agentic coding is extremely context- and token-intensive compared to general chat. Especially if you want it to be fast enough for a real-time response, as opposed to just running coding tasks overnight in a batch and checking the results as they arrive. Maybe we should go back to viewing "coding" as a batch task, where you submit a "job" to be queued for the big iron and wait for the results.
A machine with 128GB of unified system RAM will run reasonable-fidelity quantizations (4-bit or more).
If you ever want to answer this type of question yourself, you can look at the size of the model files. Loading a model usually uses an amount of RAM around the size it occupies on disk, plus a few gigabytes for the context window.
Qwen3.5-122B-A10B is 120GB. Quantized to 4 bits it is ~70GB. You can run a 70GB model in 80GB of VRAM or 128GB of unified normal RAM.
Systems with that capability cost a small number of thousand USD to purchase new.
If you are willing to sacrifice some performance, you can take advantage of the model being a mixture-of-experts and use disk space to get by with less RAM/VRAM, but inference speed will suffer.
I thought it was true too, for a couple of months. Then the honeymoon phase ended and now I only use Claude to write commit message drafts (which I rewrite myself) and review PRs.
It seems out of step and foolish, and the cynic in me says that Anthropic has a side hustle of identity harvesting and is looking for justifications, but on the flip side, there is a real risk of pearl clutching if a child ever uses AI, and maybe Anthropic just wants to steer clear of all of that. Though simply putting it in the ToS should be sufficient legal shielding, and the idea that they're chat harvesting to age fingerprint conversations seems dubious.
An equally valid question is "does the company you use for identify verification follow the same commitments with regards user privacy and selling/processing of user data as Anthropic itself?".
And the answer to that question is:
"Hell no! We used the cheapest, shadiest company we could find for that. They'll process and sell all your data. Thank you for continuing to be a valued Anthropic customer!".
* preventing North Korea, China, Russian, Iran and etc. actors from accessing service. They absolutely use workarounds to access AI, e.g. I bet there are companies who are proxy between Anthropic and those countries.
I imagine there will be quite some false positives while identifying those.
This will do absolutely nothing to prevent those actors from accessing Claude... they already recruit young unemployed Americans to do proxy job interviews[0][1], etc. They'll just pay young unemployed Americans to do verification for them.
That sounds likely to increase their costs and create new opportunities to get caught. Not a silver bullet but not "absolutely nothing". Like how anti-money laundering laws don't wipe out all crime, but are still worthwhile.
If the API costs are gonna be thousands, or the subscription will be $20/month, is it really that expensive to pay some guy on Discord a $50 gift card to verify the account as a one-time setup? Better yet, we'll probably start seeing fake porn websites and other phishing sites that ask to verify your age but end up proxy verifying a bunch of these services in an automated manner with minimal costs, and you'll be able to buy verified Claude accounts for tens of cents on account marketplaces. Just as you have been able to buy verified Discord accounts, aged Steam accounts, etc...
They request ID for bans so that they can ban you personally. ID checks may as well be a sign that you've already been banned and they're fishing for ways to make the ban harder to evade. Venmo does the same thing.
Maybe Anthropic just likes creating a market for dark identities. Because that's the most likely effect of such stupidity; generating more ID theft victims with no change to services to criminals.
Is a "dark identity" one that's never been shared with an identity-theft-as-a-service? Or is it just of one that's (supposed to be) privacy-conscious (and wouldn't otherwise have been an easy victim)?
> ID checks may as well be a sign that you've already been banned and they're fishing for ways to make the ban harder to evade.
So identity verification is basically a canary that your account is about to get banned, or is on the chopping block. At that point you're better off abandoning ship rather than handing over your ID.
Basically exactly my point. If you could use the service without ID verification, and others can still use the service without ID verification, but you've been blocked because you haven't handed over your ID, then leave or start a new account. That is if you're averse to being banned personally. If you don't mind the risk then you can verify ID and prepare to jump ship if it's a ban.
Wouldn't the reasons for requesting identification be the same those for banning people - the system has flagged that you might be from the wrong location/under 18/creating multiple free acounts etc - so is validating.
So now he's a Codex user. OpenAI and Google both have a minimum age of 13.
EDIT: I should note that Anthropic gave him a refund for the whole month that was underway, despite him being nearing the end of it. So good on them.