| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Tepix 124 days ago
	It's insane how much traffic HF must be pushing out of the door. I routinely download models that are hundreds of gigabytes in size from them. A fantastic service to the sovererign AI community.

3 comments

razster 123 days ago

My fear is that these large "AI" companies will lobby to have these open source options removed or banned, growing concern. I'm not sure how else to explain how much I enjoy using what HF provides, I religiously browse their site for new and exciting models to try.

culi 123 days ago

ModelScope is the Chinese equivalent of Hugging Face and a good back up. All the open models are Chinese anyways

thot_experiment 123 days ago

Not true! Mistral is really really good, but I agree that there isn't a single decent open model from the USA.

culi 123 days ago

Mistral is cool and I wish them success but it consistently ranks extremely low on benchmarks while still being expensive. Chinese models like DeepSeek might rank almost as low as Mistral but they are significantly cheaper. And Kimi is the best of both worlds with incredible benchmark results while still being incredibly cheap

I know things change rapidly so I'm not counting them out quite yet but I don't see them as a serious contender currently

thot_experiment 123 days ago

Sure, benchmarks are fake and I use Mistral over equivalently sized models most of the time because it's better in real life. It runs plenty fast for me, I don't pay for inference.

BoredomIsFun 123 days ago

> it consistently ranks extremely low on benchmarks

As general purpose chatbots small Mistral models are better than comparably sized Chiniese models, as they have better SimpleQA scores and general knowledge of Western culture.

seanmcdirmid 123 days ago

It’s really hard to beat qwen coder, especially for role play where the instruction following is really useful. I don’t think their corpus is lacking in western knowledge, although I wonder if Chinese users get even better results from it?

Eupolemos 123 days ago

Why are you talking price when we are talking local AI?

That doesn't make any sense to me. Am I missing something?

dirasieb 123 days ago

15 missed calls from your local power company

culi 123 days ago

Your electricity is free?

ac29 121 days ago

Arcee is working on that, see a blog post about their newest in progress model here: https://www.arcee.ai/blog/trinity-large

Its still not fully post trained and its a non-reasoning model, but its worth keeping an eye on if you dont want to use the Chinese models that currently are the best open-weight options.

CamperBob2 123 days ago

To be fair there are lots of worse models than OpenAI's GPT-OSS-120b. It's not a standout when positioned next to the latest releases from China, but prior to the current wave it was considered one of the stronger local models you can reasonably run.

throwaway27448 123 days ago

They can try. I don't think they'll be able to get the toothpaste back in the tube. The data will just move our of the country.

seanmcdirmid 123 days ago

Many of the models on hugging face are already Chinese. It’s kind of obvious that local AI is going to flourish more in China than the USA due to hardware constraints.

dotancohen 123 days ago

How do you choose which models to try for which workflows? Do you have objective tests that you run, or do you just get a feel for them while using them in your daily workflow?

toofy 123 days ago

it’s only a matter of time. we have all seen first hand how … wrong … these companies behave, almost on a regular basis.

there’s a small tinfoil hat part of me that suspects part of their obscene investments and cornering the hardware market is driven by an conscious attempt to stop open source local from taking off. they want it all, the money, the control, and to be the only source of information to us.

Onavo 123 days ago

Bandwidth is not that expensive. The Big 3 clouds just want to milk customers via egress. Look at Hetzner or CloudFlare R2 if you want to get get an idea of commodity bandwidth costs.

vardalab 123 days ago

Yup, I have downloaded probably a terabyte in the last week, especially with the Step 3.5 model being released and Minimax quants. I wonder what my ISP thinks. I hope they don't cut me off. They gave me a fast lane, they better let me use it, lol

fc417fc802 123 days ago

Even fairly restrictive data caps are in the range of 6 Tb per month. P2P at a mere 100 Mb works out to 1 TiB per 24 hours.

Hypothetically my ISP will sell me unmetered 10 Gb service but I wonder if they would actually make good on their word ...

3eb7988a1663 123 days ago

I have a 1.2TB cap before you start getting charged extra, so you might need to recalibrate your restrictive level.

fc417fc802 123 days ago

Is that with a WISP by chance? Or in a developing country? Or are there really wired providers with such low caps in the western world in this day and age?

Zetaphor 123 days ago

ATT once told me if I don't pay for their TV service then my home gigabit fiber would have a 1TB cap. They had an agreement with the apartment building so I had no other choice of provider.

fc417fc802 123 days ago

Buy our off brand netflix or else we'll make it so you can't watch netflix. How is that legal?

nagaiaida 123 days ago

well it's my wired cap a stone's throw from buildings with google cloud logos on the side in a major us city, so...

zargon 123 days ago

Comcast.