| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by zinodaur 5 days ago
	Oh no, someone is profiting off of their work without proper attribution!?!?

5 comments

Aurornis 5 days ago

This is an open weights model based on other open weights models.

The dispute is that they released it with claims about having done some post training that improved the outputs. It was discovered that the model was not post trained like they claimed.

The HF page now says it’s a merge of models, which wasn’t there before. They’re trying to claim they accidentally uploaded the wrong model to HF and that they’ll upload the real one soon.

Basically, they thought they could splice two open weights models together and claim their team had accomplished some amazing post training, but they weren’t smart enough to realize that other researchers would discover that there wasn’t any post training.

moritzwarhier 5 days ago

Thanks for the factual clarification. This is so important when everyone already has their trigger finger on politics. Not meaning that politics are irrelevant here, see sister comment by jobim.

But it's impossible to form a nuanced opinion when political association has a higher priority than the facts; which, again, don't look flattering for the implementers.

iknowstuff 5 days ago

How do they just splice two models together?

Aurornis 5 days ago

The Nex N2 model they merged is based on Qwen 3.5, so you can swap pieces of one into the other. They found a combination of the two that did well on some benchmarks and shipped it.

In the early days of Llama there were a lot of experiments like this. There were even some interesting combinations of models where they stacked layers of different models together or even added more layers with interesting results.

But announcing that you spliced two models together isn't very impressive in 2026, so they announced that they had done their own post training and outdid the big labs. They thought nobody would look close enough to notice.

ninja3925 5 days ago

Out of curiosity, how was it discovered? You would have to look for it to find this linear combination.

Aurornis 5 days ago

Check the linked GitHub issue. They explain their process.

Scroll past the first issue to find it. It’s further down.

jdiff 5 days ago

Without the system prompt, asking its name results in it responding with the name of the model they're ripping from. That would certainly draw your eyes to the right places.

valleyer 5 days ago

Why is this? Do labs reinforce the model name during training? I was under the impression that this sort of "self-knowledge" always came from the system prompt, but I guess not...

jdiff 5 days ago

Yes. In this case, during fine tuning. Other blurbs are also baked in during fine tuning that are perfectly reproducible from the Nex model. The details inside the linked issue are quite accessible.

s1artibartfast 5 days ago

How do you feel about the government or government contractors saying they did a bunch of work when they did nothing instead?

carlosjobim 5 days ago

This is a pure scam on tax payer money. But what else would be expected?

hootz 5 days ago

Apparently no public money was involved.

jdiff 5 days ago

This is contrary to the mayor's words on Twitter.

> An open AI model trained in Rio with public funding over the last year by @Prefeitura_Rio surpassing all other models.

https://x.com/CavaliereRio/status/2065984620626129026

jrm4 5 days ago

Unlike the big companies who do this, which often are merely impure scams on tax payer money a little more downstream.

philipallstar 5 days ago

Companies that generate loads of corporation tax, income tax, and VAT revenue are the exact opposite of wastes of public money.

jrm4 5 days ago

Yes, when they do so proportional to what they take, especially as compared to individuals and their tax liabilities.

You'll have to let me know when that finally happens, because that ain't now.

philipallstar 5 days ago

Sorry, I've no idea how to read your first sentence.

Your second one - that's how everything public is paid for. Private individuals pay tax, either through their corporations paying corporation tax or the tax bill on top of their wage bills, which a) drives up prices of the goods and services they offer, or depresses wages, and b) funds all the public sector employees and orgs that don't pay tax (orgs) or don't pay net tax (employees).

jrm4 4 days ago

Not surprising.

The point of my first sentence is; private individuals and small businesses generally pay their fair share. Larger corporations emphatically do not.

carlosjobim 5 days ago

Great, now we're defending embezzlement and fraud with public funds on HN, because we really really hate big business.

A child caught doing something bad will cry "but my friends also did it!", is that the level of reasoning hackers want to be at?

blanched 5 days ago

That seems like a bad faith read to me. Nobody is defending it, just pointing out the irony / hypocrisy. Two things can be bad, and they can be related.

carlosjobim 5 days ago

You'd be surprised to hear then that I'm not the owner of any big company which embezzles tax payer money, and have never been involved in such.

blanched 5 days ago

I don’t follow how that makes sense as a response to what I said?

sdevonoes 5 days ago

There are no hackers around here anymore. HN is mainly about business nowadays

dmix 5 days ago

HN has always discussed business

jrm4 5 days ago

What part of that said "defense?"

They can both be bad.

lostlogin 5 days ago

> Great, now we're defending embezzlement

I might be missing something, but I don’t see anyone defending the the scams.

internet2000 5 days ago

Attribution isn't the relevant part. Lying about your lab's capabilities is.

Planktonne 5 days ago

That's also something all the AI companies have been doing.

dofm 5 days ago

Lying about model capability is right now the lingua franca of the cloud AI business model, almost; they yes-and each other's lies because they are in a position of needing to generate interest, including going as far as needing to trigger regulatory capture.

(It's not news to anyone who has worked in sales-led businesses that salespeople are prone to believing the claims of other salespeople, I guess).

selcuka 5 days ago

> Lying about model capability is right now the lingua franca of the cloud AI business model

Lying about your lab's capabilities != Lying about model capability

Exaggerating the capabilities of a new model that you've actually trained in press bulletins can be called marketing. Merging two models and claiming that you trained a new model is plain lazy.

low_tech_love 5 days ago

They’re using public money to “train” this.

vips7L 5 days ago

Sounds like the whole AI movement.

themafia 5 days ago

It seems to me like the lies are both for the same reason. To capture attention and profits that are not deserved.

functionmouse 5 days ago

leopards ate my face

outside2344 5 days ago

But the whole game is lying and stealing isn't it?

adrian_b 5 days ago

I do not see anyone lying.

The model card says:

> Post-trained from Qwen 3.5 397B

The model card also says that they use an inference framework based on "SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs" by Shi et al.:

https://arxiv.org/abs/2510.05069

So the sources seem properly attributed.

They only claim that what they did to "Qwen 3.5 397B" has improved the LLM, including, as expected, with "strong performance in Portuguese".

petu 5 days ago

That's attribution to Qwen team.

There (is/was) no attribution to Nex team (they've released a model based on Qwen 3.5 397B as well).

As per OP link Nex claims that what Rio team released (so far) is just linear interpolation of weights between Nex and OG Qwen model. With no attribution to Nex and zero signs of Rio doing any training of their own.

00index 5 days ago

Are you talking about the credit that was just updated an hour ago? lol

bachmeier 5 days ago

"Their work"? First you had the original content creators that did 99.99% of the work. Then you had the US companies bundle it up into a frontier LLM. Then "they" did the "work" of using the US model as a foundation for their own. So in the sense of doing 0.00001% of the actual work that went into their product, sure.

I'd say it's more like someone forking a Linux distro, adding a few themes and fonts, and then complaining when someone else forks their distro and adds another theme.

dghlsakjg 5 days ago

That’s the joke.

bachmeier 5 days ago

It isn't. The entirety of the comment I responded to is "Oh no, someone is profiting off of their work without proper attribution!?!?" It's a valid point, but references someone using content created by others for profit. I'm objecting to equating this project with the work done by the original content creators. They're not remotely the same thing.

I understand how the internet works and how people respond to others in this type of setting, but the comment I replied to did not in any way make the point I was making about the disproportionate nature of relative contributions.

vasco 5 days ago

> I understand how the internet works and how people respond to others in this type of setting,

You should frame this as a reminder to be more charitable in your positions because sometimes you can be wrong. This subthread ended being one of the funniest I've read recently.

idiotsecant 5 days ago

It's time to stop digging

dghlsakjg 5 days ago

> It isn’t

It is.

> I understand how the internet works and how people respond to others in this type of setting, but the comment I replied to did not in any way make the point I was making about the disproportionate nature of relative contributions.

Do you understand?

Jokes aren’t that funny when you have to dig into an explanation on the nuance of why the hidden meaning doesn’t match the surface meaning in exact degree and proportions. That turns a joke into a pedantic comment. And paradoxically muddies the point by explaining it.

We aren’t morons. We understand that Picasso is doing something on a different level than someone feeding bulk scraped JPGs of paintings into a python script. You really don’t have to explain.

bachmeier 5 days ago

Have a nice day.

bwilliams18 5 days ago

That was the joke of the parent comment.

JoshStrobl 5 days ago

That joke really went over your head, huh...

harikb 5 days ago

It is only a problem if you claim it to be an independently developed OS with no attribution to base

idiotsecant 5 days ago

Oof this is delete your post level I think. Sorry bud, I been there.