Hacker News new | ask | show | jobs
by nicce 252 days ago
> This means OAI won't need ads.

Ads are defenitely there. Just hidden so deeply in the black box which is generating the useful tips :)

3 comments

If you ask it to build a headless frontend web app, it immediately starts generating code with Next.js. I’ve always wondered how it was trained to default to that choice, given the smorgasbord of web frameworks out there. Next.js is solid, but it’s also platform-ware, tightly coupled to commercial interests. I wish there were more bias toward genuinely open-source technologies.
There's probably different ways the LLM converged to it.

One could be for example: from people asking online which tools they should use to build something and being constantly recommended to do it with Next.js

Another could be: how many of the code that was used to train the LLM is done in Next.js

Generally, the answer is probably something along the lines of "next.js is kind of the most popular choice at the time of training".

To me it feels like the default choice in the industry, perhaps it's not and I'm wrong but if I could have that feeling I can see how the AI can as well.
I've never seen next.js in the wild. I have seen plain React plus dotnet, though, a million times.
It is a trap. But once you realise that you are already too deeply invested.
Just append to your prompt "not using a framework developed by a company that supports a genocidal fascist regime"
I wonder what the ad labeling (according to EU law) would look like in that case.

In my (non-lawyer) understanding, each message potentially containing sponsored content (which would be every message, if the bias is encoded in the LLM itself,) would need to be marked as an ad individually.

That would make for an odd user interface.

Because the AI labs are just hovering up all internet text that they can, I've been seeing more and more marketing pilots that deliberately seed marketing material in thousands of fake, AI-generated blogs and tutorials. The intention here is to get new LLMs to train on these huge numbers of associations between specific use cases and the company's product. All in a way that gets their marketing information into the final weights.

You may have started seeing this when LLMs seem to promote things based entirely on marketing claims and not on real-world functionality.

More or less, SEO spam V2.