Hacker News new | ask | show | jobs
by puppystench 54 days ago
They ran a bounty on Kaggle last year but with $500k in payouts and with all results open and publishable.

https://www.kaggle.com/competitions/openai-gpt-oss-20b-red-t...

With only $25k in payouts and everything locked down under NDA, I can't imagine many people will participate. Well, other than those submitting mountains of LLM-generated junk.

4 comments

> Well, other than those submitting mountains of LLM-generated junk.

Assuming somehow some of them use halfway decent models and prompts… They successfully pushed some of the token cost of their analysis work off on customers!

this economic model works for all 'bounty' related work
They claim their models have PhDs but they still can't automate their own red teams. The bounty is not a bounty, it is for gathering training data so that they can claim for the next deployment they have the safest possible & most super duper aligned agentic computer using AI that will never ever make any bio weapons.

I am also willing to bet money that for their next marketing campaign they will claim they have automated the red team for bioweapons research prevention & whatnot.

I was surprised at the low bounty too, considering the resources of openai

Last year I won a similar prompt injection challenge ran by a crypto startup against the latest claude and gpt (at the time) and it was considerably more money, from an org with maybe $5-10m in funding.

That and the restrictive NDA kinda tells me they're not looking for serious bounty hunters, who would either want a lot more money or, alternatively, to be able to publish their work; seems like a marketing stunt.

basically discount Kaggle. still get people poking at it, just none of the writeups or who-gets-paid drama.
This model is much more powerful than gpt-oss-20b, notice how the contest was not even for the 120b model. Also, bio was not a subject.
The model is more powerful, so the bounty is 1/20th the size? More risk, less reward?

"Biorisk" seems to be a concept not only invented by OpenAI but exclusively taken seriously by them. I wonder if this program is less about finding actual risks than it is hopefully casting a wide net for someone to help them prove their model is relevant in this space.

> "Biorisk" seems to be a concept not only invented by OpenAI but exclusively taken seriously by them.

This is false. Antropic just bundles it into CBRN. As for inventing it, the idea of AI-created bioweapons as a concrete risk far predates OpenAI as a company.

Not really. Anthropic has the "CBRN filter" on Opus series. It used to kill inquiries on anything that's remotely related to biotech. Seems to have gotten less aggressive lately?

I was reverse engineering a medical device back in 2025 and it was hard killing half my sessions.