Free Output – AI output copyright status checker | HN Mirror

Y	Hacker News new \| ask \| show \| jobs

	Free Output – AI output copyright status checker (freeoutput.org)
	35 points by knewter 445 days ago

7 comments

6stringmerc 445 days ago

Very interesting tool yet completely irrelevant for the United States as AI generated content is not eligible for copyright protection by anyone (pending appeal). As it stands y’all may not like this reality, but it’s quite clear in legal terms. Claiming an AI generated work is protected by copyright simply doesn’t matter regardless of which entity is asserting the right at present.

AndrewSwift 445 days ago

I don't believe this is the case — in the situation that is commonly referenced to make this point, someone sought to have an AI legally declared to be the author of a specific work, and that was ruled not to be possible. But I am not aware of cases where people use prompts to generate artwork with AI and have found it impossible to copyright.

mountainb 445 days ago

For more, see the Copyright Office's reports on this: https://www.copyright.gov/ai/Copyright-and-Artificial-Intell...

rustc 445 days ago

What's the practical use of this? The AI doesn't know if the output is sufficiently different from the training material. If the output you get matches pre existing content, the license these AI companies give you won't save you.

TuringNYC 445 days ago

>> If the output you get matches pre existing content, the license these AI companies give you won't save you.

Really? Isnt that the purpose of the indemnification agreement the vendors have underwritten?

HeatrayEnjoyer 445 days ago

We don't see it aggressively enforced in the US (unclear if that status quo will continue) but copyright infringement is also in the criminal code, and that can't be indemnified.

Civil indemnification still means a sued party must go to court and assert it as a defense, and there's no guarantee that a judge won't throw it out as invalid. These are uncharted legal waters.

Multicomp 445 days ago

I guess I thought that If an image was generated by these tools, at least in the US, the copyright office did not consider it to have any copyright at all, therefore it was by default public domain?

numpad0 445 days ago

Note that you can still violate someone else's IP rights, only your side of claims will be null and void if courts determine you're not the creator of content.

dijksterhuis 445 days ago

short version in the USA only

* generated without any human interaction/prompting -> not copyrightable.

* with human input/prompting -> it depends.

see this comment in the thread and the child comment providing a link to a report by the US copyright office (read page 2 of the executive summary)

https://news.ycombinator.com/item?id=43518945

Retr0id 445 days ago

It claims that OpenAI output is "free", but the OpenAI ToS says (among other things)

> You are prohibited from ... Using Output to develop models that compete with OpenAI.

If this were a software license, it'd surely be classified as nonfree.

alexgleason 445 days ago

This means they would potentially cancel your account if you violated it, but not that they would claim ownership over the work.

jchw 445 days ago

But I believe since a ToS isn't a copyright license, this can't really be enforced using copyright laws. Most likely they can ban you. Is there even a slim chance you could be sued for breach of contract? Hell if I know, I'm not a lawyer.

Thinking another layer deep, though, if someone used OpenAI tools to develop software that then later got used to compete with OpenAI, surely it would fully workaround this already unenforceable ToS restriction anyways, right?

binarymax 445 days ago

And as we can see from DeepSeek this clause means nothing, outside the realm of OpenAI blocking your access to its models.

wizzwizz4 445 days ago

I know someone from OpenAI claimed this, but is there any evidence that DeepSeek actually trained their models on output of the models OpenAI have?

binarymax 445 days ago

They talk about some examples in their research.

> “Specifically, we initialized the DeepSeek-Prover using the DeepSeekMath-Base 7B model (Shao et al., 2024). Initially, the model struggled to convert informal math problems into formal statements. To address this, we fine-tuned the DeepSeek-Prover model using the MMA dataset (Jiang et al., 2023), which comprises formal statements from Lean 4’s mathlib2 that were back-translated into natural language problem descriptions by GPT-4. We then instructed the model to translate these natural language problems into formal statements in Lean 4 using a structured approach.”

Section 3.1 in https://arxiv.org/html/2405.14333v1

wizzwizz4 445 days ago

I was thinking of their general-purpose models, like DeepSeek-R1 and DeepSeek-V3, for which I haven't found evidence that OpenAI models were used to generate synthetic training data. But I didn't find this, so clearly my searching skills aren't great.

knewter 445 days ago

just found FreeOutput, a website that compares AI providers based on whether they assign copyright to the user.

bionhoward 444 days ago

Wrong about OpenAI because it has a customer noncompete

IshKebab 445 days ago

Oh I thought this was actually going to try to check if your output actually matched any copyrighted material, which would be useful.

Oh well.