| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by triceratops 411 days ago

> by being shown an excerpt [of copyrighted material]

How is this done? Are bits not written into RAM or disk? Are they not sent between machines in a training cluster? That's copying.

> it is seemingly not far removed from how humans consume content

Except that humans don't make full copies to RAM, or disk or paper.

2 comments

Workaccount2 411 days ago

The is a bar of usage built into the law, otherwise everyone who reads this wired article is violating copyright by making a full copy to their computer. Generally making non-lasting copies is fine, otherwise the internet wouldn't work.

AI doesn't need lasting copies to train, however I don't know what the actual implementation is. But if it's ruled that they can only use copyrighted data if it's not stored for more than the time it would take a human to consume, It wouldn't really cripple the models, but perhaps make training more logistically challenging.

It's important to understand that models are not data archives. They are statistical constructs made from getting quizzed, that uses human made content to generate the quiz questions.

link

triceratops 411 days ago

> otherwise everyone who reads this wired article is violating copyright by making a full copy to their computer

Wired explicitly sent that article to their computer for the purposes of reading it so it's not a copyright violation.

link

spwa4 411 days ago

> Except that humans don't make full copies to RAM, or disk or paper.

Images on your retina form exact copies.

They are scanned and translated into impulses that are then sent to a first set of "neural columns" - that's an exact copy.

This is then connected to the visual cortex by the two most high bandwidth links in the human body ("the optical nerve", there's 2 of them of course, always wondered why everybody insists on using the singular). Why would you have that high bandwidth link unless to create verbatim copies.

The way those columns are structured also very strongly suggests they make carbon copies, which they then make available on the "brain bridge" (which is probably at least vaguely similar to the "attention matrix" of a transformer). If it does work like that, that's also a verbatim copy.

The only way "humans don't make full copies to RAM" is that humans don't have separate RAM. The processing power is colocated with the processing, even on a microscopic level. You know, what everybody knows is the best way of doing things even in silicon, it's just incredibly impractical if you can't rebuild your circuit every time there's a slight change to the instructions your "computer" carries out (the brain is not a "Von Neumann architecture", except it kind of is when it regrows connections. But in the short term it isn't)

link

triceratops 411 days ago

> that's an exact copy.

Not for the purposes of copyright law.

> is that humans don't have separate RAM [or disk]

And that turns out to be incredibly important. Humans can't create a lasting, shareable copy of a copyrighted work by consuming it.

link

spwa4 411 days ago

Sure they can. You can learn a copyrighted work by hard, even indirectly, then quickly duplicate it by hand. Mozart was originally famous for making a business out of that.

link

triceratops 411 days ago

> then quickly duplicate it by hand

And that's a copyright violation.

link