Hacker News new | ask | show | jobs
by Jonanin 1954 days ago
This is an incredibly naive perspective. I guess you want to ban search engines, self driving cars, automated filtering of lewd and abusive content (why do you think FB isn’t full of porn? It’s not a hand engineered algorithm), automatic speech recognition for the hearing impaired, and a vast swath of important technology I didn’t list. I don’t think you really understand the implications of what you’re asking for. Sorry - black boxes are here to stay. And they are immeasurably useful. I could spend hours listing important and crucial technologies that you want banned because you are scared of racism.
3 comments

Search engines already worked before ML, neither automated filtering nor self-driving cars actually work in reality.
I agree with you but I am still scared of racism.
> I agree with you but I am still scared of racism.

My suspicion is that the concern with machine learning over racism is rooted in two things. The first is just the general modern trend of accusing anything you don't like of being racist, because everybody hates racism and wants to fight it. And the second is the fear on the part of people who make a living fighting racism that machine learning might actually put them out of a job.

Because machine learning is basically a paperclip optimizer. You tell it to maximize a thing, it maximizes the thing and minimizes everything else. Racism isn't paperclips, so the paperclip optimizer will optimize for smashing it in favor of making more paperclips. And then they're out of business.

Because when you look at the criticism of this stuff, it generally looks like this. ~12% of the population is black, only ~5% of the selected applicants are black, the algorithm is accused of racism.

But nothing is that simple, because all kinds of things like income and education level and so on correlate with race, so you have to take all of those things into account before you can tell what's going on. And taking into account all of the available data is how machine learning works.

Which isn't to say that you couldn't make an algorithm racist. Tell it to optimize for applicants with a particular skin color and it does. But then your problem isn't with the algorithm, it's with the jackasses who asked for that.

What to optimize for is a much more general and difficult question. (Hint: Not paperclips.)

> My suspicion is that the concern with machine learning over racism is rooted in two things. The first is just the general modern trend of accusing anything you don't like of being racist, because everybody hates racism and wants to fight it. And the second is the fear on the part of people who make a living fighting racism that machine learning might actually put them out of a job.

I don't get to how you go from this statement, to then again explaining exactly how racism is embedded in algorithms. By using the biased data we have in the real world...

It isn't the data that's biased. If you're hiring a computer scientist and disproportionately few black people have a degree in computer science, the data is not lying about who the qualified applicants are and the algorithm can't change that.

To fix that you have to cause more black high school students to go to college and study computer science and then wait two generations until their proportionality in the installed base of qualified computer scientists reaches parity. There is no magic wand that makes it happen overnight.

But concentrating on the places where it can't be solved instead of the places where it can will make it take even longer.

No, the racism is a real issue, though a lot of it is caused by limited training data. Having an image recognition algorithm identify Africans and South Asians as gorillas doesn't happen because the designers intended it, but because their training data had only light-skinned human faces and dark-skinned primates. But the effect is racist even though this wasn't the intent.

Likewise, if the system is trained to duplicate human decision-making (like who gets loans), interesting things can happen: if the decision-makers unconsciously favored whites over blacks, the algorithm could wind up weighing skin color or stereotypically Black or Latino names negatively, meaning that the final model is explicitly racist, just because there is a correlation in the training data. That doesn't mean we shouldn't use deep learning, it means that it's not responsible to just fit the training data and ship without testing for such problems.

> Having an image recognition algorithm identify Africans and South Asians as gorillas doesn't happen because the designers intended it, but because their training data had only light-skinned human faces and dark-skinned primates. But the effect is racist even though this wasn't the intent.

This isn't racism at all. It's just bad PR because humans take the implication that calling black people monkeys is calling them stupid, since that's the implication you would draw if a person did that.

An algorithm doing that is just recognizing that humans and gorillas are both primates:

http://www.aquilaarts.com/bushmonkey.html

And then it's a bug, in the same way that recognizing a black balloon as a balloon but a white balloon as a light bulb is a bug. It has nothing to do with race at all. The algorithm isn't racist against white balloons. The solution is a general increase in the amount of training data, which is what you want in all cases regardless.

> if the decision-makers unconsciously favored whites over blacks, the algorithm could wind up weighing skin color or stereotypically Black or Latino names negatively, meaning that the final model is explicitly racist, just because there is a correlation in the training data.

Except that this is exactly the thing that a paperclip optimizer will smash to bits because it interferes with the goal of making more paperclips.

I’m not an expert in this, but I think racists call black people apes, not just because they think they are stupid, but because they think they are sub-human.

Blacks don’t reach the intelligence and blah to be human. I think that’s what racists drive at when they call someone a monkey, and that’s why it’s so offensive.

It would also make your theoretical AI racist, as it identified blacks as not human.

Honestly, at the end of the day that is what is so difficult about much of this. It’s mostly subjective

> It would also make your theoretical AI racist, as it identified blacks as not human.

That isn't how racism works. It's like saying that an AI that misclassifies a bat as a bird is racist. It's not racism, it's just error.

And it's not a race-specific error, it's a general error for which someone cherry picked the instances that imply a racially motivated intent that doesn't actually exist.

Calling it racism is pointless and misleading because there is no race-specific cause or solution to the problem. The solution is completely identical to the one for the same error in the general case, i.e. get more training data.

This isn't theoretical: Google Photos was identifying Black people as gorillas, and they didn't fix it, they worked around the bug by removing "gorilla" as a possible label in 2018. Some here seem to be saying that we can't call this racist unless someone specifically intended to do this because of hate. It's not subjective when someone's own face is so flagged.

https://www.theverge.com/2018/1/12/16882408/google-racist-go...

I absolutely want to ban self-driving cars that behave in ways no human can explain or understand! The mere idea that anyone would think that should be legal is borderline insane.

All you are doing here is convincing me that tech companies are just runaway trains with nobody at the controls!

> I absolutely want to ban self-driving cars that behave in ways no human can explain or understand!

Can you explain or understand the algorithms humans use to drive cars?

Screw that.

Explain to me step by step how you walk.

Humans are held responsible if they cause harm to others. If a driver hits a pedestrian on purpose he is charged with murder. Who do you charge if a self-driving car behaves in this way?
Who do you charge when the brakes don't work on your car? When the airbags don't activate? When your rain sensor doesn't work?

Believe it or not, your car is not that primitive when compared to a self-driving one in terms of the number of things it does autonomously.

Isn't it the case that car manufacturers will have to issue a recall on defective models ?
Yes, and the same holds for self driving cars. I fail to see the difference.
the humans that designed the car? to be clear, computers don't intentionally do anything. if an engineer deliberately programs a car to hit pedestrians for no reason, they would be charged with murder. if the car hits a pedestrian as a result of an engineering mistake, the company would be liable for damages, and if particularly egregious, engineers might face manslaughter charges.
To be clear, that was my point. You can't punish the computer and good luck finding the one to punish for an accident ten years later. But maybe if it's free software...
Punishing bad driving is - I assume - intended to incentivise better driving.

If a company makes a self-driving car and that car then drives badly, surely the response needs to be to incentivise the company to improve their engineering practices, eg, spend more on testing, or require more levels of review of changes, or whatever other organisational changes they need to make safer cars. You don't need to find an individual person responsible to create that incentive. And if you really do want to find an individual responsible it can easily just be the executives of the company (and the executives are probably pretty easy to find even a decade later).

Well, you blame the black box algorithm that nobody can predict or understand, and you just call it an "accident".
Who would you blame if the algorithm could be printed on paper?
only if caught
What about all the other examples he listed. What about cancer detection? Or viral spread prediction? Drug discovery or medical imaging diagnosis? Physics research?

Machine learning is very widely used in the sciences and extremely beneficial to humanity in uncountably many ways and assuredly countless more to come. Of course technologies can be used for evil but so can nearly everything that exists. I believe your proposal comes from a desire to help or better the world, but to ban all non-human-readable algorithms is frankly ridiculous and demonstrates a naive understanding of the issue. It sounds a lot like the calls by the U.S. Congress to ban encryption.

Here is what I think:

- In medical: your doctor should be responsible for your diagnosis and drug company is responsible for defective drugs, except when they get away with lobbying and hiring good lawyers.

- In physics: I'm not sure if it's as big of a problem as in social networks. But consider this case: If you cannot reproduce the result of an experiment due to a ML model being cryptic, that would lead to huge credibility issue in science.

At best, you may be able to justify black boxes providing secondary indicators: Maybe using AI to study cancer detection might lead you to a new solid discovery, but "we use AI to determine if you have cancer" should never be the mission, as it fails to generate useful information about how it is detected.
> fails to generate useful information about how it is detected

Patients don’t care how cancer is detected. Patients care if the diagnosis is correct.

Continue this line of thinking, would you want all algorithms banned? Might as well shut everything down :shrug:
We can't even explain all physical phenomena, so good luck with banning anything that depends on the gravity of earth to function, because we don't know what gravity is.
But gravitational laws stay unchanged for millenials isn't it ? If I toss an apple, it will falls down. If I throw it fast enough, it goes into orbital mode.