Hacker News new | ask | show | jobs
by tedivm 1226 days ago
I've worked in ML for awhile (on the MLOps side of things) and have been in the industry for a bit, and one thing that I think is extremely common is for ML researchers to grossly underestimate the amount of work needed to make improvements. We've been a year away from full self driving cars for the last six years, and it seems like people are getting more cautious in their timing around that instead of getting more optimistic. Robotic manufacturing- driven by AI- was supposedly going to supplant human labor and speed up manufacturing in all segments from product creation to warehousing, but Amazon warehouses are still full of people and not robots.

What I've seen again and again from people in the field is a gross underestimation of the long tail on these problems. They see the rapid results on the easier end and think it will translate to continued process, but the reality is that every order of magnitude improvement takes the same amount of effort or more.

On top of that there is a massive amount of subsidies that go into training these models. Companies are throwing millions of dollars into training individual models. The cost here seems to be going up, not down, as these improvements are made.

I also think, to be honest, that machine learning researchers tend to simplify problems more than is reasonable. This conversation started with "highly scalable system from scratch, or an ultra-low latency trading system that beats the competition" and turned into "the parsing of and generation of this kind of code"- which is in many ways a much simpler problem than what op proposed. I've seen this in radiology, robotics, and self driving as well.

Kind of a tangent, but one of the things I do love about the ML industry is the companies who recognize what I mentioned above and work around it. The companies that are going to do the best, in my extremely bias opinion, are the ones that use AI to augment experts rather than try to replace them. A lot of the coding AI companies are doing this, there are AI driving companies that focus on safety features rather than driver replacement, and a company I used to work for (Rad AI) took that philosophy to Radiology. Keeping experts in the loop means that the long tail isn't as important and you can stop before perfection, while replacing experts altogether is going to have a much higher bar and cost.

8 comments

>We've been a year away from full self driving cars for the last six years

Try at least 12 [0]

(I would say 15 but my 45-second search didn't yield anything that far back)

[0] https://spectrum.ieee.org/how-google-self-driving-car-works

https://www.youtube.com/watch?v=I39sxwYKlEE was a self-driving van in 1986 which could detect obstacles by 1999 and drive in a convoy using that.

This is a bit like seeing Steve Mann's wearable computers over the years ( https://cdn.betakit.com/wp-content/uploads/2013/08/Wearcompe... ) and then today anyone with a smartphone and smart watch has more computing power and more features than most of his gear ever had, apart from the head mounted screen. More processing power, more memory, more storage, more face recognition, more motion sensing, more GPS, longer runtime on battery, more bandwidth and connectivity to e.g. mapping, more assistants like Google Now and Siri.

And we still aren't at a level where you can be doing a physical task like replacing a laptop screen and have your device record what you're doing, with voice prompts for when you complete different stages, have it add markers to the recording, track objects in the scene like and solve for questions like 'where did that longer screw go?' or 'where did this part come from?' and have it jump to the video where you took that part out. Nor reflow the video backwards as an aide memoire to reassembling it. Or do that outside for something like garage or car work, or have it control and direct lighting on some kind of robot arm to help you see, or have it listen to the sound of your bike gears rattle as you tune them and tell you or show you on a graph when it identifies the least rattle.

Anything a human assistant could easily do, we're still at the level of 'set a reminder' or 'add to calendar' rather than 'help me through this unfamiliar task'.

Wow - Steve Mann - haven't checked what he's doing in ages - real blast from the past :-) I was really disappointed the AR/VR company he was with went under - I had really high hopes for it.

RE: changing you laptop screen. My buddy wants an 'AR for Electronics' that can zoom in on components like a magnifying glass (he wants head mounted), identify components by marking/color/etc and call up schematics on demand. So far, nothing seems to be able to do that basic level of work.

Did you notice how far rotated the steering wheel is off of center while the Mercedes van vehicle is going straight on the highway? That looks crazy.
Based on my experience watching How It’s Made, many factories are extremely automated including lots of robots. Warehouses are not factories though.
It really depends on what you're talking about. Individual components can often be automated fairly successfully, but the actual assembly of the components is much harder. Even in areas of manufacturing where it's automated you have to do massive amounts of work to get it to that point, and any changes can result in major downtime or retooling.

AI companies such as Vicarious have been promising AI that makes this easier. Their idea was that generic robots with the right grips and sensors can be configured to work on a variety of assembly lines. This way a factory can be retooled between jobs quicker and with less cost.

Lookup lights out manufacturing. There are factories that often run whole days in the dark because there's no point turning on the nights if there's no one around
Not really. Although running CNC milling machines and lathes unattended at night is reasonably common. Day shift sets them up, and they cut metal all night.

Fanuc, the robot manufacturer, famously does run a lights-out factory, and has since 2001. It was the dream of Fanuc's founder. Baosteel now has a lights-out steel coiling facility. Both of these are more PR than cost effective.

There are many factories where there are very, very few people for large rooms full of machines, though.

They are mostly a myth, though not always.

This comment from 7 days ago covers it adequately:

https://news.ycombinator.com/item?id=34562122

You have just described Pareto's principle[0] the 80/20 rule. It takes 20% of the effort to get to 80% but it then takes 80% of the the effort to finish the final 20%.

[0]https://en.m.wikipedia.org/wiki/Pareto_principle

AI seems like Pareto's principle combined with Zeno's paradox.

Not because goal posts keep moving, but because we can only do 80% of the remaining distance each time, and the remaining 20% is still obvious.

The thing with software is, it happens slowly... then it happens all at once.
Ah, the good ol "A(G)I will arrive in 10 years!" --For the past 50+ years, basically.

It's a cautionary tale to people who are working in ML to be not too optimistic on "the future", but in my opinion being cautiously optimistic(not on AGI though) isn't harmful by itself, and I stand by that. Well at least until we hit the next wall and plunge everyone into another AI winter(fourth? fifth?) again.

As a plus, we do actually see some good progress that benefited the world like in biotech. Even though we are still mostly throwing random stuffs at ML to see if it works. Time will tell I guess.

Kurzweil gets a lot of flack for this sort of thing, he's generally presented as the ridiculous hype man for AI. And yet, he bet in 2002 that an AI would pass the Turing test by 2029. (And this is actually a more conservative prediction than "we will have AGI by 2029.") And looking at GPT3 it seems like he is probably going to win that bet.
I think the big revolution of the last few years has been to recognize that we'll likely get robots that can pass the turing test well before we get full self driving vehicles that can run anywhere there are basically ordinary paved roads.

I think even three years ago, most people would have thought the reverse.

So Kurzweil was imagining the turing test as the capstone to a decade of more and more capable ai products, not as "kind of early interesting success that may (or may not) presage really useful AI."

("The Turing test" is a pretty hazy target. I have no doubt that a chatgpt that was not trained to loudly announce that it was an AI could convince lots of people that it's a real human, right now. I think it's also the case that people with some experience with it could pretty quickly find ways to tell what it is.)

The Turing test has always been hazy - I don't think it's something we'll consider "passed" until at least a clear majority consider it passed (if not substantially further).

Otherwise you risk claiming ELIZA passed it, because a couple people thought so. Or that one Google employee this time.

Yes, that's what I was trying to say in the last paragraph. The Turing Test was an interesting thought experiment, not, like, an actual test. It's never been very clear how to operationalize it, and it's clear that Turing wasn't imagining how easily you can actively fool people. He was more making a point that we don't have an internal definition of intelligence -- it's not like multiplication where you can examine the underlying process and say, "Well, did it do this correctly?" You can only look at the results.
Good point, I do appreciate this comment. Thanks for adding this. It is is interesting in how it very much appears that he will be correct, but instead in a different way maybe than most of us would reasonably have guessed at the time.
Working out the engineering challenges will probably take a decade extra, but I wouldn't listen to the ML researchers' opinions on this issue; the evidence that they are in the drivers seat is shaky. We're still seeing exponential gains in processing power and we're closing in on order-of-magnitude amounts of processing power being available in silicone as in a human brain. There is a pretty decent chance that there is some magic threshold around there where all these tasks become easy with current algorithms.
Just let them bring on another AI winter.
I can understand that. I think that might be somewhat of a quick generalization. There are tendencies of people in the field to sometimes jump to rapid conclusions, but that is not researchers at all or in this case, me. I tend to be incredibly conservative, for example, and I have tangled with a number of "real world" systems enough to know some of the intricacies (though not at the edge).

If I were to make a point as to why your notes on self-driving cars and in-warehouse robots may not transfer to the case of software development, it's that they are fundamentally two very different problems with very different issues attached to them. It unfortunately is very much apples to oranges. They are both NP-hard but very different kinds of NP-hard.

A software program is a closed-loop target, though it is NP-hard. But we're optimizing for a different kind of metric here that is well-defined. Any kind of self-directed reinforcement-or-otherwise autoregressive-in-the-world algorithm is going to have an extraordinarily long tail of edge cases.

What I was talking about when I mentioned the geometry of the problem is not the parsing of the code, but the geometry of a near-optimal solution. Certainly, scale will be expensive, but Sutton is our friend here. That's why it's more "trivial" than problems that require humans in the loop -- you don't need humans to parse, structure, generate, and evaluate the data flow of a software code base, though admittedly if models like RHLF become popular as you noted, the endpoints that generate code under those geometric constraints -- those will become extremely expensive.

I think the geometric problem is very hard but the hurdle of scaled language models is more technically impressive to me.

What's nice is that unlike needing to generate a long, 1d story, too, there's more robustness with a huge field of possibility that's had years of work on the software side of things. It's not that it's going to be easy, but I think we've all grown as we've seen how hard self-driving cars are, and it's just not that kind of scenario, since all consequences of the 'world' within the repo-generation case are (for the most part) self-contained.

I hope that helps elucidate the problems a bit. To me, my optimism is much more rare, and only generally when I feel like I have a solid grasp of the fundamentals of it enough (i.e. I roughly know deliverability and have decent known error bounds on the sub-problems).

That said, I heartily agree with you that when all else fails -- assistive is good. What I see a "complete solution" doing well is creating a Kolmogorov-minimal, complete starting point and things evolving from there. Whether that works or not remains to be seen.

In other words, they’ll embrace, extend, and extinguish. Man, these AIs really are just regurgitating their training set!