Hacker News new | ask | show | jobs
by ninjagoo 85 days ago
> I'm half expecting to see "AI model" appearing as stand-in for "linear regression" at this point in the cycle.

Already the case with consulting companies, have seen it myself

1 comments

Some career do-nothing-but-make-noise in my organization hired a firm to 'Do AI' on some shitty data and the outcome was basically linear regression. It turns out that you can impressive executives with linear regression if you deliver it enthusiastically enough.
Tbh, often enough, linear regression is exactly what is needed.
Yes, and we do it every day and call it 'linear regression' and don't need a data center full of expensive toys to do it
You do unsupervised learning without labels with a linear regression. Interesting. What would you regress in this case? The problem is the following: you have a point cloud of data (electronic signal from arrays arranged into an irregular pattern). You know the physics that was discovered. You are looking for rare events (one in a billion or less) and you don’t know what they look like.
And you think we did not try linear regressions? This is what we used to do 20 years ago. Then we gained two orders of magnitude in signal-to-background discrimination. And since our data are not even images, off-shelf solutions mostly don’t apply. Try to process40 MHz of incoming collisions (1 MB each) within 100 nsec with a linear regression of point-cloud data. When you are done trying, try to think that maybe (maybe…) life is not as easy as bread&butter. If you succeed, come and knock at CERN’s door. Maybe we will let you in…
Not everyone knows everything so knowledge is the new oil.

I do know about linear regression even had quite some of it at university.

But I still wouldn’t be able to just implement it on some data without good couple days to weeks of figuring things out and which tools to use so I don’t implement it from scratch.

Implement it...from scratch? Its literally least squares regression. Its a few lines of code. What are you trying to say here?
You have to get the data first build all data processing pipelines to get your parameters for linear regression.
A lot of data. More than google + netflix + you name it. And you have 100 nsec. And the data are not on disk. Good luck with your linear regression