Hacker News new | ask | show | jobs
by blackbear_ 2557 days ago
Yes, you are correct. In other words, the relationship between temperature and quality is not linear, so directly using the temperature in a linear model gives wrong results. (To be pedantic, the probability will approach 1 as the temperature goes to infinity)
2 comments

Well, in the real world you can only boil water so the highest temperature of (uncontained) water at nominal pressures is ~100C no matter how long you wait or how much heat you apply. Even in an espresso maker the max pressure sets the max temp. So it might be that the maximum temperature is the right answer (not infinity) for a good cup.
The example problem seems to be completely inappropriate then for the article/model.
I think it's a little more complicated then that. A variable might not be linear in general but may be approximately linear within a certain range of values. You might fit the model on values only within that linear range and thus get a good fit. The model may be very useful inside the range of fitted values but garbage at extrapolation. As long as you understand the limitations it can still be a useful model.
While you’re right, the original post is meant to be pedagogical. Someone who doesn’t understand the fundamentals of model selection might learn the wrong lesson(s).

You kinda have to expect a student to use the examples you give.

There’s really no upside to using that example.

If I were the author, I would drop things like temperature unless I am willing to discuss nonlinear transformations thereof.

Instead, why not simplify it down to D=Freshness of Coffee Beans ?