|
|
|
|
|
by randomknowledge
4889 days ago
|
|
Okay hows this: unless the author of the original blog post is deliberately deceiving the audience but putting a bunch of points on top of each other then one can tell there is no line which most of the points lie near. I agree that a scatter plot is not the best for showing data with that many data points, but frankly it's kind of irrelevant. The point the author was making was just that it isn't that hard to cook up some data that is not highly correlated, but will be if you bin and average it. |
|
See the discussion here: http://news.ycombinator.com/item?id=4027337
I stand by the claim made in my blog post. Don't use scatterplots, use a density plot instead.
Incidentally, according to a comment the author made, the correlation is actually 0.3. That's far better than his graph suggests.