Why does the data change so much on the boundaries of states? I would expect the data to be locally correlated, and there wouldn't be discrepancies between neighboring rural counties that are in different states.
"State of residence" was a predictor in their linear model. A person's state (and its government) will probably affect the outcome variable to some extend, so it's not really surprising that it might be retained as a significant term. However, I find it hard to believe it's so important. I wonder if that predictor is simply confounded with some unobserved variable relating to methods of surveying that differed by state.
why are people so quick to wish away the concept of state lines? if there were no (or less) obviously visible state lines in the data, and everything was more of a gradient, I would be immensely surprised.
There's got to be something substantially skewing the data.
Even just looking at counties I'm familiar with it doesn't pass the sniff test.
You've got the "hicks live here" county that's the same color as the "yuppies buy houses in these suburbs" county that's the same color as the "not much English gets spoken here" county and so on. And yet it's all different than the adjacent counties that happen to be across state lines that have similar demographics.
Ah ok. So we are looking at pretty large inaccuracies then. Typically you only want to draw in the county boundaries if you have reliable county level data. I’d call that a flaw.
My thought exactly. If you took out the borders, many states would still be visible. I would expect the boundary between Tennessee and Kentucky, for example, to be more-or-less invisible, as between Texas and Arkansas. But even Oklahoma is distinguishable from northern Texas, despite basically no cultural or political difference across the Red River.
people live in cities/towns/etc. which are exist in counties which exist in states. the data is per-county. vaccination policies/attitudes are largely defined by governors, who lead states. states often have a shared culture, but sometimes different parts of states have varying cultures.
often, population density affects culture/attitudes toward these sorts of things. as with many such maps, adding an additional analysis layer denoting population density would yield interesting results. (though if you intuitively know the population densities of various "outlier" states, and you know where major cities are in the lesser-populated states, you can already intuitively draw some correlations yourself.)
The model probably takes into account the state as a variable, and states vary as far as average hesitancy goes and in terms of vaccine distribution strategy.
Very good point. I wonder if there are local state policies in effect that artifact the data or if there is some sort of state-level estimator being used.