I think you need to adjust the variation of scores, I got 1650 actually guessing, then realized most of the scores were low and got 1800 by just always guessing 239
Just change the range of scores. If you're not going to deliberately weight the set of stories to include outliers, then the whole game is really played in the 150-400 range anyways, so make that the slider.
Anchoring!!!
If the range has nothing to do with the items in question, just have simple number input. Tested multiple times and all items were below default / midpoint.
It helps a lot just to have an intuition for what a big article is here. The slider starts at ~1500 votes, which is an insanely successful story. You can get a strong score just by guessing 400 for everything, nudging up +100 for things you remember being popular and down -100 for things that seem obscure.
Could split the stories into buckets and then randomly sample from each bucket. Most stories are small, so they’re currently overrepresented in the sampling.
A "replacement-level" front page story is ~200 votes, +/- 50. If you're drawing just from the front page, and not from a deliberately weighted set of successful vs. marginal posts, most stories should be below 400.
(It's weird to say this but I'm not nerdy enough to have actually worked this out with data; it's just intuition from spending time here.)