Hacker News new | ask | show | jobs
by Cyphase 716 days ago
> Also, what makes me think I can effectively imagine the average person’s vocabulary? People from other countries, ethnic groups, genders, socioeconomic backgrounds, or with different language backgrounds, might have quite different opinions on many borderline words. For example, I was quite conscious of the idea in the last few years that men and women tend to know different words. For example, practically, should I include TAFFETA or not?

Did you consider getting some other people to do this manual curation process? You could then use roughly the intersection of all lists.

1 comments

I did consider this but the effort of doing a full dictionary pass is a lot to ask for marginal improvement and I don't know anyone quite as obsessed as me who would do it. Paying someone would be possible, but finding the right combination of "willing to do it for pay" and "I trust their judgement" is hard.

In practice, the way I approach this is by reacting to complaints from players who either don't know words I included or were disappointed I didn't include a particular word.

How about doing a pass where you categorize as in for sure, out for sure, and idunno. There may not be too many of those idunno‘s, so may be possible to enlist help.

That said your approach seems to have worked well. Kudos!

Yup, that's a really good point. I kind of wish I had marked ambiguous words on a first pass, and then taken a bit of a different approach for a second pass of just the difficult ones.
Do you have metrics around how easy/hard words are for players?
I don't, unfortunately. I'm trying to avoid having a dedicated backend for this so there are Google Analytics but they don't allow that granular of a metric.

That definitely could be something interesting, but I'd probably need a decently larger player base to get enough data, considering how many words are possible.

I made a root comment before seeing this one but here's my shot at answering this question with the data / player base that I have: https://news.ycombinator.com/item?id=40902840