Hacker News new | ask | show | jobs
by onnodigcomplex 1144 days ago
I spend months collecting the dataset to create a wordlist. And I also spend 100+ hours just judging squares and words. And eventually also training char-level LLM's to do that job for me. In Dutch you can compound and there is quite an active morphology so it can be really tricky to judge whether a given word is any good.

Since I worked on it on and off, and also did a bunch other related things I'm not sure, but I would be surprised if I spend any less than 1000+ hours on these silly word-games.