Hacker News
by srad1292 1423 days ago
I recently hit the same wall trying to find good, freely available dictionary data. My family and I like to play Quiddler now and then (you get cards and build words with them), and I wanted to make a Quiddler solver that would take your hand plus the top card of the discard pile and give you the best combinations of words, ranked three ways: by total points, by word length with points as a secondary sort, and by number of words with points as a secondary sort.

I found multiple sources of JSON data, but they had inconsistent formatting or missing information. I mainly wanted the list of words and the word type of each definition, so I could quickly filter out words that didn't have a whitelisted type. I ended up settling on a plain Scrabble dictionary, which was just a list of strings, and manually removing words that didn't belong as they came up.

It's understandable that the big names in dictionaries spent a lot of resources getting to where they are. That said, it's hard to believe that something as fundamental as a record of our language doesn't have an open-source, well-organized dataset accessible by API or download.
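For anyone curious, here is a rough sketch of the kind of search described above: filter a word list by whitelisted word type, then find the set of words you can build from a hand that scores the most total points. Everything in it is an assumption for illustration. The point values, the tiny word list, and the names `POINTS`, `WORDS`, `ALLOWED_TYPES`, and `best_combination` are made up, not the real Quiddler card values or any real dataset. It also treats every card as a single letter and ignores any penalty for unused cards.

```python
from collections import Counter

# Hypothetical point values, NOT the real Quiddler card values.
POINTS = {"a": 2, "c": 8, "e": 2, "o": 2, "r": 5, "t": 3}

# Hypothetical mini dictionary mapping each word to its word type.
WORDS = {
    "cat": "noun", "rat": "noun", "toe": "noun", "ore": "noun",
    "eat": "verb", "at": "preposition", "to": "preposition",
}

# Only words whose type is whitelisted are playable.
ALLOWED_TYPES = {"noun", "verb"}


def word_score(word):
    """Sum the point value of every letter in the word."""
    return sum(POINTS[ch] for ch in word)


def best_combination(hand, words=WORDS, allowed=ALLOWED_TYPES):
    """Return (total_points, word_list) for the highest-scoring set of
    words that can be built from the letters in `hand`."""
    candidates = sorted(w for w, kind in words.items() if kind in allowed)

    def search(remaining, start):
        best_score, best_words = 0, []
        # Only try candidates at index >= start, so each unordered
        # combination of words is explored once.
        for i in range(start, len(candidates)):
            word = candidates[i]
            need = Counter(word)
            if all(remaining[ch] >= n for ch, n in need.items()):
                score, rest = search(remaining - need, i)
                score += word_score(word)
                if score > best_score:
                    best_score, best_words = score, [word] + rest
        return best_score, best_words

    return search(Counter(hand), 0)


if __name__ == "__main__":
    print(best_combination("catore"))
```

This brute-force search is fine for a hand of a few cards but would need pruning or an index (for example, grouping words by their sorted letters) against a full dictionary. Ranking by word length or by number of words would only change the comparison inside `search`.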