Hacker News
by totoasticot 2534 days ago
I would strongly advise anyone interested in the label frequencies of the dataset to take a look at the JSON file provided. The interesting thing is not so much the frequency of label X or Y, but the frequency of each set of labels (this would actually be a great addition to the webpage).

Pictures can have multiple labels, so the ratio of "Forum + Drugs + Finance" to "Market-place + Weapons" conveys more information than just the global frequency of "Finance"-related pages :)
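
To make the difference concrete, here is a rough Python sketch of counting label sets rather than single labels. I don't know the exact layout of the JSON file, so the records below are made up, and the "labels" key and both function names are my own assumptions.

```python
from collections import Counter

# Made-up sample records; the real JSON file's schema may differ.
# Assumption: each picture is a dict with a "labels" list.
records = [
    {"labels": ["Forum", "Drugs", "Finance"]},
    {"labels": ["Finance", "Forum", "Drugs"]},
    {"labels": ["Market-place", "Weapons"]},
    {"labels": ["Finance"]},
]


def label_counts(records):
    """Global frequency of each individual label."""
    return Counter(label for r in records for label in r["labels"])


def label_set_counts(records):
    """Frequency of each combination of labels, ignoring label order."""
    return Counter(frozenset(r["labels"]) for r in records)


per_label = label_counts(records)
per_set = label_set_counts(records)

# Print combinations from most to least frequent.
for labels, count in per_set.most_common():
    print(" + ".join(sorted(labels)), count)
```

Using frozenset means "Forum + Drugs + Finance" and "Finance + Forum + Drugs" count as the same combination. In this toy data "Finance" appears three times overall, but only the per-set counts show that two of those come from "Forum + Drugs + Finance" pages. If the provided file has a similar shape, passing the result of json.load to these functions should work the same way.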