Hacker News new | ask | show | jobs
by ok123456 641 days ago
The 'genres' emerge out of the clusters. The fact that you can pick out 'genres' from this plot is an example of semi-supervised learning.
1 comments

OK but there's no clue what they are until you start poking around. It's not something that's useful without some substantial investment of time and exploration. If that's the goal, fine but it's not how I would present a "favorite books" list.
Making 'genres' and classifying things is imprecise and requires experts in subject matter and library science to get it right. The "genre" labels here emerge out of the data itself.