| For both personality and preference vectors, it would be great to see data from the data from OKCupid, from its good old days (https://www.reddit.com/r/gwern/comments/aapn1l/okcupid_blog_...). Even more, since for questions there are: - questions one decided to answer (which says something on its own) - answers - declared answers they accept in their partners - actual partners they pursue (judged by matches, or dates) While I expect that mostly similarities attract (so-called associative mating), there are compatible traits (e.g one person loves to listen, one person loves to talk), and there is the level of lack of self-knowledge, or hypocrisy (what we SAY we like, vs what we actually do). And then e.g. probability that person A likes person B can be expressed as: sigmoid(actualPrefVecA * personalityVecB) ...and with gradient descent magic, we can turn people into vectors! |