| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by liampulles 249 days ago
	If you take the embedding for king, subtract the embedding for male, add the embedding for female, and lookup the closest embedding you get queen. The fact that dot product addition can encode the concept of royalty and gender (among all other sorts) is kind of magic to me.

1 comments

puttycat 249 days ago

This was actually shown to not really work in practice.

link

intelkishan 249 days ago

I have seen this particular work example to work. You don't get the exact match but the closest one is indeed Queen.

link

godelski 249 days ago

Yes but it doesn't generalize very well. Even on simple features like gender. If you go look at embeddings you'll find that man and woman are neighbors, just as king and queen are[0]. This is a better explanation for the result as you're just taking very small steps in the latent space.

Here, play around[1]

  mother - parent + man = woman
  father - parent + woman = man
  father - parent + man = woman
  mother - parent + woman = man
  woman - human + man = girl

Or some that should be trivial

  woman - man + man = girl
  man - man + man = woman
  woman - woman + woman = man

Working in very high dimensions is funky stuff. Embedding high dimensions into low dimensions results in even funkier stuff

[0] https://projector.tensorflow.org/

[1] https://www.cs.cmu.edu/~dst/WordEmbeddingDemo/

link

liampulles 248 days ago

Thank you for the comment!

This led me to do a bit more research, and I see indeed the queen result is in itself infact "cheating" a bit: https://blog.esciencecenter.nl/king-man-woman-king-9a7fd2935...

#TheMoreYouKnow

link

yellowcake0 249 days ago

so addition is not associative?

link

godelski 249 days ago

I think you're missing the point

link

yellowcake0 248 days ago

It's a pretty exotic type of addition that would lead to the second set of examples, just trying to get an idea of its nature.

link

mirekrusin 249 days ago

Shouldn't this itself be a part of training?

Having set of "king - male + female = queen" like relations, including more complex phrases to align embeddings.

It seems like terse, lightweight, information dense way to address essence of knowldge.

link