| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by GolDDranks 2 days ago
	Why aren't the role tags preprocessed algorithmically/deterministically and then fed in as one-hot-encoded vectors alongside the semantic word embeddings? I'd imagine that it would be easier to train to _stay_ in the role an not confuse it, if the current role marker is explicitly set as a part of each input token, and not just implied by some past token. Plus a input separate from the word embedding would be unforgeable.

1 comments

Always wondered this. Must have been tried and not worked?