Y
Hacker News
new
|
ask
|
show
|
jobs
by
pizza
606 days ago
I think you might be thinking of applying a kind of low-rank decomposition to the vocabulary embeddings. A quick search on Google Scholar suggests that this might be useful in the context of multilingual tokenization.