by andai
198 days ago
> the similarity check doesn't appear to do translation

This surprises me. The system is based on embeddings. AFAIK, embeddings cluster the same concept in different languages in roughly the same place? Maybe it depends on the model (or maybe the match isn't exact and the clustering cutoff loses it).
The embeddings themselves will probably cluster OK across different languages (but I have not tested this yet).
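
To make the "clustering cutoff loses it" possibility concrete, here is a minimal sketch of how a cosine-similarity threshold could drop a cross-language match. Everything in it is an assumption for illustration: the vectors are made-up toy values (not output from any real embedding model), and `is_similar` and both cutoff values are hypothetical, not taken from the system being discussed.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def is_similar(a, b, cutoff):
    """Hypothetical check: treat two items as the same if they clear the cutoff."""
    return cosine_similarity(a, b) >= cutoff

# Made-up toy vectors standing in for the same sentence in two languages.
# Real embeddings have hundreds of dimensions; these only show the mechanism.
english = [0.9, 0.1, 0.3]
french = [0.7, 0.4, 0.3]

sim = cosine_similarity(english, french)
print(f"similarity: {sim:.3f}")
print("match at cutoff 0.90:", is_similar(english, french, 0.90))
print("match at cutoff 0.95:", is_similar(english, french, 0.95))
```

With these toy numbers the pair sits close together but not identically, so a looser cutoff groups them and a stricter one does not. Whether a real multilingual model behaves this way would need to be tested.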