Hacker News new | ask | show | jobs
by theRealEros 384 days ago
I feel like we are missing an important point...the transformer model that has been used here is trained on known languages. This means it cannot extract meaningful embeddings from a text in a unknown language...are the plots just noise then?