Hacker News new | ask | show | jobs
by Invictus0 336 days ago
You could settle this fairly easily by training two small models on highly disparate datasets, maybe historical Chinese texts and historical greek texts, in their native languages, and seeing if the same similarities recur.