Hacker News new | ask | show | jobs
by dmos62 1431 days ago
I realise this is an overly-broad question, but any insight into what's the state-of-art in Similarity Learning for article-type text?

More specifically, I'm interested in deriving distances between writing style, arguing style, etc.

2 comments

There's study here: http://cs230.stanford.edu/projects_spring_2021/reports/57.pd...

Basically, you can collect text from different authors, then you can use authors names as labels to train a similarity learning with it. My suggestion would be finetune a Transformer model with a specific head and an ArcFace loss.

It is definitely possible to do, if you have a proper training set. You would need to somehow give model a signal, that you are interesting in e.g. arguing style specifically, and not the topic of the text.