Hacker News
by zamadatix, 1188 days ago
You could always rely on Levenshtein distance. You have to be careful with similarity approaches, though: you may end up filtering out important messages because they are structurally similar to an unimportant one.
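For anyone who wants to try this, here is a rough sketch of Levenshtein distance in plain Python, plus a normalized similarity score. The function names, the normalization by the longer string's length, and the sample messages are my own assumptions for illustration, not something from a particular library.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Hypothetical normalization: 1.0 means identical, 0.0 means nothing shared."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


if __name__ == "__main__":
    routine = "Disk usage at 45%"
    urgent = "Disk usage at 95%"
    print(levenshtein(routine, urgent), similarity(routine, urgent))
```

The two sample messages differ by a single character, so any threshold loose enough to group routine messages together would likely swallow the urgent one too. That is the failure mode described above.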
voldacar, 1187 days ago
Maybe you could use a language model embedding to define some kind of semantic distance.
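One common way to turn embeddings into a distance is cosine distance between the two vectors. A minimal sketch, assuming you already have an embedding for each message as a list of floats; which model produces those vectors is left open here, and `cosine_distance` is just an illustrative name.

```python
import math
from typing import Sequence


def cosine_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """1 - cosine similarity: 0 for vectors pointing the same way,
    1 for orthogonal vectors, 2 for opposite vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_u * norm_v)
```

Unlike edit distance, this compares whatever the embedding captures instead of the characters, so two messages with the same template but different meaning are not automatically close. How well that holds depends entirely on the model used.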