Hacker News new | ask | show | jobs
by zamadatix 1188 days ago
Could always rely on the Levenshtein distance. You have to be careful with similarity approaches though as you may end up filtering important messages because they are structurally similar to the unimportant message.
1 comments

Maybe you could use a language model embedding to define some kind of semantic distance.