
You seem to be suggesting implementing natural language processing as a series of regexes.

If NLP were that easy, we wouldn't have needed to invent transformer models, and we'd have had things as capable as ChatGPT around the time Microsoft was selling Encarta on CD.

The reality is, this soft fuzzy thing is the only practical way to minimise the Scunthorpe problem (and its equivalents for false negatives): https://en.wikipedia.org/wiki/Scunthorpe_problem
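
To make the failure mode concrete, here is a minimal sketch of regex-based filtering in Python. The blocklist, the function names, and the sample strings are all made up for illustration; this is not any real filter's implementation. It shows that a plain substring match produces false positives, and that tightening it with word boundaries trades those for false negatives.

```python
import re

# Hypothetical blocklist, chosen only for illustration.
BLOCKED = ["hell"]

def naive_filter(text: str) -> bool:
    """Flag text if any blocked word appears anywhere, even inside another word."""
    return any(re.search(re.escape(w), text, re.IGNORECASE) for w in BLOCKED)

def boundary_filter(text: str) -> bool:
    """Flag text only if a blocked word appears as a whole word."""
    return any(
        re.search(r"\b" + re.escape(w) + r"\b", text, re.IGNORECASE)
        for w in BLOCKED
    )

if __name__ == "__main__":
    samples = ["Hello from Shell Oil", "go to hell", "go to h3ll"]
    for s in samples:
        print(f"{s!r}: naive={naive_filter(s)} boundary={boundary_filter(s)}")
```

The substring version flags the innocent "Hello from Shell Oil" (the Scunthorpe-style false positive). The word-boundary version fixes that, but neither version catches the trivially obfuscated "go to h3ll". Each extra rule you add to close one gap tends to open another, which is the argument for a fuzzier, context-aware approach.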