by Epa095
962 days ago
True, but the example in the post (Ask HN: Who is Hiring) shows where it CAN be useful: when they are not just collecting emails for generic spam, but when the addresses are more valuable because they can be used for targeted ads (e.g. an ad for a programming job sent out to programmers). Those lists are worth more. One can also imagine using GPT as part of a pipeline. Maybe one can extract most emails using regexps or similar measures, then use computationally simple heuristics to collect paragraphs of text that are likely to contain emails the regexps did not manage to extract, and then use ChatGPT (or cheaper LLMs) to extract only from the most valuable paragraphs. It would probably also be useful (for the spammers) to be able to categorize the paragraph, so programmers get programming-spam and chefs get chef-spam.