Many comments criticize using an LLM for this use case, but I do believe this will become more common in the future. For example, OpenAI's retrieval plugin uses an LLM to do PII detection [1] instead of using traditional libraries [2].
For this specific problem, I trust the large number of companies that have product lines with dedicated test suites more than I trust a random LLM. Sometimes it’s better to pick the correct specialized tool for a job than a random general-purpose one.