Hacker News new | ask | show | jobs
by valstu 638 days ago
So regex version still beats the LLM solution. There's also the risk of hallucinations. I wonder if they tried to make SML which would rewrite or update the existing regex solution instead of generating the whole content again? This would mean less output tokens, faster inference and output wouldn't contain hallucinations. Although, not sure if small language models are capabable to write regex
1 comments

I think regex can beat SLM for a specific use case. But for the general case, there is no chance you come up with a pattern that works for all sites.