Hacker News
by codeaken 3358 days ago
When pattern matching goes wrong... Good find, we will fix this, thanks! :)
1 comment

It looks like you're running a regex against the raw page HTML.

I would guess you'd have better luck parsing the HTML and extracting the href attributes of any <link> tags, the src attributes of <script> tags, etc., and then pattern matching only against those values.
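
Roughly what I have in mind, as a minimal sketch using only Python's standard library html.parser. I don't know what your detector actually looks like, so the names here (URL_ATTRS, ResourceURLCollector, extract_resource_urls, detect) and the jQuery pattern are all made up for illustration:

```python
import re
from html.parser import HTMLParser

# Tag name -> attribute that holds a resource URL.
# Illustrative only; a real detector would likely cover more tags.
URL_ATTRS = {"link": "href", "script": "src"}


class ResourceURLCollector(HTMLParser):
    """Collects href values of <link> tags and src values of <script> tags."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        wanted = URL_ATTRS.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                self.urls.append(value)


def extract_resource_urls(html):
    """Return the resource URLs referenced by the page, in document order."""
    parser = ResourceURLCollector()
    parser.feed(html)
    parser.close()
    return parser.urls


# Hypothetical signature: a script file whose name starts with "jquery".
JQUERY_PATTERN = re.compile(r"jquery[^/]*\.js", re.IGNORECASE)


def detect(html, pattern=JQUERY_PATTERN):
    """Match the pattern against extracted URLs only, never the raw markup."""
    return [url for url in extract_resource_urls(html) if pattern.search(url)]
```

With this, a page that only mentions "jquery.js" in body text or inside an inline script no longer triggers a match, because the pattern never sees anything except the extracted attribute values.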