Hacker News new | ask | show | jobs
by jdnier 249 days ago
If you want to do this rigorously, I suggest you read Robert D. Cameron's excellent paper "REX: XML Shallow Parsing with Regular Expressions" (1998).

https://www2.cs.sfu.ca/~cameron/REX.html