| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by unlinkr 2473 days ago
	No, you don't need more than a regular expression. If you want to extract elements, i.e. match start tags to the corresponding end tags, then you need a stack-based parser. But just to extract the start tags (which is the question) a regular expression is sufficient. The original question is a question about tokenization, not parsing, which is why a regular expression is sufficient.