Hacker News
by pjscott 5325 days ago
Another anecdote: I was writing an HTML-to-text converter. The prototype used lxml and some custom DOM-traversal and formatting logic in Python. I got about a 17x speedup from porting the thing to use C and libxml2 (the parser that lxml uses). The port to C took most of an afternoon, and it's currently chewing through a lot of HTML without a problem.
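For anyone wondering what the "formatting logic" half of an HTML-to-text converter involves, here is a minimal sketch. It is not the original code: it assumes Python's standard-library `html.parser` (event-based) instead of lxml's DOM so that it runs with no dependencies, and the names `TextExtractor`, `html_to_text`, `BLOCK_TAGS`, and `SKIP_TAGS` are made up for illustration.

```python
from html.parser import HTMLParser

# Hypothetical tag sets chosen for this sketch, not taken from the original converter.
BLOCK_TAGS = {"p", "div", "br", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}
SKIP_TAGS = {"script", "style"}


class TextExtractor(HTMLParser):
    """Collects visible text, inserting line breaks around block-level tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered text
    # Collapse runs of whitespace within each line and drop empty lines.
    lines = (" ".join(line.split()) for line in "".join(parser.parts).splitlines())
    return "\n".join(line for line in lines if line)


if __name__ == "__main__":
    print(html_to_text("<h1>Title</h1><p>Hello <b>world</b>!</p>"))
```

A libxml2 version in C would presumably do the same walk over the parsed tree instead of reacting to parser events; the per-node work is small, so most of the time goes into interpreter overhead, which is where a port to C can pay off.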
1 comment

I can confirm that libxml2 is indeed great.