Hacker News
by
martian
6038 days ago
This line seems problematic:
use strict;
Most of the web is messy. Beautiful Soup and its ilk would seem like a better choice for parsing.
2 comments
gloob
6038 days ago
Ahem.
http://www.perl.com/doc/manual/html/lib/strict.html
martian
6038 days ago
Ouch, should have RTFM. Thanks for the pointer.
geoscripting
6038 days ago
strict is a standard Perl module (a pragma that governs the script's own code, not the HTML it parses). HTML::TreeBuilder seems to work just as well with malformed HTML.
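To make that concrete, here is a minimal sketch, assuming HTML::TreeBuilder is installed from CPAN. The HTML string is made up for illustration, with several closing tags deliberately left out. `use strict` only requires that the script's own variables be declared; it does not affect how forgiving the parser is.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical, deliberately malformed input: the <li>, <p>, and <b>
# elements are never closed.
my $html = '<ul><li>one<li>two</ul><p>Unclosed <b>paragraph';

# 'use strict' applies to this Perl source (for example, $html and $tree
# must be declared with 'my'); it says nothing about the HTML itself.
my $tree = HTML::TreeBuilder->new_from_content($html);

# Collect the text of every <li> element the parser recovered.
my @items = map { $_->as_text } $tree->look_down(_tag => 'li');
print "$_\n" for @items;

# Free the tree explicitly, as the module's documentation recommends.
$tree->delete;
```

The parser infers the missing end tags, so the list items are still reachable through `look_down` even though the markup is invalid.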