by Retr0id 493 days ago
> Parsing Is Not a Science

It can and should be, though. I feel like we should have a separate word for parsing when the rules are not well-defined, something like "fuzzy parsing" (in a similar vein to fuzzy string comparison).

3 comments

Renaming the problem doesn’t make it go away. It might be useful for identifying the subset of parsing which is problematic, but I think the article already achieves this well by specifying the subset of input under discussion.
It doesn't make the problem go away, but it makes it clear that parsing itself is not the problem.
It's "scraping".
It’s called “guessing”.