Slurping nicely formatted data from known sources gives them nicely formatted data that they can then cross-reference and use to help data-mine the raw data.