2. Turn unstructured, difficult to parse data adorned with HTML, and other window dressing into structured data that is easier to parse.