JS can be removed from the final document using the -j flag. HTML Files can also be grepped for content, unlike PDFs.