Hacker News
by walshemj 1890 days ago
And ban the use of PDFs, which is another way this could be avoided.

Oh, and mandate clean links: no funky JavaScript links that search crawlers don't follow.


Per the main project link, the pricing information is required to be posted in JSON format with a specific schema and file names.
JSON is allowed, but not required. They just required open, non-proprietary formats. They specifically gave YAML and XML as other examples.
Search engines can index PDFs.
Yeah, but they don't work as well and won't be as discoverable.
Provided the content is not encrypted.
I stand corrected: encrypted PDFs are indexed unless a password is set. However, in the past search engines were unable to index them[0]: "Generally we can index textual content (written in any language) from PDF files that use various kinds of character encodings, provided they're not password protected or encrypted."

[0] https://developers.google.com/search/blog/2011/09/pdfs-in-go...

I think the assumption is that encrypted PDFs would not be used here, because users still have to view these documents.
No, the PDF standard explicitly allows encryption without setting a user password; in this case, only the owner password is set. The only change for the user is that the word "Secured" usually appears next to the filename in the reader.
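If you want to check whether a given file falls into this category, here is a rough sketch in Python. It relies on the fact that an encrypted PDF carries an /Encrypt entry in its trailer dictionary, whether or not a user password is set. The function name looks_encrypted is made up for illustration, and the byte scan is only a heuristic: it does not parse the file, so a document that merely contains the text "/Encrypt" somewhere would be flagged too.

```python
def looks_encrypted(pdf_bytes: bytes) -> bool:
    """Heuristic check for PDF encryption (hypothetical helper, not a parser).

    An encrypted PDF references an encryption dictionary through an
    /Encrypt entry in its trailer. This scans the raw bytes for that
    key instead of parsing the file, so false positives are possible.
    """
    if not pdf_bytes.startswith(b"%PDF-"):
        raise ValueError("input does not start with a PDF header")
    return b"/Encrypt" in pdf_bytes


if __name__ == "__main__":
    import sys

    # Usage: python check_pdf.py some_file.pdf
    with open(sys.argv[1], "rb") as f:
        data = f.read()
    print("encrypted" if looks_encrypted(data) else "not encrypted")
```

A file that reports "encrypted" but still opens without a password prompt is most likely the owner-password-only case described above.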