Hacker News
by gruez 529 days ago
>Even that can fail. I’ve had multiple cases now where my name (!) was auto-parsed wrong in a way human would never do, not even after copy-pasting into Notepad. [...]

sounds like they did something like:

    first_name, last_name = name.split(" ")
which is an issue, but unrelated to what everyone else is talking about, which seems to be how text data is parsed/encoded inside PDFs. Pasting into Notepad would catch that kind of issue, but obviously wouldn't do anything about bad first name/last name extraction logic.
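To make the failure mode concrete, here is a rough sketch of how that one-liner behaves on names that aren't exactly two words separated by one space. The function names `naive_split` and `lenient_split` are made up for illustration, and the "lenient" version is only one possible guess at a fallback (treat the last token as the surname), not what any real resume parser is known to do.

```python
from typing import Tuple


def naive_split(name: str) -> Tuple[str, str]:
    # The one-liner from above: unpacking raises ValueError unless
    # name.split(" ") yields exactly two pieces.
    first_name, last_name = name.split(" ")
    return first_name, last_name


def lenient_split(name: str) -> Tuple[str, str]:
    # Hypothetical fallback (an assumption, not a known parser's logic):
    # collapse runs of whitespace and treat the last token as the surname.
    parts = name.split()
    if not parts:
        return "", ""
    if len(parts) == 1:
        return parts[0], ""
    return " ".join(parts[:-1]), parts[-1]


if __name__ == "__main__":
    for name in ["Ada Lovelace", "Mary Jane Watson", "Ada  Lovelace", "Cher"]:
        try:
            print(repr(name), "naive:", naive_split(name))
        except ValueError as exc:
            print(repr(name), "naive failed:", exc)
        print(repr(name), "lenient:", lenient_split(name))
```

Even the lenient version is still guessing: for a name like "Ludwig van Beethoven" it returns ("Ludwig van", "Beethoven"), which a human would likely split differently. It avoids the crash, not the underlying ambiguity.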

I felt that the general topic was "software failing to do the right thing parsing the text in PDF resumes in ways a human would get right", which includes all of these types of problems, not specific to encoding issues. But, sure.