Hacker News new | ask | show | jobs
Show HN: I used an LLM to parse HN Who's Hiring comments to structured job data (hnjobsai.vercel.app)
16 points by tekmaven 900 days ago
Hello! I used an LLM to extract structured data from each comment on the “Ask HN: Who is Hiring? (January 2024)” post, available at: https://news.ycombinator.com/item?id=38842977. This created a dataset of 400+ jobs with data such as title, company name, location, is job remote, salary data, and the URL/email to contact. For the UI, a NextJS static site was created to filter and view the structured data.
6 comments

Funny that only 51 out of 475 jobs show the salary while tech people are very vocal against not showing it.
Good stuff. Remote is not a boolean though. It would be good to specify:

* Which countries can apply for the role (or is it Global)

* Which timezones can apply for the role (or is it Global)

If not specified you could assume you need to be a citizen of their country and in their time zone +/- 3 hours.

I think this is where an LLM can shine, it may be able to make good guesses about this stuff.

Nice, a time/date on each comment might be useful, so you can find the jobs posted an hour ago rather than 5 days.
Thanks for the suggestion. I added the time posted as the first piece of displayed metadata. Also, I updated the sort order to display the newest jobs first.
Awesome!

Small suggestion, move the "Link to original comment" button, to the HN comment drop-down.

Thanks for the feedback! I've moved it below the original comment now. I made some small adjustments to the original comment display as well.
Can we get a "Kubernetes" added to the technologies filter?
Pretty awesome!!