| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by seinvak 175 days ago

> Would love to learn more about how this is built. I remember a similar project from 4 years ago[0] that used a classic BERT model for NER on HN comments

Yes, I saw that project pretty impressive! Hand-labeling 4000 books is definitely not an easy task, mad-respect to tracyhenry for the passion and hardwork that was required back then.

For my project, I just used the Gemini 2.5 Flash API (since I had free credits) with the following prompt:

"""You are an expert literary assistant parsing Hacker News comments. Rules: 1. Only extract CLEARLY identifiable books. 2. Ignore generic mentions. 3. Return JSON ARRAY only. 4. If no books found, return []. 5. A score from -10 to 10 where 10 is highly recommended, -10 is very poorly recommended and 0 is neutral. 6. If the author's name is in the comment, include it; otherwise, omit the key. JSON format: [ {{ "title": "book title", "sentiment": "score", "author" : "Name of author if mentioned" }} ] Text: {text}"""

It did the job quite well. It really shows how far AI has come in just 4 years.