Hacker News new | ask | show | jobs
by nikhonit 185 days ago
I built this because my YouTube 'Watch Later' playlist hit 500+ videos and became a source of anxiety rather than a queue. I wanted to know if a 2-hour podcast contained the specific citation I needed without watching the whole thing.

Under the hood: Recapio grabs the transcript (prioritizing manual captions over auto-generated ones) and uses an LLM to generate structured summaries with timestamped citations.

The Engineering Challenge: The biggest headache was 'hallucination drift'—where the AI summary claims a topic starts at 10:00, but it actually starts at 10:45. I solved this by implementing a chunking strategy that overlaps context windows, forcing the model to verify timestamps against the raw text segments before outputting the link.

It’s a work in progress. I'm curious if anyone has better strategies for handling the lack of punctuation in auto-generated YouTube captions

1 comments

Cool How can ping you if I may ask please?