Hacker News new | ask | show | jobs
by Nevolihs 750 days ago
Ideally OP would keep the source images of the original journal pages around even after transcription. I think ChatGPT (or LLM in general) is probably the best option, but the best overall solution would accept that LLMs are flawed and would require long-term iteration.
2 comments

The problem with ChatGPT is that you might not know to check the original.

If the original text is “I’m getting married on the 10th July”, you’ll know to check the handwritten note if it says “I’m getting married on the l@ July” but not necessarily if it says “on the 16th July”. ChatGPT seems to do the second quite often.

Thanks all, I tried ChatGPT and it didn’t like my handwriting at all.

Which is understandable… :’)

Have you considered training a model on your handwriting?
Yep! However that needs a ton of labeled data, so a bootstrapping method is required.

I like the idea of doing it by speech recognition, or of chopping it up for privacy and then outsourcing that to humans at cost.

One thing I … Imagine … would help—is having a private web app where I could pull up a document and then make a voice recording on my phone.

Maybe I’ll put this together on my plane trip.