Hacker News new | ask | show | jobs
by Spivak 1143 days ago
Oh sure! So the OCR is a statically linked build of tesseract based on [1] and pytesseract [2] which is a super thin wrapper but easier than writing it yourself. The I stole/modified the prompt from [3] to get the bot to write Python programs that do date calculations. Then I used [4] to take the output in case the Llm didn't use the format I asked. I run the Python code it generates in a rootless container that uses the Lambda RIE [5] because I was too lazy to make my own thing. So I'm mildly lied about the v8 isolate because everyone gets the implication and who wants to hear about RIE and seccomp profiles.

[1] https://github.com/wingedrhino/static-tesseract

[2] https://pypi.org/project/pytesseract/

[3] https://python.langchain.com/en/latest/modules/agents/tools/...

[4] https://pypi.org/project/dateparser/

[5] https://docs.aws.amazon.com/lambda/latest/dg/images-test.htm...

1 comments

This is amazing, thank you for sharing!