Hacker News new | ask | show | jobs
by weird-eye-issue 223 days ago
It has to get the image data first, basically just IO time before processing it
1 comments

IIRC there's pre-processing (embedding/tokenization?) before feeding images to LLMs?

Hit this issue optimizing LLM request times. Ending up lowering image resolution. Lost some accuracy but could bear that.