|
|
|
|
|
by ArnavAgrawal03
328 days ago
|
|
You can add OCR with Gemini, and presumably that would lead to better results than the OCR model we compared against. However, it's important to note that then you're guaranteeing that the entire corpus of documents you're processing will go through a large VLM. That can be prohibitively expensive and slow. Definitely trade-offs to be made here, we found this to be the most effective in most cases. |
|