|
|
|
|
|
by simonw
163 days ago
|
|
For covering the risk of mistakes I suggest considering ways of "visually quoting" the documents. If the summary says "closing timeline: X" but there's an icon I can click that pops open an overlay with a visual cropped screenshot of that part of the original PDF - maybe even with a red circle around that detail - I can trust those summaries a whole lot more. Gemini 2.5 has image bounding box and masking features that can help with this (sadly missing from Gemini 3.) |
|
Quick question are you talking about this feature?
https://docs.cloud.google.com/vertex-ai/generative-ai/docs/b...
Because it’s just using structured response so it should be doable with Gemini 3 ? (We are using Gemini 3 for some docs processing and its visual understanding is just incredible)