Edge has had this for a long time. I can highlight a string and right-click, 'Send to Copilot' and click "explain" and it'll prompt it to 'explain this passage, particularly in the context of the current page.'
Note: I have MS 365 personal or whatever it's called this week so I'm not sure how Copilot acts for a completely free user.
I'm guessing they'll integrate with the double-tap-the-bottom-of-the-screen feature that pulls up siri in front of a screenshot. Currently it doesn't seem to hook into "visual intelligence", and needs to call out to ChatGPT to do anything with the screen contents.
> double-tap-the-bottom-of-the-screen feature that pulls up siri
It’s disabled if not using Apple Intelligence, and can’t tap screen while talking to Siri (it dismisses instead).
Now they’re gating features to the M3 I’m not convinced wouldn’t work on expensive Apple Silicon predecessors… am more convinced the double tap disable is intentional.
Note: I have MS 365 personal or whatever it's called this week so I'm not sure how Copilot acts for a completely free user.