If you want to play with something similar at a different level of abstraction (software that interfaces with the Google Cloud Vision API in a way that's accessible to non-technical folks), check out how it's being done in Metaverse [1]. Teachers, for example, are using this pattern to create "scavenger hunts" in their classrooms [0].