|
|
|
|
|
by JayShower
520 days ago
|
|
Alternative solution that would require less heavy lifting of ML but a little more upfront programming:
It sounds like the cars are arranged in a grid on the wall. Maybe it would be possible to narrow down which car the user took a photo of by looking at the photos of the surrounding cars as well, and hardcoding into the system the position of each car relative to one another?
Could potentially do that locally very quickly (maybe even at the level of QR-code speed) versus doing an embedding + LLM. Con of this approach would be that it’s requires maintenance if they ever decide to change the illustration positions. |
|