|
|
|
|
|
by vercantez
820 days ago
|
|
GPT-4V is great for reasoning about what is on the screen. However, it struggles with precision. For example, it is not able to specify the coordinates to tap when it decides to tap an icon. That's where the object detection and accessibility elements help. We can precisely locate interactive elements. |
|
Was one thing I never got around to testing with DemoTime but was always curious about.
Anyway sorry this is a nice product. Congratulations on the launch.
Always good to see substantial tech