|
|
|
|
|
by fudged71
951 days ago
|
|
Classic HN response. This is just an early taste of a potentially powerful use case. I understand the vision API doesn’t have memory, so each screenshot it takes is like an entire new context. If the script/application is able to send WHAT application it’s in, and has some RAG database in the backend to pull knowledge from, this would be incredibly useful. Of course it’s slow now. If you’re legitimately stuck, a couple seconds for a personalized answer is a perfect trade off. It will get better. |
|