Hacker News new | ask | show | jobs
by klmadfejno 1920 days ago
An equivalent to excel's record macro functionality, but at the os level, based on what's on the screen using ocr.

Sikuli got close, but something with a little more AI behind it to get a sort of "fuzzy" macro recording.

3 comments

Windows at least has AutoHotKey[1] -- it's kind of awful, but also kind of great. It has been a path into programming for more than one person I know and has saved others countless hours of repetition.

1 - https://www.autohotkey.com/

An app I built, Keysmith [1], does something pretty close to what you're describing. It's macros anywhere on your Mac, but instead of using OCR we use accessibility APIs to identify UI elements by name. There's also a browser extension to do the same on the web.

And you can save macros for later use or just create a one-off "Scratch" macro as described here [2].

[1] https://www.keysmith.app/ [2] https://www.keysmith.app/guides/creating-and-running-macros

Do you mean Robotic Process Automation? It’s OS level and ticks all the blocks.
Conceptually yes. I've yet to see one that is built for an end user to make something on a one shot example though, with an output script that you can tweak.

Everything I've seen has been very enterprise-y.