I have not. From the changelog, the differences between my work and theirs are:
- Mac only vs Windows only
- They have already wired some AI stuff like speech recognition (trivial with Whisper these days, I was able to use it to generate synchronous lyrics ala karaoke to my home music collection in about 1 week of coding. Unlike video does not require much compute)
- They have slick GUI and presumably reliable recording - as I did not decide to productize it yet, I only have 2 global hotkeys to start and stop.
- I capture more data: keyboard + focus, gaze traces, and mouse traces. This will allow better behavioral models (they could and probably should have an option to do it too). I especially rely on gaze, as it is a very dense data channel.
- I have functionality to replay user actions both to just view, and to actually replay them (this is where copilot-like AI will eventually be connected).
It was funny to see the codename of my project in one of their screenshots as a label on a control.
- Mac only vs Windows only
- They have already wired some AI stuff like speech recognition (trivial with Whisper these days, I was able to use it to generate synchronous lyrics ala karaoke to my home music collection in about 1 week of coding. Unlike video does not require much compute)
- They have slick GUI and presumably reliable recording - as I did not decide to productize it yet, I only have 2 global hotkeys to start and stop.
- I capture more data: keyboard + focus, gaze traces, and mouse traces. This will allow better behavioral models (they could and probably should have an option to do it too). I especially rely on gaze, as it is a very dense data channel.
- I have functionality to replay user actions both to just view, and to actually replay them (this is where copilot-like AI will eventually be connected).
It was funny to see the codename of my project in one of their screenshots as a label on a control.