Hacker News new | ask | show | jobs
by dbish 838 days ago
Totally agree on the baseline. We’ve found that adding multimodal data like what was onscreen to be a big help to improve over this, which is a little more complex. Helps more to add action data to like who was typing in what, where the mouse was, etc.

I’ve also been playing with pulling in knowledge base context or reading relevant web pages for unique words to create that initial prompt and custom vocab automatically.

1 comments

Gotta add facial recognition too so the notes can include "Bob rolled his eyes slightly when Alice mentioned the new reporting procedures."
Lol. I should add that as an option you can toggle