|
|
|
|
|
by dbish
838 days ago
|
|
Totally agree on the baseline. We’ve found that adding multimodal data like what was onscreen to be a big help to improve over this, which is a little more complex. Helps more to add action data to like who was typing in what, where the mouse was, etc. I’ve also been playing with pulling in knowledge base context or reading relevant web pages for unique words to create that initial prompt and custom vocab automatically. |
|