Here's a slightly different use case. I want something that can index all open tabs in the browser so that I do not have to leave hundreds of tabs open.
There are semantic embeddings libraries that are fast enough to run on entire webpages in ~hundreds of millis or low single digit seconds. I've been thinking a lot recently about making a browser plugin that simply does semantic embedding on a paragraph level of every web page I ever visit, and store it in a vector database.
This would enable querying my little private search engine like "the HN story a few weeks ago that talked about ancient greek mining techniques" or "the reddit comment that had an analysis comparing Orwell's 1984 to the bible".
For those not familiar, semantic embeddings take a chunk of text and embed it in a high dimensional vector space (~hundreds of dimensions) where semantically similar texts are closer together.
That’s extremely interesting, I would argue that the reason for keeping tabs open varies, but is something along the lines of: re-reaching the page in the tab is too slow
More specifically, I do a lot of context switching and still try to maintain a reasonable amount of open tabs.
The problem that I have is that I can't remember days/months later if something I read is in an open tab or closed a while ago resulting in some frantic searches.
Tab grouping is one way to tame the madness. I’ve tried it with the default tab groups feature in Chrome but keep loosing the groupings whenever I restart chrome. Anyone know a better grouping extension?
This would enable querying my little private search engine like "the HN story a few weeks ago that talked about ancient greek mining techniques" or "the reddit comment that had an analysis comparing Orwell's 1984 to the bible".
For those not familiar, semantic embeddings take a chunk of text and embed it in a high dimensional vector space (~hundreds of dimensions) where semantically similar texts are closer together.