| Ohi, I'm the author of the open source Searx metasearch engine. I'm working on a self-hosted search service called Hister with the same goal when I started Searx development: reduce dependence on online search engines. Hister is a full text indexer for websites and local files which automatically saves all the visited pages rendered by your browser. It provides a flexible web (and terminal) search interface & query language to explore saved content with ease or quickly fall back to traditional search engines. This is a fundamentally different approach than what Searx follows and solves most of the weaknesses of metasearch engines. Of course it has its own weaknesses as well, but most of these are not conceptual and can be resolved by improving the software (and datasets) I've been using it for a few months and as my local index is growing I can avoid relying on external search engines - and even websites listed in results - more and more frequently. The initial reception is overwhelmingly positive with already more than 30 contributors and hundreds of contributions. Currently it can help with "recall" type searches mainly, but I'm planning to provide pre-indexed thematic datasets and I'm drafting a peer-to-peer index sharing concept. Maybe you can find it useful as well (or at least have some constructive criticism =]). Links:
- https://hister.org/
- https://github.com/asciimoo/hister
- Background/motivation/beginnings: https://hister.org/posts/how-i-cut-my-google-search-dependen...
- Small read-only demo: https://demo.hister.org/ |
The only feedback I have is the initial indexing from my large history was rough. There were a lot of domains that kept blocking me for exceeding rate limiting or wouldn't let me index at all. I could see it being useful to import a history file and organize it by domain inside some sort of temporary database to track/distribute attempts and get a more detailed report on complete domain failures.
Regardless though - great work!