Hacker News new | ask | show | jobs
by dbrereton 1555 days ago
Lots of great new search engines popping up that search the rest of the web that Google tends to ignore.

Other ones worth checking out include:

- https://search.marginalia.nu/ (A non-commercial search engine)

- https://wiby.me/ (Tends to have those really weird and cool indie sites)

- https://searchmysite.net/ (An index of personal websites)

- https://indieweb-search.jamesg.blog/ (Search IndieWeb websites)

- https://millionshort.com/ (Ignore the first million results from Google)

8 comments

I think this is among the more complete lists:

https://seirdy.one/2021/03/10/search-engines-with-own-indexe...

I was thinking of making a small site showcasing indie search engines or small search engine projects ig
Wish someone would meta search these search engines
There's fairly aggressive bot activity when you run a search engine, which is why many of us are behind cloudflare or similar bot-mitigation services.

Best guess is the bots are attempting to manipulate Google or Bing's suggestion algorithms this way (since a lot of small search engines are basically just forwarding results from bigger engines). Unfortunately they can be very aggressive to the point of almost bringing down your service. I saw up to 30k searches per hour before I cried uncle and got behind cloudflare.

I do offer an API, and I suspect some others in the indie search space might as well (or might at least be willing to set one up). That way I can rate limit on a per-consumer basis without affecting everyone.

Searx and SearxNG support a few of these, but instances get blocked and unblocked left and right.

eTools is a metasearch engine that uses commercial APIs for its search providers, so it does not get blocked. However, each search is a bit expensive for eTools, so users might get blocked by eTools instead.

+1 for andisearch also. Totally different approach. Have barely seen it talked about on here. The results are dope, however. I very much appreciate marginalia as well. I applaud the programmers making new and interesting tools in this space.
Credit to them for trying some new things on the UI front, but it looks like the organic results are from Bing (like most other alt/privacy search engines). It would be interesting to learn more about how/if they plan to build their own index, or set themselves apart.

IMO, Kagi and Brave search are the two best alternative general search engines right now.

Runnaroo was pretty good as well ;-)

I think Mojeek is a better alternative "general" engine than Brave because Brave's ranking algorithm was optimized against Google SERPs (back when it was called Cliqz), making many SERPs too similar to Google's.

Kagi uses a mix of Teclis and other engines (claims to use Bing and Google) but the ability to adjust ranking yourself is its wild card. Neeva is similar, combining its own index with some ability to influence ranking.

But personally, I'm trying to reduce my use of "one engine to rule them all" and instead use specific engines for specific tasks they're good at.

Mojeek is excellent, and because they use 100% their own index they have a much higher hill to climb.

When I say "Best"for a general search engine, my definition is that it would fulfill the needs of myself and my non-technical family members. Kagi and Brave Search both do that while being different enough to not be just another Bing clone. I use Mojeek often, think it is great, and having their own index is a tremendous asset, but it doesn't quite meet that full definition yet.

An additional thing of interest. The text of the results such as heading and summary on andisearch is not the same as the other search engines. I think it displays more of each web page's content from its index.
Interesting. With search 'java lambda function equivalent' I see results different to Bing, Google and DuckDuckGo. Greater difference still for "elon musk latest news". My guess, they use Bing where they lack their own index.
Curious as to how these engines accept new sites into their ranks. A big problem most of us have had is spammy results outranking everything else by building large, fake networks of sites that boost each other's rankings via interlinking. Many of the higher end networks are undetectable, as they have legitimate content and never link to more than 1 other internal site (among a mix of external sites, some affiliated with still other networks).
I use Personalized PageRank giving disproportionate voting power to a bunch of people whose entire identity is their passionate dislike for the commercial web. To get a really high rank in my search engine, you need to convince a those people to link to your site.
One approach is to have human testers. When a low quality site gets high rank, you investigate in detail how that happened and downrank the linking sites.

It shouldn't be that hard to find the bad network if you're systematically investigating all the time. Google has people testing search results often.

The problem is that this is fairly expensive. But quite possibly not the largest cost a search engine would have.

Wiby has been a great experience. I love being able to contribute sites to it too, to.
http://yup.is for business news
Curious if anyone has recommendations for great enterprise search?
Algolia. It is used on here on the search bar on the end of the page.
It is insanely expensive though. I think all of your indexes are stored in RAM.
you.com is worth checking out