I hate to be that guy, but if you’re looking for an open source alternative to Algolia, Typesense is another alternative to consider: https://github.com/typesense/typesense
It doesn’t have fancy “AI” features, but has all the other fundamentals for fast and performant search covered, including UI components.
Since this is indeed just a marketing page, I don't see enough implementation details to tell, but in general I worry that this kind of thing makes it easier for people to find stuff when they're being vague and harder to find stuff when they're being specific. It's like how Google these days will so "helpfully" ignore the words that you search for and instead give you what it's convinced you totally wanted.
Can I search HN, sort by date and hide all the hits which have never been shown in the top 30? Occasionally I want to find something I've seen among the titles on the HN index page some months ago.
In the case of HN, I assume that being once in top 30 would add a minimum number to the vote count. So a simpler query would be "sort by date and hide all the hits which have below 100 votes".
The real "have never been shown in the top 30" would need an actual event tag added to each post when they reach top 30. Then Algolia can search that I guess. Hidden tag search is a common practice.
You can only sort by popularity, which maps to 'score' in the data structure that the API sorts. You can see that here: https://github.com/HackerNews/API
The highest position on front page is not, AFAICT stored in any way and is not searchable.
Every search company will be expected to have machine learning capabilities. However, if you simply bolt them on top instead of considering the fast volume of additional data as part of the core engine, the experience will suffer greatly and the results will be subpar.
To be fair, Algolia's marketing is great. They did a great job with the announcement. Looking forward to so how well it actually works and what the pricing is going to be. I don't think OpenAI have announced pricing for the GPT-3 service (which this is based on) yet, but I'm certain it won't be cheap to start with.
This seems more spam than anything else. What's new about their approach? Why is this relevant? I don't want to be mean but I don't understand why this is in Hacker News
They are the official search for HN itself. That should have given you the hint. They are a YC company, so surely these posts rank up, perhaps because the alum group do some voting for peers. Which I feel is a good side-effect of being in YC.
It doesn’t have fancy “AI” features, but has all the other fundamentals for fast and performant search covered, including UI components.