Hacker News new | ask | show | jobs
Show HN: An Embedding-Based Search Service over ShowHN, AskHN, GitHub, More (payperrun.com)
7 points by eneuman 962 days ago
Hey folks, Elias here. Excited to unveil my latest project.

Why I Built This: Traditional keyword search isn't cutting it. I've used LLM-embeddings to provide more nuanced, relevant results.

How It Works: LLM-embedding similarity on curated datasets for semantically similar results. No need to iterate over keywords any more.

Current Datasets:

- YC Companies - Show HN Posts, - Ask HN Posts - ProductHunt Startups - Github Top 200k Repos

Use Cases:

- Validate a product idea's existence - Check if someone already Asked HN something - Have fun - search random terms and see what pops up

Want to see other datasets? Got one in mind? What would you use it for? (I'm even thinking of adding all PG's tweets - so feel free to be creative.)

Eager for your feedback and ideas. Hope you find this useful and fun!

2 comments

I like the section on how it works: https://payperrun.com/%3E/search?display=How%20this%20servic...

The vector search is using https://lancedb.com/ and OpenAI embeddings.

This is will be really useful for research within the ingested domains! Awesome.
Thank you for the feedback!

If there are any other domains you'd like to see, let me know :)

I'm thinking Crunchbase would give you a comprehensive view into Silicon Valley (and adjacent) company data, but the data is behind an API.