Hacker News new | ask | show | jobs
by liam-hinzman 404 days ago
Deduplication is mainly driven by LLMs with search results as context. Our entity resolution works well because Exa’s main business is crawling and indexing the web at scale, and we can control how we search across that within Websets.

As far as I know ChatGPT’s search is primarily a wrapper around another company’s search engine, which is why it often feels like it’s just summarizing a page of search results and sometimes hallucinates badly.

1 comments

Thanks for the info. That makes sense.

Looking forward to trying out the product more when I have a moment.