Hacker News new | ask | show | jobs
by FieryTransition 1040 days ago
One way to do it is to use cosine similarity[0], the reason to do this is to get around the context window limitation, and hope that whatever text chunks you get which via the similarity function is the correct information to answer your question.

How do you know that Claude doesn't do this? If you have multiple books, you end up with more than 100k context, and running the model with full context takes more time so it is more expensive as well.

[0] https://en.wikipedia.org/wiki/Cosine_similarity