Hacker News
by madarco 762 days ago
Nice demo, but it lacks one feature to make it practical: splitting the input documents into chunks. Without it, each embedding covers a whole document and ends up too broad to match specific queries, and every retrieved doc gets put into the prompt in full, which consumes a lot of input tokens and makes the request slower and more expensive.
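
For what it's worth, here is a rough sketch of the kind of chunking I mean. It is not taken from the demo: the function name `split_into_chunks`, the word-based splitting, and the default sizes are all my own assumptions, and words are only a crude stand-in for tokens.

```python
def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into word-based chunks, with some overlap between neighbours.

    Hypothetical helper: sizes are counted in whitespace-separated words,
    which only approximates the token counts a real model would see.
    """
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # this window already reached the end of the document
    return chunks
```

You would then embed each chunk separately and put only the best-matching chunks into the prompt, instead of whole documents. The overlap is there so a sentence cut at a chunk boundary still shows up intact in one of the two neighbours.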