by nostrebored 582 days ago
Poorly, just like it does for text.

Chunking is easily where all of these approaches die once you get beyond PoC scale.

I’ve talked to multiple code generation companies in the past week; most are stuck with BM25 and taking in whole files.

1 comment

What do they use BM25 for? RAG?
Correct: finding the right functions and files to include in the prompt.
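
For anyone who hasn't seen it, here is a rough sketch of what BM25 retrieval over whole files might look like. This is not what any of those companies actually run; the file names, the toy tokenizer, and the `bm25_rank` helper are all made up for illustration, and the scoring is the textbook Okapi BM25 formula with the usual default parameters (`k1=1.5`, `b=0.75`).

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and split on non-alphanumerics; a crude stand-in for a real
    # code-aware tokenizer (assumption, not from the thread).
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_rank(query, docs, k1=1.5, b=0.75):
    """Return (name, score) pairs sorted by descending BM25 score.

    `docs` maps a hypothetical file name to its full source text.
    """
    tokens = {name: tokenize(text) for name, text in docs.items()}
    n = len(tokens)
    avg_len = sum(len(t) for t in tokens.values()) / n
    # Document frequency: how many files contain each term at least once.
    df = Counter(term for t in tokens.values() for term in set(t))
    query_terms = set(tokenize(query))

    scores = {}
    for name, toks in tokens.items():
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            # Smoothed idf that stays non-negative for common terms.
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturates via k1; b controls length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores[name] = score
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy corpus: each "document" is an entire file, i.e. no chunking at all.
docs = {
    "auth.py": "def login(user, password): return check_password(user, password)",
    "db.py": "def connect(url): return open_connection(url)",
    "utils.py": "def slugify(title): return title.lower().replace(' ', '-')",
}
ranked = bm25_rank("password login", docs)
```

The top entry in `ranked` is the file you would paste into the prompt. Note that this only matches on literal tokens, which is part of why it holds up on identifiers but does nothing for the chunking problem described above.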