by applgo443 1046 days ago
How do you approach the problem of deciding which files to look into to fix a bug? Embeddings alone don't seem to cut it.
1 comment

We use some simple ranking heuristics detailed here: https://docs.sweep.dev/blogs/building-code-search

One thing we also do is match any files mentioned in the issue. So if you mention sweepai/api.py, we'll find that file and add it to the fetched files. There's still more work to be done here, so look out for updates!
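For illustration, here's a minimal sketch of that kind of file-mention matching. It's not Sweep's actual code: the function name `files_mentioned` and the tokenizing regex are assumptions made up for this example.

```python
import re
from pathlib import PurePosixPath


def files_mentioned(issue_text: str, repo_files: list[str]) -> list[str]:
    """Return repo files whose full path or base name appears in the issue text.

    Hypothetical helper for illustration only.
    """
    # Pull out path-like tokens such as "sweepai/api.py" or "api.py",
    # dropping a trailing period left over from the end of a sentence.
    tokens = {t.rstrip(".") for t in re.findall(r"[\w./\-]+", issue_text)}

    matched = []
    for path in repo_files:
        name = PurePosixPath(path).name
        # Match either the full repo-relative path or just the file name.
        if path in tokens or name in tokens:
            matched.append(path)
    return matched


repo = ["sweepai/api.py", "sweepai/utils/search.py", "README.md"]
issue = "The handler in sweepai/api.py crashes on startup."
mentioned = files_mentioned(issue, repo)
```

Matching on the bare file name as well as the full path catches issues that only say "api.py", at the cost of false positives when several files share a name.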

Likely file-name-based scoring and other rules, plus fine-tuned retrieval models (opt-in, of course).
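As a rough idea of what file-name-based scoring could look like (purely a sketch; `filename_score`, `rank_by_filename`, and the overlap formula are assumptions, not anything Sweep has described):

```python
import re


def filename_score(query: str, path: str) -> float:
    """Fraction of the path's name terms that also appear in the query.

    Hypothetical scoring rule for illustration only.
    """
    query_terms = set(re.findall(r"[a-z0-9]+", query.lower()))
    path_terms = set(re.findall(r"[a-z0-9]+", path.lower()))
    if not path_terms:
        return 0.0
    return len(query_terms & path_terms) / len(path_terms)


def rank_by_filename(query: str, paths: list[str]) -> list[str]:
    # Highest score first; sorted() is stable, so ties keep their input order.
    return sorted(paths, key=lambda p: filename_score(query, p), reverse=True)


paths = ["docs/readme.md", "sweepai/api.py", "sweepai/search.py"]
ranked = rank_by_filename("search ranking is broken in the api", paths)
```

In practice a score like this would presumably be combined with the embedding similarity rather than replace it.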