Hacker News new | ask | show | jobs
by eh9 1184 days ago
I just saw a project that lets you input an entire repo into GPT. Coincidentally, my place of employment just told us not to input any proprietary code into any generator with a retention policy.

Even then, I feel like the play will be an enterprise service instead of licensing.

2 comments

If it's the product I think it is (I don't recall the exact name), it's not putting the repo into GPT. It's calculating embeddings on the code in the repo, storing those in a vector db and providing context from the store when processing questions about the repo. Effectively when you ask "how does foo work" becomes 1. lookup code items related to foo getting 1-N copies of code. 2. ask GPT "here is code related to foo <result from 1>. Now answer the following question: how does foo work"
I think we’re talking about different projects. This one just gives you a text output of an entire repo.

https://github.com/mpoon/gpt-repository-loader

I was thinking about https://github.com/rahuldan/codesearch for context.
Link?