Hacker News new | ask | show | jobs
by raunakchowdhuri 785 days ago
Interesting... how did you do the scraping of the documentation?
1 comments

I found the documentation for k8s (https://kubernetes.io/docs/home/) in this repo in PDF form: https://github.com/dohsimpson/kubernetes-doc-pdf, so this part ended up being pretty simple thanks to them! From there I used PyPDF to parse the docs for the vector store.

I haven't started digging into the RAG performance, so open to ideas on ways to improve this!

kubectl also offers the `kubectl explain someresourcekind.someproperty` command which might be a nice way to get docs / schemas for CRDs (custom resource definitions) - though you'd need your tool to essentially RAG from the cluster in question to introspect it this way which might be a bit hairy
I’ve noticed the GPT agent running kubectl explain when trying to diagnose an issue. I think it could be a nice enhancement to have a way to place that into the context ahead of time, but the commands would have to be a bit more rigid in that the user would have to specify the resource to explain ahead of time.

I’ll add an issue for this though since I think it could definitely improve the performance!