by xcv123 936 days ago
The scenario is that a customer opens a chat box on your website and asks the LLM some questions.

You wouldn't expect your customers to search your internal Confluence pages. The LLM would be trained on all of your internal documentation, which is not exposed publicly.

Hallucination in the current generation of LLMs is mostly a problem of insufficient training.

Edit: Maybe not "all" of your internal docs should be exposed via LLM. But the idea is this is an interactive support agent for customers.

1 comment

That sounds like a dangerous scenario. If your docs are intentionally internal and not public, why would you let a publicly accessible LLM answer questions with info from them?

An LLM trained on public docs for the public could be a better interface for projects with lots of public documentation.

An LLM trained on internal docs and accessible only to internal users might be similarly useful.

Even a private LLM on public docs for your support agents to use could increase their efficiency.

But I would never expose an LLM to the public that has been trained on data I don't want public.

Yes, hence the quick edit to my comment above, made just before you replied.