Hacker News new | ask | show | jobs
by tkgally 712 days ago
That capability probably exists now if you are willing to accept cloud-based models and only moderately-sized contexts. With Claude 3.5 Pro, for example, one can put one’s reference data into a Project and query the model with that data in the context. In my testing, at least, it works quite well. The Projects can be shared among multiple users, too. The context size is only about one-tenth that of Gemini 1.5 Pro, though, and even the latter is probably much too small for most organizational purposes.

Of course, many organizations and regulators would not allow cloud-based models for sensitive data. A possible solution in that case might be multiple instances of an open-weight model hosted locally within the same secure environments as the sensitive data that the individual employees have access to. I don’t know how expensive that would be, whether current open-weight models are powerful enough, or whether context windows for open-weight models can be made big enough to be useful. But at least it suggests a potential path to a solution that doesn’t require training an LLM from scratch for each employee.