Hacker News new | ask | show | jobs
by mebutnotme 950 days ago
I’d recommend trying to build out your systems to work across LLMs where you can. Create an interface layer and for now maybe use OpenAI and Vertex as a couple of options. Vertex is handy as while not always as good you may find it works well for some tasks and it can be a lot cheaper for those.

If you build out this way then when the next greatest LLM comes out you can plug that into your interface and switch the tasks it’s best at over.

3 comments

The problem is swapping LLMs can require rework of all your prompts, and you may be relying on specific features of OpenAI. If you don't then you are at a disadvantage or at least slowing down your work.
I have a hierarchy of templates, where I can automatically swap out parts of the prompt based on which LLM I am using. And also have a set of benchmarking tests to compare relative performance. I treat LLMs like a commodity and keep switching between them to compare performance.
Just curious are you using something specific for the tests?
Just ask the LLM to rewrite your prompts for the new model.
Does it really have that kind of self awareness to be able to do that successfully? I feel very sceptical.
I doubt self awareness has anything to do with it..
What else would you call the ability for it to adapt a task for its own capabilities?
Language modelling, token prediction. It's not much different from generating code in a particular programming language; given examples, learn the patterns and repeat them. There's no self-awareness or consciousness or understanding or even the concept of capabilities, just predicting text.
Just have it write 10 and bench them against your own.
Isn’t the expectation that “prompt engineering” is going to become unnecessary as models continue to improve? Other models may be lagging behind GPT4 but not by much.
The dream maybe. You still have to instruct these natural language agents somehow, and they all have personalities.
Definitely, just like with games development, the key is to master how things work, not specific APIs.

AI tools will need a similar plugin like approach.

I have a good idea how transformers work and have written Python code and trained toy ones, but end of the day right now calling OpenAI nothing I can build can beat it.
That would go as well as trying to write a universal android iOS app or write ansi sql to work across database platforms. A bad idea in every dimension.