| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mebutnotme 950 days ago
	I’d recommend trying to build out your systems to work across LLMs where you can. Create an interface layer and for now maybe use OpenAI and Vertex as a couple of options. Vertex is handy as while not always as good you may find it works well for some tasks and it can be a lot cheaper for those. If you build out this way then when the next greatest LLM comes out you can plug that into your interface and switch the tasks it’s best at over.

3 comments

quickthrower2 950 days ago

The problem is swapping LLMs can require rework of all your prompts, and you may be relying on specific features of OpenAI. If you don't then you are at a disadvantage or at least slowing down your work.

link

disconnection 950 days ago

I have a hierarchy of templates, where I can automatically swap out parts of the prompt based on which LLM I am using. And also have a set of benchmarking tests to compare relative performance. I treat LLMs like a commodity and keep switching between them to compare performance.

link

tin7in 950 days ago

Just curious are you using something specific for the tests?

link

bongobingo1 950 days ago

Just ask the LLM to rewrite your prompts for the new model.

link

worldsayshi 950 days ago

Does it really have that kind of self awareness to be able to do that successfully? I feel very sceptical.

link

Roark66 950 days ago

I doubt self awareness has anything to do with it..

link

worldsayshi 949 days ago

What else would you call the ability for it to adapt a task for its own capabilities?

link

mkl 949 days ago

Language modelling, token prediction. It's not much different from generating code in a particular programming language; given examples, learn the patterns and repeat them. There's no self-awareness or consciousness or understanding or even the concept of capabilities, just predicting text.

link

irthomasthomas 950 days ago

Just have it write 10 and bench them against your own.

link

rlt 950 days ago

Isn’t the expectation that “prompt engineering” is going to become unnecessary as models continue to improve? Other models may be lagging behind GPT4 but not by much.

link

te_chris 950 days ago

The dream maybe. You still have to instruct these natural language agents somehow, and they all have personalities.

link

pjmlp 950 days ago

Definitely, just like with games development, the key is to master how things work, not specific APIs.

AI tools will need a similar plugin like approach.

link

quickthrower2 950 days ago

I have a good idea how transformers work and have written Python code and trained toy ones, but end of the day right now calling OpenAI nothing I can build can beat it.

link

ramraj07 950 days ago

That would go as well as trying to write a universal android iOS app or write ansi sql to work across database platforms. A bad idea in every dimension.

link