Hacker News new | ask | show | jobs
by startupsfail 1178 days ago
Yes, relatively old. The issue is, this approach is designed to work with the “classical” language model, trained using “The Pile” methods. This particular one was Palm 540b.

So essentially you have an approach, designed to work on these models that are not really instruction following models and that truly are stochastic parrots.

The models had changed substantially since. But the approach of chaining them in this particular way stuck. And is getting copied everywhere, without much thought.

1 comments

Your answer doesn't make any sense regarding langchain to be honest.
Sure. I’m just expressing my opinion that the design is suboptimal and that the level of design is literally “You are GPT-3 and you suck at math” [quote from the LangChain code base].

I don’t want to see further expansion of this. I’m not offering higher level design, because I’m not sure about safety of all of this.

But having a poor and potentially unstable design like LangChain also doesn’t contribute to it :-/

Sorry to bring up an older thread, but I was looking into LangChain recently, and I was thinking of making something similar but in other languages. Do you have any insight into what direction is better to move in for LangChain-like alternatives?