| HN Mirror


	by nathanwh 245 days ago
	This reminded me of ViperGPT[1] from a couple of years ago, which is similar but specific to vision-language models. In both, a root LLM takes a query and produces a Python program that decomposes it into separate steps, and the generated program calls a sub-model. One difference is that this model has a mutable environment in the notebook, but I'm not sure how meaningful that difference is. [1] https://viper.cs.columbia.edu/static/viper_paper.pdf
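	A rough sketch of the shared pattern, to make the comparison concrete. This is not code from either system: `fake_root_llm`, `fake_sub_model`, and `run_root_program` are hypothetical stand-ins, and the "root LLM" here just returns a fixed program instead of calling a model. The `env` dict plays the role of the mutable notebook environment, so a second program can reuse variables left behind by the first.

```python
from typing import Any, Callable, Dict


def run_root_program(program: str,
                     sub_model: Callable[[str], str],
                     env: Dict[str, Any]) -> Dict[str, Any]:
    """Run a generated program with `sub_model` in scope, mutating `env`."""
    env["sub_model"] = sub_model
    # exec on model-generated code is unsafe outside a sandbox; sketch only.
    exec(program, env)
    return env


def fake_root_llm(query: str) -> str:
    """Hypothetical root model: returns a fixed decomposition program."""
    return (
        "parts = query.split(' and ')\n"
        "answers = [sub_model(p) for p in parts]\n"
        "result = '; '.join(answers)\n"
    )


def fake_sub_model(prompt: str) -> str:
    """Hypothetical sub-model: echoes the prompt it was given."""
    return f"answer({prompt})"


query = "count the cats and count the dogs"
env: Dict[str, Any] = {"query": query}

# Step 1: the root model writes a program; running it calls the sub-model.
run_root_program(fake_root_llm(query), fake_sub_model, env)

# Step 2: state persists, so a later program can read `answers` from step 1.
run_root_program("summary = f'{len(answers)} sub-calls made'", fake_sub_model, env)

print(env["result"])
print(env["summary"])
```

	Without the shared `env`, step 2 would have to recompute or be handed `answers` explicitly, which is roughly the difference the mutable environment makes in this toy version.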