| > Point me to some in-depth discussion about the ramifications of taking an unrestricted GPT model and giving it access to the internet. I'm just not aware of any such discussion, whether on HN or anywhere else. That's what I'm wondering about. Respectfully, you’re probably not seeing that question asked and answered because it doesn’t quite make sense as phrased. What does mean to “give an LLM access to the internet”? The same as your calculator doesn’t do anything until you put in some numbers and operators, an LLM doesn’t do anything unless you give it a prompt and some technical parameters. And then once it has those, it generates roughly the number of tokens (~words) you indicated in your parameters. Then, like your calculator, it’s done. It doesn’t do anything else until you put in another round of input. There are technical and computational limits that make both your prompt and the token limit fairly small. Several hundreds of words at most. Again, kind of like how your calculator might only with 8 or 9 digits. Now, you can give it “access to the internet” as part of responding to your prompt and fulfilling your token limit, and that’s roughly what Microsoft has done with Bing Assistant. They set it up so that Bing Assistant can take your prompt, generate a search query, and then give itself a new (still short) internal prompt with a summary of your request and the search results. And that’s pretty much what you get when you give an LLM access to the internet. The ramifications really aren’t that big, and we’re probably at least five or ten years of AI research and compute hardware development from making them interestingly bigger. (i.e. too far away to meaningfully guess what to expect) |
One point the article makes is that getting from a "prediction engine" type of AI to an "agent" type of AI is probably just a matter of sticking the prediction engine in the python loop that goes
It wouldn't be literally that easy, and the engine would require a lot of ChatGPT-style fine-tuning first, but it wouldn't require a completely novel breakthrough in machine learning.