| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by jhoelzel 1231 days ago

the title in question is misleading and likely the negative feedback received you received is aimed at that.

The primary objective of ChatGPT is not to serve as a finished product answering queries, but rather to undergo fine-tuning with respect to given prompts. This model is capable of utilizing a variety of sub-models and executing queries as necessary.

Let's consider an example where a user inputs the sentence "What is the capital of France?" This is how ChatGPT will answer you:

> 1 Preprocessing: The chat interface processes the user's input and recognizes that the user is asking for the capital of a country.

    2 Generation of prompt: Based on this information, the chat interface generates the following prompt: "The capital of France is __."

    3 Input to GPT-3: The prompt and the user's input are fed into the GPT-3 model.

    4 Generation of response: The GPT-3 model then generates a response based on the prompt and input, for example: "The capital of France is Paris."

    5 Postprocessing: The response generated by the GPT-3 model is postprocessed by the chat interface to ensure that it is grammatically correct and in the correct format. The final response returned to the user is: "The capital of France is Paris."

What i mean by that is that of course you will be able to use smaller more efficient models to get faster, quicker results and what ChatGPT really is, is the world training it what prompts look like. The "GPT Part" is only used after heavy preprocessing i would not be surprised if it already knew how to use its smaller models itself =)

1 comments

kettleballroll 1231 days ago

This does not at all sound how all LLMs I know work. Eg no model the size of ChatGPT should need grammar correction. Having that run client side sounds even more weird. Where did you take this information from?

link

jhoelzel 1231 days ago

from itself and other interfaces around.

And i didnt say client side, there probably is simply an api wrapper in between.

I also believe that this is how its moderation works. Chatgpt has not all "bad stuff" removed from its data, but there simply is an output filter.

The question always still is how to get from complex prompts to actual answers and boy have i tried complex prompts!

Of course im looking in from the outside with no validity to my claim but if i were to write a bot from scratch thats also how i would do it.

link

querez 1230 days ago

> from itself

That is a horrendously bad source. ChatGPT doesn't know how ChatGPT works, because nothing about how ChatGPT works was published on the internet at the time when ChatGPT was trained. Frankly, there's still nothing on the internet about this.

> Of course im looking in from the outside with no validity to my claim but if i were to write a bot from scratch thats also how i would do it.

ChatGPT wasn't "written" as much as trained. Writing a bot and training one require completely different mentalities.

link