Hacker News new | ask | show | jobs
by totoglazer 1249 days ago
This might be an interesting language model. However people care about ChatGPT entirely due to its quality, which this doesn’t demonstrate yet.
1 comments

The leap in public exposure wasn't so much GPT3 to GPT3.5, it was attaching a clean UI to the model, (with sane defaults) and allowing people to talk to it like a person.

Suddenly it became something 'real' then.

(This is purely talking about the public popularity of GPT)

This is mostly correct. GPT3.5 is better, has a larger context window, etc. But it's a very incremental step above GPT3.

I had wired up GPT3 to a Twilio phone number and made something basically like ChatGPT months before ChatGPT was released -- me and my friends texted it all the time to get information, similar to how people use ChatGPT. The prompt to get decent performance is super simple. Just something like:

    The following is a transcript between a human and a helpful AI assistant.
    The AI assistant is knowledgeable about most facts of the world and provides concise answers to questions.

    Transcript:
    {splice in the last 30 messages of the conversation}

    The next thing the assistant says is:
Over time I did upgrade the prompt a bit to improve performance for specific kinds of queries, but nothing crazy.

Cost me $10-20/mo to run for the low/moderate use by me and a few friends.

Interestingly, for people who didn't know its limitations / how to break it, it was basically passing the turing test. ChatGPT is inhumanly wordy, whereas GPT3 can actually be much more concise when prompted to do so. If, instead of prompting it that it is an AI assistant, you prompt it that it is a close friend with XYZ personality traits, it does a very good job of carrying on a light SMS conversation.

>If [...] you prompt it that it is a close friend with XYZ personality traits

A couple years ago a friend and I trained GPT-2 on our WhatsApp chat history. GPT-2 was more primitive, but it still managed to capture the gist of our personalities and interests, which was equal parts amusing and embarrassing.

We'd have it generate random chats, or ask it questions to see what simulated versions of ourselves would say.

I half remember one of Google’s many chat apps having an AI assistant a number of years ago (Allo maybe?)
They also did reinforcement learning on top of a frozen trained model. It is considerably more than just attaching a UI as that would just finish sentences compared to answering questions. https://huggingface.co/blog/rlhf
This is a remarkably good take that just didn't dawn on me until I read your comment. Even if ChatGPT had a lesser quality than the current iteration, the fact that they had a way for anyone to easily interact with it really was a homerun, snd can be for any software, really.
My family told me that ChatGPT came up from the pulpit AT CHURCH

Me? I made a few comments like a scared luddite when ChatGPT solved two of my outstanding engineering problems instantly.

I got better. But this is exactly right. The world in general now knows about AI and ML. It’s a pivot point.

When something scares a seasoned engineer for a minute, and anyone can now make use of this… write it down in your diary as a moment in history

I wonder what kind of engineering problems you’ve talked with ChatGPT. Because its limited understanding of spatial problems, I found it hard to use it for mechanical engineering. How did you do it?
I wrote another response earlier, but found myself ranting ;)

ChatGPT is currently best at things programmers would think about. You’re correct about spatial reasoning. But try stuff like this:

“Write a python program that calculates the static forces on a cantilevered ledge 15 feet long, with a support beam”

Haha it took the longest I’ve ever seen. You may have a point. It’s really good at writing code though.

Caution. I tried my example with matlab instead of python, and I think I may have set a server rack on fire ;)

One of the important parts of ChatGPT over plain GPT-3 is the reinforcement learning from human feedback to ensure alignment, without which it's not quite as good of a product for the public.
It's not just the UI. ChatGPT (which is further finetuned and uses RLHF) definitely produces better output than GPT3, especially without prompt engineering.
Some evidence to confirm this:

1. Twitter thread with examples: https://twitter.com/sjwhitmore/status/1601254826947784705

2. Tweet/screenshot + Colab notebook:https://twitter.com/aman_madaan/status/1599549721030246401, https://tinyurl.com/codex-chat-gpt

The second tweet is mine.

No. ChatGPT’s UI is incredibly simple and basically exactly what ever chat bot test repl looks like.

The delta of GPT3 -> ChatGPT is from the expanded context and control the model offers through fine tuning. Eg read the instructgpt paper to see the path on the way to ChatGPT.

Well yes, having no context memory, being slightly worse and requiring either a monster rig to run or paying per prompt made it completely and utterly irrelevant.

Even now that it's improved and free to use its actual practical usability is marginal at best given the rate of blatantly wrong info being spewed with 105% confidence at the moment.

> blatantly wrong info being spewed with 105% confidence

There are some approaches. For example in this paper they say truth has a certain logical consistency that is lacking in hallucinations and deception. So they find this latent direction that indicates truth in a frozen LLM. This actually works better than asking the model to self evaluate by text generation, or training with RLHF.

"Discovering Latent Knowledge in Language Models Without Supervision" https://arxiv.org/abs/2212.03827

There's also a video with the first author: "Making LLMs Say The Truth" https://www.youtube.com/watch?v=XSQ495wpWXs&t=1515s

Btw, I think this is one of the deepest discussions about LLM hallucinations and alignment I ever saw. Worth a watch, even if it is a bit long. Not every day something like this comes long.

Very interesting video - thanks for posting that.

It makes you wonder what other abstract concepts current models may have had to learn to get as good as they are. If they're doing a good job of modelling when someone is speaking the truth, then what else have they learnt about us?

How complete of a "world model" can you learn purely in a passive way by consuming whatever online text is available to train on, or maybe by consuming all existent written material were it to be digitized? At some point I'm sure you need to be able to interact with the world to test hypothesis etc, but how far can predictive "intelligence" go without that?

I dunno, I played a lot of AI Dungeon when it was GPT3 powered. It was quite nonsencial sometimes. ChatGPT is much better due to reinforcement learning and optimisation they did with it