| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bbotond 1124 days ago
	Subscribe, try GPT-4 and never look back.

4 comments

bugglebeetle 1124 days ago

GPT-4 has been made worse in the ChatGPT UI as well since the May update. It makes many more strange errors and has trouble reasoning around complex problems and ambiguity. Prompts similar to stuff that worked fine for me last month now require multiple iterations of feedback. I’d switch to using the API, but I’d go over the equivalent $20 of usage pretty quickly.

chaxor 1124 days ago

The models tend to degrade when trained to be safer.

A GPT-4 talk on youtube by personnel from Microsoft has documented this phenomenon with the 'Tikz Unicorn' evolution shown in the GPT-4 technical paper. The model gets qualitatively better with more training, and then degrades when trained to be safer (against racism sexism, etc), but it is not entirely clear why. These would seem very unrelated, especially when considering work done in LM editing (ROME/MEMIT) and the decent localization of knowledge seen there.

So, perhaps both the "I'm sorry I can't..." and 'strange errors' are not entirely orthogonal.

carrolldunham 1124 days ago

To me it is clear why. Imagine someone told you "answer immediately, top of your head: what's the best seasoning?". You'd just blurt out whatever specific you associated with pleasing seasoning (and that would be a good answer). Now imagine someone said "answer immediately, off the top of your head, but without offending any culture, gender, without a cultural bias, and without being presumptuous of the listener's socio-economic status (and if you fail one of these, someone dies) what's the best seasoning?" Even without the way that is going to lead to all sorts of compromising and second guessing in the answer space, simply only a fraction now of your brain is left to associate about the question due to just holding all that other stuff in there.

tempaccount420 1124 days ago

It seems pretty logical to me. Fine-tuning to make it more polite is giving it questions and punishing for giving an actual answer.

ChatGTP 1124 days ago

Probably not unlike people then. If you tell the truth you’ll be more often than not punished for it if you’re not very careful.

ChatGTP 1124 days ago

Probably not unlike people then. If you tell the truth you’ll be more often than not punished for it if you’re not very careful.

I find capitalism idiotic and broken, but I’m rarely allowed to say it, even if many people secretly agree with me, it might mean I’m a “communist” :)

aatd86 1124 days ago

I was just asking myself that yesterday...

Reasoning has degraded. To the point it was sometimes weirdly losing context and hallucinating...

Like its brain got fatigued or something...

abnercoimbre 1124 days ago

Do we know if this is exclusive to ChatGPT? Is the API exempt from this issue?

jablongo 1124 days ago

Yes! I’ve noticed this too, it’s just slightly less sharp. It probably has to do with how much trouble they are having servicing all of the demand, so they have rolled out a scaled down version that requires less compute.

avereveard 1124 days ago

Nah. Most of the response are as an ai language model I can't, even if you ask for information you provided.

The API is where it's at. There are wrappers on it that create the same chat look and feel, that can run on vercel or other very low cost providers, some with simpler UI, some with more features,some replicating the UI exactly.

wafflemaker 1124 days ago

Can you name or maybe even link some of these wrappers?

avereveard 1124 days ago

https://github.com/cogentapps/chat-with-gpt and https://github.com/ShipBit/slickgpt come to mind.

smeagull 1124 days ago

As an AI language model I cannot subscribe to GPT-4.

jaimex2 1124 days ago

Ah, the old bait and switch.

soulofmischief 1124 days ago

From what? A free product? Do you know how much compute it takes to run a single request?

smashed 1124 days ago

I don't but I'd like to know.

I was under the impression that it was mostly GPU vram based but once the model is loaded, it could produce output quickly? I'm probably over-simplifying things...

soulofmischief 1124 days ago

gpt-3.5-turbo (default ChatGPT model) takes 8 A100s, ~$10k each. [0]

The latest gpt-3.5-turbo model generates very quickly and cheaply (in part to some recently-discoverd optimization techniques... older versions cost 10x more). While the required hardware to run GPT-4 is currently unknown, it generates considerably slower on average and its much higher cost points to a higher hardware cost.

And this is per request. It's bananas.

[0] https://www.servethehome.com/chatgpt-hardware-a-look-at-8x-n...

jaimex2 1120 days ago

I'm not arguing or complaining.

Just highlighting the tactic :)

tamrix 1124 days ago

It feels like they've scale back how much ram must be used for gpt3 to give more to gpt4 playing users.