Hacker News new | ask | show | jobs
by Tiberium 1183 days ago
> and it crashed

I hope you understand that it's only because of inability of OpenAI's servers to keep up with demand or some issue in their backend code - language models themselves can't "crash" like normal programs on some kind of input, because they "just" generate new tokens.

2 comments

I know little more than what I've read about LLMs and other models. I just got it to do them again and it output them rather quickly: https://imgur.com/a/jfviCCe
are the numbers correct?
I spot checked an random assortment of them and so far yes, but it only takes one wrong one to set every subsequent number off.
> it only takes one wrong one to set every subsequent number off.

Except its not actually doing that calculation, so one wrong one shouldn't truly affect the rest like "Real" math.

> language models themselves can't "crash"

That is like saying the Excel document didn't crash, but Excel did when it tried to parse it. As far as I know there is no proof that you can't cause a LLM to crash with user input.

> because they "just" generate new tokens.

I can write a program that counts to 100 that crashes reliably.

> As far as I know there is no proof that you can't cause a LLM to crash with user input.

Well, then you simply don't understand how they work.

It's a fixed number of matrix multiplications. You can't make matrix multiplication crash by feeding it any sort of strange numbers (unless they haven't handled their Infs and NaNs).
> You can't make matrix multiplication crash by feeding it any sort of strange numbers

Matrices don't multiply themselves. You need hardware and software. As I pointed out the LLM is effectively just data that is being processed by a program. It is silly to assume you have no bugs in that software or the underlying operating system.

The type of bug the first comment was discussing, where recursion could cause it to crash (out of memory? stack overflow?) is categorically impossible based on how the model works, and a bug in the software running it would not change that.