Show HN: I ran a language model on a PS2

Y	Hacker News new \| ask \| show \| jobs

Show HN: I ran a language model on a PS2 (github.com)

46 points by xaskasdf 90 days ago

The Emotion Engine has 32 MB of RAM total, so the trick is streaming weights from CD-ROM one matrix at a time during the forward pass — only activations, KV cache and embeddings live in RAM. This means models bigger than the RAM can still run, they just read more from disc.

Had to build a custom quantized format (PSNT), hack endianness, write a tokenizer pipeline, and most of the PS2 SDK from scratch (releasing that separately). The model itself is also custom — a 10M param Llama-style architecture I trained specifically for this.

And it works. On real hardware.

7 comments

randkyp 87 days ago

Neat! While the physicality of having the CD spin while running inference is undeniably cool, I wonder if you could run larger models at higher speeds through the PS2 HDD accessory/Memory Card Micro SD adapter/the PS2's USB port.

I doubt the VUs can help with inference given their small scratchpad sizes and instruction set though, haha.

link

accrual 87 days ago

The PS2's USB port is limited to 1.1 speeds so unfortunately it's much slower than the CD interface. The phat models have an internal IDE port that is trivally converted to SATA though, and is plenty fast with an SSD!

link

LocalH 86 days ago

The network port is faster than the CDVD drive or any of those accessories with the exception of the HDD. The ethernet PHY links at 100Mbit, but the processors inside the PS2 are not really capable of pushing that speed, the best I ever saw when installing games over the network with a hyper-optimized IP stack (on the IOP, IIRC) was something like 5MiB/s.

The HDD is the fastest form of I/O one can use on the PS2. It might not even need to be modified - depending on how well it's coded, it may be possible to run this software via Open PS2 Loader, which will replace CDVDMAN with a custom version that will access USB/ETH/HDD (and as mentioned in sibling comments, USB on the PS2 is version 1.1 and is much slower than even the CDVD drive).

Both network and HDD will also greatly minimize the cost of seeking the CDVD, which may be an issue depending on how the CD is laid out. CD access is up to 24x, DVD-ROM access is at 4x. DVD is thus slightly faster, and can be further increased by pushing the used data to the edge of the disc via a dummy file (traditionally, developers and game modders used Sony's own CD/DVD Generator software to determine the order of files being added to the disc, thus allowing the boot files to come first, followed by the dummy file, then any data files that need the extra speed).

link

mghackerlady 87 days ago

I'm excited for the PS2 SDK. Currently there isn't a lot in that space that won't get you sued

link

pjmlp 87 days ago

Some of us have it legally via PS2Linux, naturally distribution isn't allowed.

link

mghackerlady 86 days ago

Right but that's just developing for an old version of linux running on weak hardware, you can't do any of the crazy stuff the EE is really capable of. Plus, development is still hard for newcomers. I wish RenderWare was open sourced when it died instead of bitrotting

link

pooparse 87 days ago

IIRC the EE had some interesting hardware with vector units. Were these of any use/benefit here?

link

keremimo 87 days ago

My goodness... Is nothing sacred anymore?

link

mememememememo 90 days ago

How many tok/hr?

link

Real_Egor 87 days ago

Now you must teach it how to play Multiplayer AoE:II!

link

SachitRafa 90 days ago

The CD-ROM streaming approach is the real insight here, keeping only activations and KV cache in RAM and streaming weights one matrix at a time sidesteps the 32MB constraint entirely. It's essentially the same trick modern edge inference does with flash storage, just on hardware from 2000. Curious about the latency profile, with CD-ROM read speeds around 1.6 MB/s on PS2, the 77MB SmolLM2 model being too slow makes sense, but how does the 10MB brandon-tiny feel in practice? Are you getting tokens per minute or more like tokens per several seconds? Also interested in the custom PSNT format decision, was the main motivation the PS2's MIPS alignment constraints, or was there something about the existing GGUF/llama.c formats that made them impractical to parse on the Emotion Engine?

link