Hacker News
by antonvs 216 days ago
They claim they don't use quantization.

The reason for their speed is this chip: https://www.cerebras.ai/chip