Hacker News new | ask | show | jobs
by wongarsu 1201 days ago
RAM, running entirely on the CPU at around 1.7 seconds per token