Hacker News new | ask | show | jobs
by murkt 69 days ago
With 9M params it just repeats the joke from a training dataset.