Hacker News new | ask | show | jobs
by minimaxir 2229 days ago
gpt-2-simple has gradient checkpointing; aitextgen does not (yet).