by britmob 2229 days ago
Have I been using gpt-2-simple wrong...? I've been fine-tuning 355M on an 8GB 1080 for months.
1 comment

gpt-2-simple has gradient checkpointing; aitextgen does not (yet).
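
For anyone wondering why that matters for memory: gradient checkpointing keeps only a few activations from the forward pass and recomputes the rest during the backward pass, trading extra compute for a smaller footprint. Here is a toy sketch of the idea on a chain of scalar tanh layers. This is not gpt-2-simple's actual code; the function names (`grad_full`, `grad_checkpointed`) and the `segment` parameter are made up for illustration.

```python
import math


def layer(w, x):
    # One toy "layer": a scalar weight followed by tanh.
    return math.tanh(w * x)


def layer_grad(w, x):
    # d/dx tanh(w * x) = w * (1 - tanh(w * x)^2)
    y = math.tanh(w * x)
    return w * (1.0 - y * y)


def grad_full(weights, x):
    """Plain backprop: store the input of every layer."""
    inputs = []
    for w in weights:
        inputs.append(x)
        x = layer(w, x)
    g = 1.0
    for w, xi in zip(reversed(weights), reversed(inputs)):
        g *= layer_grad(w, xi)
    return g, len(inputs)  # (d output / d input, activations held)


def grad_checkpointed(weights, x, segment):
    """Checkpointed backprop: store one input per segment, recompute the rest."""
    checkpoints = []
    for i, w in enumerate(weights):
        if i % segment == 0:
            checkpoints.append(x)  # keep only the first input of each segment
        x = layer(w, x)

    g = 1.0
    peak = len(checkpoints)
    for s in reversed(range(len(checkpoints))):
        seg_weights = weights[s * segment:(s + 1) * segment]
        # Recompute this segment's inputs from its checkpoint.
        xi = checkpoints[s]
        inputs = []
        for w in seg_weights:
            inputs.append(xi)
            xi = layer(w, xi)
        # Simple upper bound on what is held at once: checkpoints + one segment.
        peak = max(peak, len(checkpoints) + len(inputs))
        for w, xi in zip(reversed(seg_weights), reversed(inputs)):
            g *= layer_grad(w, xi)
    return g, peak


weights = [0.5 + 0.1 * i for i in range(16)]
g_full, held_full = grad_full(weights, 0.3)
g_ckpt, held_ckpt = grad_checkpointed(weights, 0.3, segment=4)
```

With 16 layers and `segment=4`, both versions produce the same gradient, but the checkpointed one holds at most 8 values at a time instead of 16. The cost is roughly one extra forward pass. In a real model the saving is in large activation tensors, which is presumably what lets 355M fit on an 8GB card.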