Y
Hacker News
new
|
ask
|
show
|
jobs
by
britmob
2229 days ago
Have I been using gpt-2-simple wrong..? I’ve been fine-tuning 355M on a 8GB 1080 for months..
1 comments
minimaxir
2229 days ago
gpt-2-simple has gradient checkpointing; aitextgen does not (yet).
link