Hacker News new | ask | show | jobs
by rrsp 999 days ago
'Most of the layers within each decoder block have names like gpt2_transformer_layer_3d'