Hacker News new | ask | show | jobs
by cztomsik 1247 days ago
I believe it's because you train it in GPT-mode and then only use RNN-mode for inference.