Hacker News
by nefitty 1842 days ago
If you’re also curious how it does vs. GPT-3, here’s the zero-shot performance evaluation table: https://github.com/kingoflolz/mesh-transformer-jax#zero-shot...