by mmmmmbop 1173 days ago
The combination of gradient descent and transformers is a weird choice to ask founders about. One is an almost trivial algorithm, whereas the other is a cutting-edge research result.
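
To make the "almost trivial" part concrete, here is a minimal sketch of plain gradient descent on a one-dimensional function. The function name, the learning rate, the step count, and the quadratic being minimized are all illustrative assumptions of mine, not anything taken from the question being discussed.

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0.

    grad  -- function returning the derivative at a point (hypothetical helper)
    lr    -- learning rate, an assumed value chosen for this toy example
    steps -- fixed number of iterations; no convergence check is performed
    """
    x = x0
    for _ in range(steps):
        x = x - lr * grad(x)
    return x


# Toy example: minimize f(x) = (x - 3)^2, whose derivative is 2 * (x - 3).
x_min = gradient_descent(lambda x: 2.0 * (x - 3.0), x0=0.0)
print(x_min)  # ends up very close to 3
```

The whole algorithm is the single update line inside the loop, which is roughly what "almost trivial" means here. A transformer has no comparably short description.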