Hacker News new | ask | show | jobs
by 1x_engineer 187 days ago
Brute force!? Language modeling is a factorial time and memory problem. Someone comes up with a successful method that’s quadratic in the input sequence length and you’re complaining…?