Hacker News new | ask | show | jobs
by checker659 199 days ago
> The fundamentals actually haven't changed that much in the last 3 years

Even said fundamentals don't have much in the way to foundations. It's just brute forcing your way using a O(n^3) algorithm using a lot of data and compute.

2 comments

Brute force!? Language modeling is a factorial time and memory problem. Someone comes up with a successful method that’s quadratic in the input sequence length and you’re complaining…?
O(n^(~2.8)) because fast matrix mult?