Hacker News new | ask | show | jobs
by baq 834 days ago
They’re trained to optimize guessing the next word. What they solve for to get this good at predicting the next word is an open question with answers hidden in plain sight in the weight blob.