by xanderlewis 815 days ago
Which breakthrough in the last two years are you referring to?
2 comments

If you had to reduce it to one thing, it's probably that language models are capable few-shot and zero-shot learners. In other words, by training a model simply to predict the next word on naturally occurring text, you end up with a tool you can use for generic tasks, roughly speaking.
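To make "few-shot" concrete, here is a rough sketch of how a task gets turned into plain text for a next-word predictor to continue. The `Input:`/`Output:` layout, the function name, and the translation examples are made up for illustration, and no model is actually called here.

```python
def build_few_shot_prompt(examples, query):
    """Format solved examples plus one unsolved query as a single text to continue.

    The "Input:/Output:" layout is an arbitrary, hypothetical convention.
    """
    blocks = [f"Input: {x}\nOutput: {y}" for x, y in examples]
    # The final block is left open, so continuing the text means answering the query.
    blocks.append(f"Input: {query}\nOutput:")
    return "\n\n".join(blocks)


# Few-shot: a couple of solved examples come before the query.
examples = [("cheese", "fromage"), ("house", "maison")]
few_shot = build_few_shot_prompt(examples, "dog")

# Zero-shot: no solved examples, only the query itself.
zero_shot = build_few_shot_prompt([], "dog")

print(few_shot)
```

The point is that nothing here is task-specific machinery: the "task" is just the start of a document, and the model's only job is to predict what comes next.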
It turns out a lot of tasks are predictable. Go figure.
the LLM scaling law