Hacker News new | ask | show | jobs
by mountainriver 1537 days ago
We have not come to that consensus and large language models display really interesting capabilities like few shot learning, which before we thought would require a widely different architecture